This study used a method known as long-read isoform sequencing to fully map and understand the genetic sequences present in equine lung cells. Over 300,000 genetic sequences were examined, identifying both previously known genes and transcripts, as well as discovering novel genes and transcripts.
Research Methodology and Process
- The researchers employed Pacific Biosciences long-read isoform sequencing, a modern yet complex method that uses fluorochemistry and specialized sensors to obtain the complete sequences of transcripts, including all their variant forms or isoforms, from a given sample.
- This sequencing technology was used to examine bronchoalveolar lavage fluid (BALF) cells in horses. BALF is a technique used to obtain cells and other materials from the lower respiratory tract to diagnose lung diseases.
Data Collection and Analysis
- The researchers generated a dataset that included 313,563 HiFi reads. HiFi here refers to high-accuracy reads produced by the sequencing technology, which gives scientists a more accurate and reliable set of data to work with. These reads totaled to 805 megabytes of polished sequence information, indicating comprehensive data was gathered.
- Through their analysis they were able to identify 14,234 full-length transcript isoforms originating from 7,017 unique genes within the equine BALF cells.
Key Findings
- The identified genes consisted of both previously known and new genes. A total of 6,880 were already annotated, meaning they have been previously identified and cataloged in previous research, and 137 were novel genes, not identified or recorded in existing genetic databases.
- Apart from the genes, they also found 3,428 novel transcripts in addition to 10,806 previously identified transcripts.
- This includes transcripts that were not found in existing genome annotations, transcripts that were corresponding to potentially novel genes that aren’t yet annotated, and fusion transcripts, which incorporate components from multiple different genes.
Implications
- The study provides the scientific community with transcript-level data for equine BALF cells, expanding the existing body of knowledge on horse genetics and potentially pushing forward research in equine lung health and diseases.
- Unknown sequences and genes could pave the way for novel genetic disease marker discovery, breeding programs, and understanding the genetic basis of health and disease in horses.
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.