Analyze Diet
BMC genomics2017; 18(1); 103; doi: 10.1186/s12864-016-3451-2

Tissue resolved, gene structure refined equine transcriptome.

Abstract: Transcriptome interpretation relies on a good-quality reference transcriptome for accurate quantification of gene expression as well as functional analysis of genetic variants. The current annotation of the horse genome lacks the specificity and sensitivity necessary to assess gene expression especially at the isoform level, and suffers from insufficient annotation of untranslated regions (UTR) usage. We built an annotation pipeline for horse and used it to integrate 1.9 billion reads from multiple RNA-seq data sets into a new refined transcriptome. This equine transcriptome integrates eight different tissues from 59 individuals and improves gene structure and isoform resolution, while providing considerable tissue-specific information. We utilized four levels of transcript filtration in our pipeline, aimed at producing several transcriptome versions that are suitable for different downstream analyses. Our most refined transcriptome includes 36,876 genes and 76,125 isoforms, with 6474 candidate transcriptional loci novel to the equine transcriptome. We have employed a variety of descriptive statistics and figures that demonstrate the quality and content of the transcriptome. The equine transcriptomes that are provided by this pipeline show the best tissue-specific resolution of any equine transcriptome to date and are flexible for several downstream analyses. We encourage the integration of further equine transcriptomes with our annotation pipeline to continue and improve the equine transcriptome.
Publication Date: 2017-01-20 PubMed ID: 28107812PubMed Central: PMC5251313DOI: 10.1186/s12864-016-3451-2Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
  • Journal Article
  • Research Support
  • Non-U.S. Gov't
  • Research Support
  • N.I.H.
  • Extramural

Summary

This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.

The research article presents an enhanced methodology for the interpretation of the horse transcriptome, improving the resolution of gene structures and isoforms while providing tissue-specific details.

Understanding the Necessity of the Research

  • Accurate transcriptome interpretation is essential for assessing gene expression and performing functional analysis of genetic variants. It requires a high-quality reference transcriptome.
  • The existing horse genome annotation lacks the necessary specificity and sensitivity, especially for analyzing expression at the isoform level. The current methods also have insufficient annotation of untranslated regions (UTRs).
  • These shortcomings highlight the necessity for a more refined horse transcriptome annotation, which was the primary goal of this research.

Creation of a New Transcriptome Annotation Pipeline

  • The researchers developed a new pipeline specifically for horse transcriptome annotation that integrates multiple RNA-seq datasets. Approximately 1.9 billion reads were used to build the new refined transcriptome.
  • This enhanced equine transcriptome encompasses eight different tissues from 59 individuals, which enhances the specificity of the gene structure and isoform resolution, and also provides significant tissue-specific information.
  • The pipeline uses a four-level transcript filtration, designed to create several transcriptome versions useful for different downstream analyses. This versatility is an essential aspect for wider applications.

Results and Evaluation

  • The most refined transcriptome from the application of this pipeline includes 36,876 genes and 76,125 isoforms, with 6474 potential transcriptional loci that are new to the equine transcriptome.
  • Various descriptive statistics and figures were used to showcase the quality and content of the generated transcriptome to emphasize the improvements in the method.

Conclusion and Further Improvements

  • This study’s technique resulted in the best tissue-specific resolution of any equine transcriptomes created till date and demonstrated flexibility for several downstream analyses.
  • The researchers encourage the integration of more equine transcriptomes with their annotation pipeline to continue enhancing the quality of the equine transcriptome in the future.

Cite This Article

APA
Mansour TA, Scott EY, Finno CJ, Bellone RR, Mienaltowski MJ, Penedo MC, Ross PJ, Valberg SJ, Murray JD, Brown CT. (2017). Tissue resolved, gene structure refined equine transcriptome. BMC Genomics, 18(1), 103. https://doi.org/10.1186/s12864-016-3451-2

Publication

ISSN: 1471-2164
NlmUniqueID: 100965258
Country: England
Language: English
Volume: 18
Issue: 1
Pages: 103
PII: 103

Researcher Affiliations

Mansour, T A
  • Department of Population Health and Reproduction, University of California, Davis, Davis, USA.
  • Department of Clinical Pathology, College of Medicine, Mansoura University, Egypt, Mansoura, Egypt.
Scott, E Y
  • Department of Animal Science, University of California, Davis, Davis, USA.
Finno, C J
  • Department of Population Health and Reproduction, University of California, Davis, Davis, USA.
Bellone, R R
  • Department of Population Health and Reproduction, University of California, Davis, Davis, USA.
  • Veterinary Genetics Laboratory, University of California, Davis, Davis, USA.
Mienaltowski, M J
  • Department of Animal Science, University of California, Davis, Davis, USA.
Penedo, M C
  • Veterinary Genetics Laboratory, University of California, Davis, Davis, USA.
Ross, P J
  • Department of Animal Science, University of California, Davis, Davis, USA.
Valberg, S J
  • Large Animal Clinical Sciences, Michigan State University, College of Veterinary Medicine, East Lansing, USA.
Murray, J D
  • Department of Population Health and Reproduction, University of California, Davis, Davis, USA.
  • Department of Animal Science, University of California, Davis, Davis, USA.
Brown, C T
  • Department of Population Health and Reproduction, University of California, Davis, Davis, USA. ctbrown@ucdavis.edu.

MeSH Terms

  • Animals
  • Chromosome Mapping
  • Cluster Analysis
  • Computational Biology / methods
  • Gene Expression Profiling
  • Genome
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing
  • Horses
  • Molecular Sequence Annotation
  • Organ Specificity / genetics
  • RNA Isoforms
  • Transcriptome

Grant Funding

  • K01 OD015134 / NIH HHS
  • L40 TR001136 / NCATS NIH HHS

References

This article includes 35 references
  1. Brown JB, Boley N, Eisman R, May GE, Stoiber MH, Duff MO, Booth BW, Wen J, Park S, Suzuki AM, Wan KH, Yu C, Zhang D, Carlson JW, Cherbas L, Eads BD, Miller D, Mockaitis K, Roberts J, Davis CA, Frise E, Hammonds AS, Olson S, Shenker S, Sturgill D, Samsonova AA, Weiszmann R, Robinson G, Hernandez J, Andrews J, Bickel PJ, Carninci P, Cherbas P, Gingeras TR, Hoskins RA, Kaufman TC, Lai EC, Oliver B, Perrimon N, Graveley BR, Celniker SE. Diversity and dynamics of the Drosophila transcriptome.. Nature 2014 Aug 28;512(7515):393-9.
    doi: 10.1038/nature12962pmc: PMC4152413pubmed: 24670639google scholar: lookup
  2. Melé M, Ferreira PG, Reverter F, DeLuca DS, Monlong J, Sammeth M, Young TR, Goldmann JM, Pervouchine DD, Sullivan TJ, Johnson R, Segrè AV, Djebali S, Niarchou A, Wright FA, Lappalainen T, Calvo M, Getz G, Dermitzakis ET, Ardlie KG, Guigó R. Human genomics. The human transcriptome across tissues and individuals.. Science 2015 May 8;348(6235):660-5.
    doi: 10.1126/science.aaa0355pmc: PMC4547472pubmed: 25954002google scholar: lookup
  3. Okazaki Y, Furuno M, Kasukawa T, Adachi J, Bono H, Kondo S, Nikaido I, Osato N, Saito R, Suzuki H, Yamanaka I, Kiyosawa H, Yagi K, Tomaru Y, Hasegawa Y, Nogami A, Schönbach C, Gojobori T, Baldarelli R, Hill DP, Bult C, Hume DA, Quackenbush J, Schriml LM, Kanapin A, Matsuda H, Batalov S, Beisel KW, Blake JA, Bradt D, Brusic V, Chothia C, Corbani LE, Cousins S, Dalla E, Dragani TA, Fletcher CF, Forrest A, Frazer KS, Gaasterland T, Gariboldi M, Gissi C, Godzik A, Gough J, Grimmond S, Gustincich S, Hirokawa N, Jackson IJ, Jarvis ED, Kanai A, Kawaji H, Kawasawa Y, Kedzierski RM, King BL, Konagaya A, Kurochkin IV, Lee Y, Lenhard B, Lyons PA, Maglott DR, Maltais L, Marchionni L, McKenzie L, Miki H, Nagashima T, Numata K, Okido T, Pavan WJ, Pertea G, Pesole G, Petrovsky N, Pillai R, Pontius JU, Qi D, Ramachandran S, Ravasi T, Reed JC, Reed DJ, Reid J, Ring BZ, Ringwald M, Sandelin A, Schneider C, Semple CA, Setou M, Shimada K, Sultana R, Takenaka Y, Taylor MS, Teasdale RD, Tomita M, Verardo R, Wagner L, Wahlestedt C, Wang Y, Watanabe Y, Wells C, Wilming LG, Wynshaw-Boris A, Yanagisawa M, Yang I, Yang L, Yuan Z, Zavolan M, Zhu Y, Zimmer A, Carninci P, Hayatsu N, Hirozane-Kishikawa T, Konno H, Nakamura M, Sakazume N, Sato K, Shiraki T, Waki K, Kawai J, Aizawa K, Arakawa T, Fukuda S, Hara A, Hashizume W, Imotani K, Ishii Y, Itoh M, Kagawa I, Miyazaki A, Sakai K, Sasaki D, Shibata K, Shinagawa A, Yasunishi A, Yoshino M, Waterston R, Lander ES, Rogers J, Birney E, Hayashizaki Y. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs.. Nature 2002 Dec 5;420(6915):563-73.
    doi: 10.1038/nature01266pubmed: 12466851google scholar: lookup
  4. Holl HM, Gao S, Fei Z, Andrews C, Brooks SA. Generation of a de novo transcriptome from equine lamellar tissue.. BMC Genomics 2015 Oct 3;16:739.
    doi: 10.1186/s12864-015-1948-8pmc: PMC4592545pubmed: 26432030google scholar: lookup
  5. Pacholewska A, Drögemüller M, Klukowska-Rötzler J, Lanz S, Hamza E, Dermitzakis ET, Marti E, Gerber V, Leeb T, Jagannathan V. The transcriptome of equine peripheral blood mononuclear cells.. PLoS One 2015;10(3):e0122011.
  6. Coleman SJ, Zeng Z, Wang K, Luo S, Khrebtukova I, Mienaltowski MJ, Schroth GP, Liu J, MacLeod JN. Structural annotation of equine protein-coding genes determined by mRNA sequencing.. Anim Genet 2010 Dec;41 Suppl 2:121-30.
  7. Hestand MS, Kalbfleisch TS, Coleman SJ, Zeng Z, Liu J, Orlando L, MacLeod JN. Annotation of the Protein Coding Regions of the Equine Genome.. PLoS One 2015;10(6):e0124375.
  8. Yandell M, Ence D. A beginner's guide to eukaryotic genome annotation.. Nat Rev Genet 2012 Apr 18;13(5):329-42.
    doi: 10.1038/nrg3174pubmed: 22510764google scholar: lookup
  9. Curwen V, Eyras E, Andrews TD, Clarke L, Mongin E, Searle SM, Clamp M. The Ensembl automatic gene annotation system.. Genome Res 2004 May;14(5):942-50.
    doi: 10.1101/gr.1858004pmc: PMC479124pubmed: 15123590google scholar: lookup
  10. Kitts P. The NCBI handbook: chapter 14. Genome assembly and annotation process.. 2003 ed..
  11. Taanman JW. The mitochondrial genome: structure, transcription, translation and replication.. Biochim Biophys Acta 1999 Feb 9;1410(2):103-23.
    doi: 10.1016/S0005-2728(98)00161-3pubmed: 10076021google scholar: lookup
  12. Plotz G, Casper M, Raedle J, Hinrichsen I, Heckel V, Brieger A, Trojan J, Zeuzem S. MUTYH gene expression and alternative splicing in controls and polyposis patients.. Hum Mutat 2012 Jul;33(7):1067-74.
    doi: 10.1002/humu.22059pubmed: 22473953google scholar: lookup
  13. Finno CJ, Bordbari MH, Valberg SJ, Lee D, Herron J, Hines K, Monsour T, Scott E, Bannasch DL, Mickelson J, Xu L. Transcriptome profiling of equine vitamin E deficient neuroaxonal dystrophy identifies upregulation of liver X receptor target genes.. Free Radic Biol Med 2016 Dec;101:261-271.
  14. Burset M, Guigó R. Evaluation of gene structure prediction programs.. Genomics 1996 Jun 15;34(3):353-67.
    doi: 10.1006/geno.1996.0298pubmed: 8786136google scholar: lookup
  15. Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L. Differential analysis of gene regulation at transcript resolution with RNA-seq.. Nat Biotechnol 2013 Jan;31(1):46-53.
    doi: 10.1038/nbt.2450pmc: PMC3869392pubmed: 23222703google scholar: lookup
  16. Bertone P, Stolc V, Royce TE, Rozowsky JS, Urban AE, Zhu X, Rinn JL, Tongprasit W, Samanta M, Weissman S, Gerstein M, Snyder M. Global identification of human transcribed sequences with genome tiling arrays.. Science 2004 Dec 24;306(5705):2242-6.
    doi: 10.1126/science.1103388pubmed: 15539566google scholar: lookup
  17. Khaitovich P, Kelso J, Franz H, Visagie J, Giger T, Joerchel S, Petzold E, Green RE, Lachmann M, Pääbo S. Functionality of intergenic transcription: an evolutionary comparison.. PLoS Genet 2006 Oct 13;2(10):e171.
  18. Schadt EE, Edwards SW, GuhaThakurta D, Holder D, Ying L, Svetnik V, Leonardson A, Hart KW, Russell A, Li G, Cavet G, Castle J, McDonagh P, Kan Z, Chen R, Kasarskis A, Margarint M, Caceres RM, Johnson JM, Armour CD, Garrett-Engele PW, Tsinoremas NF, Shoemaker DD. A comprehensive transcript index of the human genome generated using microarrays and computational approaches.. Genome Biol 2004;5(10):R73.
    doi: 10.1186/gb-2004-5-10-r73pmc: PMC545593pubmed: 15461792google scholar: lookup
  19. Sultan M, Amstislavskiy V, Risch T, Schuette M, Dökel S, Ralser M, Balzereit D, Lehrach H, Yaspo ML. Influence of RNA extraction methods and library selection schemes on RNA-seq data.. BMC Genomics 2014 Aug 11;15(1):675.
    doi: 10.1186/1471-2164-15-675pmc: PMC4148917pubmed: 25113896google scholar: lookup
  20. Smibert P, Miura P, Westholm JO, Shenker S, May G, Duff MO, Zhang D, Eads BD, Carlson J, Brown JB, Eisman RC, Andrews J, Kaufman T, Cherbas P, Celniker SE, Graveley BR, Lai EC. Global patterns of tissue-specific alternative polyadenylation in Drosophila.. Cell Rep 2012 Mar 29;1(3):277-89.
  21. Mangone M, Manoharan AP, Thierry-Mieg D, Thierry-Mieg J, Han T, Mackowiak SD, Mis E, Zegar C, Gutwein MR, Khivansara V, Attie O, Chen K, Salehi-Ashtiani K, Vidal M, Harkins TT, Bouffard P, Suzuki Y, Sugano S, Kohara Y, Rajewsky N, Piano F, Gunsalus KC, Kim JK. The landscape of C. elegans 3'UTRs.. Science 2010 Jul 23;329(5990):432-5.
    doi: 10.1126/science.1191244pmc: PMC3142571pubmed: 20522740google scholar: lookup
  22. Ulitsky I, Shkumatava A, Jan CH, Subtelny AO, Koppstein D, Bell GW, Sive H, Bartel DP. Extensive alternative polyadenylation during zebrafish development.. Genome Res 2012 Oct;22(10):2054-66.
    doi: 10.1101/gr.139733.112pmc: PMC3460199pubmed: 22722342google scholar: lookup
  23. Scott EY, Penedo MCT, Murray JD, Finno CJ. Defining Trends in Global Gene Expression in Arabian Horses with Cerebellar Abiotrophy.. Cerebellum 2017 Apr;16(2):462-472.
    pmc: PMC5336519pubmed: 27709457doi: 10.1007/s12311-016-0823-8google scholar: lookup
  24. Farkas MH, Grant GR, White JA, Sousa ME, Consugar MB, Pierce EA. Transcriptome analyses of the human retina identify unprecedented transcript diversity and 3.5 Mb of novel transcribed sequence via significant alternative splicing and novel genes.. BMC Genomics 2013 Jul 18;14:486.
    doi: 10.1186/1471-2164-14-486pmc: PMC3924432pubmed: 23865674google scholar: lookup
  25. Bouckenheimer J, Assou S, Riquier S, Hou C, Philippe N, Sansac C, Lavabre-Bertrand T, Commes T, Lemaître JM, Boureux A, De Vos J. Long non-coding RNAs in human early embryonic development and their potential in ART.. Hum Reprod Update 2016 Dec;23(1):19-40.
    doi: 10.1093/humupd/dmw035pubmed: 27655590google scholar: lookup
  26. Pauli A, Rinn JL, Schier AF. Non-coding RNAs as regulators of embryogenesis.. Nat Rev Genet 2011 Feb;12(2):136-49.
    doi: 10.1038/nrg2904pmc: PMC4081495pubmed: 21245830google scholar: lookup
  27. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data.. Bioinformatics 2014 Aug 1;30(15):2114-20.
  28. Macmanes MD. On the optimal trimming of high-throughput mRNA sequence data.. Front Genet 2014;5:13.
    doi: 10.3389/fgene.2014.00013pmc: PMC3908319pubmed: 24567737google scholar: lookup
  29. Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.. Genome Biol 2013 Apr 25;14(4):R36.
    doi: 10.1186/gb-2013-14-4-r36pmc: PMC4053844pubmed: 23618408google scholar: lookup
  30. Brown C, Howe A, Zhang Q, Pyrkosz AB, Brom TH. A Reference-Free Algorithm for Computational Normalization of Shotgun Sequencing Data.. 2013.
  31. Rob Patro GD, Carl Kingsford. Accurate, fast, and model-aware transcript expression quanitification with Salmon.. bioRxiv 2015.
  32. Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M, MacManes MD, Ott M, Orvis J, Pochet N, Strozzi F, Weeks N, Westerman R, William T, Dewey CN, Henschel R, LeDuc RD, Friedman N, Regev A. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.. Nat Protoc 2013 Aug;8(8):1494-512.
    doi: 10.1038/nprot.2013.084pmc: PMC3875132pubmed: 23845962google scholar: lookup
  33. Mi H, Muruganujan A, Casagrande JT, Thomas PD. Large-scale gene function analysis with the PANTHER classification system.. Nat Protoc 2013 Aug;8(8):1551-66.
    doi: 10.1038/nprot.2013.092pmc: PMC6519453pubmed: 23868073google scholar: lookup
  34. Kent WJ, Zweig AS, Barber G, Hinrichs AS, Karolchik D. BigWig and BigBed: enabling browsing of large distributed datasets.. Bioinformatics 2010 Sep 1;26(17):2204-7.
  35. Raney BJ, Dreszer TR, Barber GP, Clawson H, Fujita PA, Wang T, Nguyen N, Paten B, Zweig AS, Karolchik D, Kent WJ. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser.. Bioinformatics 2014 Apr 1;30(7):1003-5.

Citations

This article has been cited 14 times.