Analyze Diet
PloS one2013; 8(7); e70125; doi: 10.1371/journal.pone.0070125

Analysis of unannotated equine transcripts identified by mRNA sequencing.

Abstract: Sequencing of equine mRNA (RNA-seq) identified 428 putative transcripts which do not map to any previously annotated or predicted horse genes. Most of these encode the equine homologs of known protein-coding genes described in other species, yet the potential exists to identify novel and perhaps equine-specific gene structures. A set of 36 transcripts were prioritized for further study by filtering for levels of expression (depth of RNA-seq read coverage), distance from annotated features in the equine genome, the number of putative exons, and patterns of gene expression between tissues. From these, four were selected for further investigation based on predicted open reading frames of greater than or equal to 50 amino acids and lack of detectable homology to known genes across species. Sanger sequencing of RT-PCR amplicons from additional equine samples confirmed expression and structural annotation of each transcript. Functional predictions were made by conserved domain searches. A single transcript, expressed in the cerebellum, contains a putative kruppel-associated box (KRAB) domain, suggesting a potential function associated with zinc finger proteins and transcriptional regulation. Overall levels of conserved synteny and sequence conservation across a 1MB region surrounding each transcript were approximately 73% compared to the human, canine, and bovine genomes; however, the four loci display some areas of low conservation and sequence inversion in regions that immediately flank these previously unannotated equine transcripts. Taken together, the evidence suggests that these four transcripts are likely to be equine-specific.
Publication Date: 2013-07-29 PubMed ID: 23922931PubMed Central: PMC3726457DOI: 10.1371/journal.pone.0070125Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
  • Journal Article
  • Research Support
  • N.I.H.
  • Extramural
  • Research Support
  • Non-U.S. Gov't

Summary

This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.

The research identified 428 potentially new horse gene transcripts based on the sequencing of equine mRNA. The researchers prioritized a select group of these potential genes for further investigation, based primarily on their expression levels and sequence structure. After further investigation, the team suspects that these four transcripts might be unique to horses.

Objective of the Research

  • The aim of this study was to analyze unannotated equine transcripts that were identified through mRNA sequencing. 428 such potential transcripts were identified, and out of these, four were selected for further investigation based on their predicted open reading frames and lack of detectable homology to known genes. The goal was to broaden the understanding of equine genomic uniqueness and potentially discover equine-specific genes.

Methodology

  • Through mRNA sequencing, researchers identified 428 putative transcripts that don’t match any previously known or predicted horse genes. These putative transcripts could possibly be encoding for equine homologs of known protein-coding genes from other species.
  • A set of 36 of these transcripts were prioritized for further research. The selection criteria for these 36 was based on their levels of expression, distance from previously annotated features in the equine genome, the number of putative exons, and differential gene expression across tissues.
  • Out of the prioritized 36, four transcripts were chosen for deeper investigation as they had predicted open reading frames of greater than or equal to 50 amino acids and showed no detectable similarity to known genes across species.
  • Sanger sequencing was performed on RT-PCR amplicons from additional horse samples to confirm the expression and structural annotation of each of the four transcripts.

Findings

  • One of the analyzed transcripts, expressed in the cerebellum, was found to contain a putative kruppel-associated box (KRAB) domain, suggesting a potential function related to zinc finger proteins and transcriptional regulation.
  • Upon comparing the surrounding 1MB region of each transcript with the human, canine, and bovine genomes, approximately 73% of conserved synteny and sequence conservation was found. Despite the high genetic similarity, the researchers did note areas of low conservation and sequence inversion in regions directly surrounding these previously unannotated equine transcripts.
  • The results indicate these four transcripts are likely to be equine-specific as a result of the sequence inversion and regions of low conservation around their sequences.

Cite This Article

APA
Coleman SJ, Zeng Z, Hestand MS, Liu J, Macleod JN. (2013). Analysis of unannotated equine transcripts identified by mRNA sequencing. PLoS One, 8(7), e70125. https://doi.org/10.1371/journal.pone.0070125

Publication

ISSN: 1932-6203
NlmUniqueID: 101285081
Country: United States
Language: English
Volume: 8
Issue: 7
Pages: e70125
PII: e70125

Researcher Affiliations

Coleman, Stephen J
  • Maxwell H. Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Lexington, Kentucky, United States of America.
Zeng, Zheng
    Hestand, Matthew S
      Liu, Jinze
        Macleod, James N

          MeSH Terms

          • Animals
          • Cattle
          • Computational Biology / methods
          • Dogs
          • Evolution, Molecular
          • Gene Expression Profiling
          • Horses
          • Humans
          • Molecular Sequence Annotation
          • RNA, Messenger / chemistry
          • RNA, Messenger / genetics
          • Reproducibility of Results
          • Sequence Analysis, RNA

          Grant Funding

          • P20 RR016481 / NCRR NIH HHS
          • R01 HG006272 / NHGRI NIH HHS
          • KY-INBRE P20 RR16481 / NCRR NIH HHS
          • P20 RR16481 / NCRR NIH HHS

          Conflict of Interest Statement

          The authors have declared that no competing interests exist.

          References

          This article includes 24 references
          1. Coleman SJ, Zeng Z, Wang K, Luo S, Khrebtukova I, Mienaltowski MJ, Schroth GP, Liu J, MacLeod JN. Structural annotation of equine protein-coding genes determined by mRNA sequencing.. Anim Genet 2010 Dec;41 Suppl 2:121-30.
          2. Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, Fitzgerald S, Gil L, Gordon L, Hendrix M, Hourlier T, Johnson N, Kähäri AK, Keefe D, Keenan S, Kinsella R, Komorowska M, Koscielny G, Kulesha E, Larsson P, Longden I, McLaren W, Muffato M, Overduin B, Pignatelli M, Pritchard B, Riat HS, Ritchie GR, Ruffier M, Schuster M, Sobral D, Tang YA, Taylor K, Trevanion S, Vandrovcova J, White S, Wilson M, Wilder SP, Aken BL, Birney E, Cunningham F, Dunham I, Durbin R, Fernández-Suarez XM, Harrow J, Herrero J, Hubbard TJ, Parker A, Proctor G, Spudich G, Vogel J, Yates A, Zadissa A, Searle SM. Ensembl 2012.. Nucleic Acids Res 2012 Jan;40(Database issue):D84-90.
            pmc: PMC3245178pubmed: 22086963doi: 10.1093/nar/gkr991google scholar: lookup
          3. Zhang Z, Schwartz S, Wagner L, Miller W. A greedy algorithm for aligning DNA sequences.. J Comput Biol 2000 Feb-Apr;7(1-2):203-14.
            pubmed: 10890397doi: 10.1089/10665270050081478google scholar: lookup
          4. Kadota K, Nishimura S, Bono H, Nakamura S, Hayashizaki Y, Okazaki Y, Takahashi K. Detection of genes with tissue-specific expression patterns using Akaike's information criterion procedure.. Physiol Genomics 2003 Feb 6;12(3):251-9.
          5. Radwan A, Younis A, Luykx P, Khuri S. Prediction and analysis of nucleosome exclusion regions in the human genome.. BMC Genomics 2008 Apr 22;9:186.
            pmc: PMC2386137pubmed: 18430246doi: 10.1186/1471-2164-9-186google scholar: lookup
          6. Artimo P, Jonnalagedda M, Arnold K, Baratin D, Csardi G, de Castro E, Duvaud S, Flegel V, Fortier A, Gasteiger E, Grosdidier A, Hernandez C, Ioannidis V, Kuznetsov D, Liechti R, Moretti S, Mostaguir K, Redaschi N, Rossier G, Xenarios I, Stockinger H. ExPASy: SIB bioinformatics resource portal.. Nucleic Acids Res 2012 Jul;40(Web Server issue):W597-603.
            pmc: PMC3394269pubmed: 22661580doi: 10.1093/nar/gks400google scholar: lookup
          7. Zhang J. Protein-length distributions for the three domains of life.. Trends Genet 2000 Mar;16(3):107-9.
            pubmed: 10689349doi: 10.1016/s0168-9525(99)01922-8google scholar: lookup
          8. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.. Nucleic Acids Res 1997 Sep 1;25(17):3389-402.
            pmc: PMC146917pubmed: 9254694doi: 10.1093/nar/25.17.3389google scholar: lookup
          9. Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, Lear TL, Adelson DL, Bailey E, Bellone RR, Blöcker H, Distl O, Edgar RC, Garber M, Leeb T, Mauceli E, MacLeod JN, Penedo MC, Raison JM, Sharpe T, Vogel J, Andersson L, Antczak DF, Biagi T, Binns MM, Chowdhary BP, Coleman SJ, Della Valle G, Fryc S, Guérin G, Hasegawa T, Hill EW, Jurka J, Kiialainen A, Lindgren G, Liu J, Magnani E, Mickelson JR, Murray J, Nergadze SG, Onofrio R, Pedroni S, Piras MF, Raudsepp T, Rocchi M, Røed KH, Ryder OA, Searle S, Skow L, Swinburne JE, Syvänen AC, Tozaki T, Valberg SJ, Vaudin M, White JR, Zody MC, Lander ES, Lindblad-Toh K. Genome sequence, comparative analysis, and population genetics of the domestic horse.. Science 2009 Nov 6;326(5954):865-7.
            pmc: PMC3785132pubmed: 19892987doi: 10.1126/science.1178158google scholar: lookup
          10. Kent WJ. BLAT--the BLAST-like alignment tool.. Genome Res 2002 Apr;12(4):656-64.
            pmc: PMC187518pubmed: 11932250doi: 10.1101/gr.229202google scholar: lookup
          11. Kuhn RM, Haussler D, Kent WJ. The UCSC genome browser and associated tools.. Brief Bioinform 2013 Mar;14(2):144-61.
            doi: 10.1093/bib/bbs038pmc: PMC3603215pubmed: 22908213google scholar: lookup
          12. Rozen S, Skaletsky H. Primer3 on the WWW for general users and for biologist programmers.. Methods Mol Biol 2000;132:365-86.
            pubmed: 10547847doi: 10.1385/1-59259-192-2:365google scholar: lookup
          13. Vilella AJ, Severin J, Ureta-Vidal A, Heng L, Durbin R, Birney E. EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.. Genome Res 2009 Feb;19(2):327-35.
            pmc: PMC2652215pubmed: 19029536doi: 10.1101/gr.073585.107google scholar: lookup
          14. Kasprzyk A. BioMart: driving a paradigm change in biological data management.. Database (Oxford) 2011;2011:bar049.
            pmc: PMC3215098pubmed: 22083790doi: 10.1093/database/bar049google scholar: lookup
          15. Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, Miller W. PipMaker--a web server for aligning two genomic DNA sequences.. Genome Res 2000 Apr;10(4):577-86.
            pmc: PMC310868pubmed: 10779500doi: 10.1101/gr.10.4.577google scholar: lookup
          16. Marchler-Bauer A, Bryant SH. CD-Search: protein domain annotations on the fly.. Nucleic Acids Res 2004 Jul 1;32(Web Server issue):W327-31.
            pmc: PMC441592pubmed: 15215404doi: 10.1093/nar/gkh454google scholar: lookup
          17. Marchler-Bauer A, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR, Gwadz M, He S, Hurwitz DI, Jackson JD, Ke Z, Lanczycki CJ, Liebert CA, Liu C, Lu F, Lu S, Marchler GH, Mullokandov M, Song JS, Tasneem A, Thanki N, Yamashita RA, Zhang D, Zhang N, Bryant SH. CDD: specific functional annotation with the Conserved Domain Database.. Nucleic Acids Res 2009 Jan;37(Database issue):D205-10.
            pmc: PMC2686570pubmed: 18984618doi: 10.1093/nar/gkn845google scholar: lookup
          18. Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Jackson JD, Ke Z, Lanczycki CJ, Lu F, Marchler GH, Mullokandov M, Omelchenko MV, Robertson CL, Song JS, Thanki N, Yamashita RA, Zhang D, Zhang N, Zheng C, Bryant SH. CDD: a Conserved Domain Database for the functional annotation of proteins.. Nucleic Acids Res 2011 Jan;39(Database issue):D225-9.
            pmc: PMC3013737pubmed: 21109532doi: 10.1093/nar/gkq1189google scholar: lookup
          19. Emanuelsson O, Brunak S, von Heijne G, Nielsen H. Locating proteins in the cell using TargetP, SignalP and related tools.. Nat Protoc 2007;2(4):953-71.
            pubmed: 17446895doi: 10.1038/nprot.2007.131google scholar: lookup
          20. Wang K, Singh D, Zeng Z, Coleman SJ, Huang Y, Savich GL, He X, Mieczkowski P, Grimm SA, Perou CM, MacLeod JN, Chiang DY, Prins JF, Liu J. MapSplice: accurate mapping of RNA-seq reads for splice junction discovery.. Nucleic Acids Res 2010 Oct;38(18):e178.
            pmc: PMC2952873pubmed: 20802226doi: 10.1093/nar/gkq622google scholar: lookup
          21. Collins T, Stone JR, Williams AJ. All in the family: the BTB/POZ, KRAB, and SCAN domains.. Mol Cell Biol 2001 Jun;21(11):3609-15.
          22. Looman C, Abrink M, Mark C, Hellman L. KRAB zinc finger proteins: an analysis of the molecular mechanisms governing their increase in numbers and complexity during evolution.. Mol Biol Evol 2002 Dec;19(12):2118-30.
          23. Lorenz P, Dietmann S, Wilhelm T, Koczan D, Autran S, Gad S, Wen G, Ding G, Li Y, Rousseau-Merck MF, Thiesen HJ. The ancient mammalian KRAB zinc finger gene cluster on human chromosome 8q24.3 illustrates principles of C2H2 zinc finger evolution associated with unique expression profiles in human tissues.. BMC Genomics 2010 Mar 26;11:206.
            pmc: PMC2865497pubmed: 20346131doi: 10.1186/1471-2164-11-206google scholar: lookup
          24. Bellefroid EJ, Marine JC, Ried T, Lecocq PJ, Rivière M, Amemiya C, Poncelet DA, Coulie PG, de Jong P, Szpirer C. Clustered organization of homologous KRAB zinc-finger genes with enhanced expression in human T lymphoid cells.. EMBO J 1993 Apr;12(4):1363-74.

          Citations

          This article has been cited 8 times.
          1. Premont A, Saadeh K, Edling C, Lewis R, Marr CM, Jeevaratnam K. Cardiac ion channel expression in the equine model - In-silico prediction utilising RNA sequencing data from mixed tissue samples.. Physiol Rep 2022 Jul;10(14):e15273.
            doi: 10.14814/phy2.15273pubmed: 35880716google scholar: lookup
          2. Nalamalapu RR, Yue M, Stone AR, Murphy S, Saha MS. The tweety Gene Family: From Embryo to Disease.. Front Mol Neurosci 2021;14:672511.
            doi: 10.3389/fnmol.2021.672511pubmed: 34262434google scholar: lookup
          3. Fang J, Zhang D, Cao JW, Zhang L, Liu CX, Xing YP, Wang F, Xu HY, Wang SC, Ling Y, Wang W, Zhang YR, Zhou HM. Pathways involved in pony body size development.. BMC Genomics 2021 Jan 18;22(1):58.
            doi: 10.1186/s12864-020-07323-1pubmed: 33461495google scholar: lookup
          4. Raudsepp T, Finno CJ, Bellone RR, Petersen JL. Ten years of the horse reference genome: insights into equine biology, domestication and population dynamics in the post-genome era.. Anim Genet 2019 Dec;50(6):569-597.
            doi: 10.1111/age.12857pubmed: 31568563google scholar: lookup
          5. Stefaniuk M, Ropka-Molik K. RNA sequencing as a powerful tool in searching for genes influencing health and performance traits of horses.. J Appl Genet 2016 May;57(2):199-206.
            doi: 10.1007/s13353-015-0320-7pubmed: 26446669google scholar: lookup
          6. Holl HM, Gao S, Fei Z, Andrews C, Brooks SA. Generation of a de novo transcriptome from equine lamellar tissue.. BMC Genomics 2015 Oct 3;16:739.
            doi: 10.1186/s12864-015-1948-8pubmed: 26432030google scholar: lookup
          7. Rebolledo-Mendez J, Hestand MS, Coleman SJ, Zeng Z, Orlando L, MacLeod JN, Kalbfleisch T. Comparison of the Equine Reference Sequence with Its Sanger Source Data and New Illumina Reads.. PLoS One 2015;10(6):e0126852.
            doi: 10.1371/journal.pone.0126852pubmed: 26107638google scholar: lookup
          8. Pacholewska A, Drögemüller M, Klukowska-Rötzler J, Lanz S, Hamza E, Dermitzakis ET, Marti E, Gerber V, Leeb T, Jagannathan V. The transcriptome of equine peripheral blood mononuclear cells.. PLoS One 2015;10(3):e0122011.
            doi: 10.1371/journal.pone.0122011pubmed: 25790166google scholar: lookup