Analyze Diet
BMC bioinformatics2009; 10 Suppl 11(Suppl 11); S8; doi: 10.1186/1471-2105-10-S11-S8

Structural and functional-annotation of an equine whole genome oligoarray.

Abstract: The horse genome is sequenced, allowing equine researchers to use high-throughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics. However, for researchers to derive value from these functional genomics datasets, they must be able to model this data in biologically relevant ways; to do so requires that the equine genome be more fully annotated. There are two interrelated types of genomic annotation: structural and functional. Structural annotation is delineating and demarcating the genomic elements (such as genes, promoters, and regulatory elements). Functional annotation is assigning function to structural elements. The Gene Ontology (GO) is the de facto standard for functional annotation, and is routinely used as a basis for modelling and hypothesis testing, large functional genomics datasets. Results: An Equine Whole Genome Oligonucleotide (EWGO) array with 21,351 elements was developed at Texas A&M University. This 70-mer oligoarray was designed using the approximately 7 x assembled and annotated sequence of the equine genome to be one of the most comprehensive arrays available for expressed equine sequences. To assist researchers in determining the biological meaning of data derived from this array, we have structurally annotated it by mapping the elements to multiple database accessions, including UniProtKB, Entrez Gene, NRPD (Non-Redundant Protein Database) and UniGene. We next provided GO functional annotations for the gene transcripts represented on this array. Overall, we GO annotated 14,531 gene products (68.1% of the gene products represented on the EWGO array) with 57,912 annotations. GAQ (GO Annotation Quality) scores were calculated for this array both before and after we added GO annotation. The additional annotations improved the meanGAQ score 16-fold. This data is publicly available at AgBase http://www.agbase.msstate.edu/. Conclusions: Providing additional information about the public databases which link to the gene products represented on the array allows users more flexibility when using gene expression modelling and hypothesis-testing computational tools. Moreover, since different databases provide different types of information, users have access to multiple data sources. In addition, our GO annotation underpins functional modelling for most gene expression analysis tools and enables equine researchers to model large lists of differentially expressed transcripts in biologically relevant ways.
Publication Date: 2009-10-08 PubMed ID: 19811692PubMed Central: PMC3226197DOI: 10.1186/1471-2105-10-S11-S8Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
  • Journal Article
  • Research Support
  • U.S. Gov't
  • Non-P.H.S.

Summary

This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.

The research article discusses the creation and annotation of an Equine Whole Genome Oligonucleotide (EWGO) array, which helps in understanding the equine genome and deriving value from genomics datasets. The researchers have structurally annotated the array, mapping elements to key databases, and have provided Gene Ontology (GO) functional annotations for gene transcripts represented on the array.

Creation of the Equine Whole Genome Oligonucleotide (EWGO) Array

  • The researchers at Texas A&M University, leveraging the sequenced equine genome, developed an EWGO array with 21,351 elements.
  • This 70-mer oligoarray was designed using the assembled and annotated sequence of the equine genome, representing one of the most comprehensive arrays for equine sequences.

Structural Annotation of the EWGO Array

  • The team performed structural annotation on the array, which is the process of delineating and demarcating the genomic elements such as genes, promoters, and regulatory elements.
  • The elements of the array were mapped to multiple database accessions including UniProtKB, Entrez Gene, NRPD (Non-Redundant Protein Database) and UniGene.

Functional Annotation of the EWGO Array and Results

  • Functional annotation, assigning function to structural elements, was carried out using the Gene Ontology (GO), a standard system for these annotations.
  • The researchers provided GO functional annotations for the gene transcripts represented on this array.
  • From the array’s total, 14,531 gene products (which constitute 68.1% of the gene products) were annotated with 57,912 annotations.
  • The quality of the annotations was evaluated using the GAQ (GO Annotation Quality) scores both before and after adding the GO annotation, showing a 16-fold improvement in the mean GAQ score after adding the annotations.

Conclusion and Implications

  • The additional information provided about the public databases, linked to the gene products on the array, offers users a greater flexibility when using computational tools for gene expression modelling and hypothesis testing.
  • By providing access to different types of information from different databases, it enhances the utility of the array for various research purposes.
  • The GO annotation aids functional modelling for most gene expression analysis tools, enabling researchers to model large lists of differentially expressed transcripts in biologically relevant ways.
  • The data derived from this research is publicly available, providing a valuable resource for equine researchers globally.

Cite This Article

APA
Bright LA, Burgess SC, Chowdhary B, Swiderski CE, McCarthy FM. (2009). Structural and functional-annotation of an equine whole genome oligoarray. BMC Bioinformatics, 10 Suppl 11(Suppl 11), S8. https://doi.org/10.1186/1471-2105-10-S11-S8

Publication

ISSN: 1471-2105
NlmUniqueID: 100965194
Country: England
Language: English
Volume: 10 Suppl 11
Issue: Suppl 11
Pages: S8

Researcher Affiliations

Bright, Lauren A
  • Department of Clinical Sciences, College of Veterinary Medicine, Mississippi State University, PO Box 6100, Mississippi State, MS, 39762, USA. lbright@cvm.msstate.edu.
Burgess, Shane C
    Chowdhary, Bhanu
      Swiderski, Cyprianna E
        McCarthy, Fiona M

          MeSH Terms

          • Animals
          • Databases, Genetic
          • Genome
          • Genomics / methods
          • Horses / genetics
          • Oligonucleotide Array Sequence Analysis / methods

          References

          This article includes 22 references
          1. Horse Genome Assembled. http://www.genome.gov/20519480
          2. Lewis SE. Gene Ontology: looking backwards and forwards.. Genome Biol 2005;6(1):103.
            doi: 10.1186/gb-2004-6-1-103pmc: PMC549054pubmed: 15642104google scholar: lookup
          3. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.. Nat Genet 2000 May;25(1):25-9.
            doi: 10.1038/75556pmc: PMC3037419pubmed: 10802651google scholar: lookup
          4. Hill DP, Smith B, McAndrews-Hill MS, Blake JA. Gene Ontology annotations: what they mean and where they come from.. BMC Bioinformatics 2008 Apr 29;9 Suppl 5(Suppl 5):S2.
            doi: 10.1186/1471-2105-9-S5-S2pmc: PMC2367625pubmed: 18460184google scholar: lookup
          5. Barrell D, Dimmer E, Huntley RP, Binns D, O'Donovan C, Apweiler R. The GOA database in 2009--an integrated Gene Ontology Annotation resource.. Nucleic Acids Res 2009 Jan;37(Database issue):D396-403.
            doi: 10.1093/nar/gkn803pmc: PMC2686469pubmed: 18957448google scholar: lookup
          6. Memili E, Peddinti D, Shack LA, Nanduri B, McCarthy F, Sagirkaya H, Burgess SC. Bovine germinal vesicle oocyte and cumulus cell proteomics.. Reproduction 2007 Jun;133(6):1107-20.
            pubmed: 17636165doi: 10.1530/rep-06-0149google scholar: lookup
          7. Finucane KA, McFadden TB, Bond JP, Kennelly JJ, Zhao FQ. Onset of lactation in the bovine mammary gland: gene expression profiling indicates a strong inhibition of gene expression in cell proliferation.. Funct Integr Genomics 2008 Aug;8(3):251-64.
            doi: 10.1007/s10142-008-0074-ypubmed: 18259788google scholar: lookup
          8. Tan SH, Reverter A, Wang Y, Byrne KA, McWilliam SM, Lehnert SA. Gene expression profiling of bovine in vitro adipogenesis using a cDNA microarray.. Funct Integr Genomics 2006 Jul;6(3):235-49.
            doi: 10.1007/s10142-005-0016-xpubmed: 16470362google scholar: lookup
          9. Diez-Tascón C, Keane OM, Wilson T, Zadissa A, Hyndman DL, Baird DB, McEwan JC, Crawford AM. Microarray analysis of selection lines from outbred populations to identify genes involved with nematode parasite resistance in sheep.. Physiol Genomics 2005 Mar 21;21(1):59-69.
          10. Jensen K, Paxton E, Waddington D, Talbot R, Darghouth MA, Glass EJ. Differences in the transcriptional responses induced by Theileria annulata infection in bovine monocytes derived from resistant and susceptible cattle breeds.. Int J Parasitol 2008 Mar;38(3-4):313-25.
            doi: 10.1016/j.ijpara.2007.08.007pubmed: 17949724google scholar: lookup
          11. McCarthy FM, Bridges SM, Burgess SC. GOing from functional genomics to biological significance.. Cytogenet Genome Res 2007;117(1-4):278-87.
            doi: 10.1159/000103189pubmed: 17675869google scholar: lookup
          12. Khatri P, Bhavsar P, Bawa G, Draghici S. Onto-Tools: an ensemble of web-accessible, ontology-based tools for the functional design and interpretation of high-throughput gene expression experiments.. Nucleic Acids Res 2004 Jul 1;32(Web Server issue):W449-56.
            doi: 10.1093/nar/gkh409pmc: PMC441547pubmed: 15215428google scholar: lookup
          13. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks.. Genome Res 2003 Nov;13(11):2498-504.
            doi: 10.1101/gr.1239303pmc: PMC403769pubmed: 14597658google scholar: lookup
          14. Chowdhary BP, Paria N, Raudsepp T. Potential applications of equine genomics in dissecting diseases and fertility.. Anim Reprod Sci 2008 Sep;107(3-4):208-18.
          15. Chowdhary BP, Raudsepp T. The horse genome derby: racing from map to whole genome sequence.. Chromosome Res 2008;16(1):109-27.
            doi: 10.1007/s10577-008-1204-zpubmed: 18274866google scholar: lookup
          16. Chen X, Wang L. Integrating biological knowledge with gene expression profiles for survival prediction of cancer.. J Comput Biol 2009 Feb;16(2):265-78.
            doi: 10.1089/cmb.2008.12TTpmc: PMC3198940pubmed: 19183004google scholar: lookup
          17. Hopwood B, Tsykin A, Findlay DM, Fazzalari NL. Microarray gene expression profiling of osteoarthritic bone suggests altered bone remodelling, WNT and transforming growth factor-beta/bone morphogenic protein signalling.. Arthritis Res Ther 2007;9(5):R100.
            doi: 10.1186/ar2301pmc: PMC2212557pubmed: 17900349google scholar: lookup
          18. van den Berg BH, Konieczka JH, McCarthy FM, Burgess SC. ArrayIDer: automated structural re-annotation pipeline for DNA microarrays.. BMC Bioinformatics 2009 Jan 23;10:30.
            doi: 10.1186/1471-2105-10-30pmc: PMC2636773pubmed: 19166590google scholar: lookup
          19. McCarthy FM, Bridges SM, Wang N, Magee GB, Williams WP, Luthe DS, Burgess SC. AgBase: a unified resource for functional analysis in agriculture.. Nucleic Acids Res 2007 Jan;35(Database issue):D599-603.
            doi: 10.1093/nar/gkl936pmc: PMC1751552pubmed: 17135208google scholar: lookup
          20. Kumar R, Jaiswal K, Nandrui B, McCarthy FM. Automated pipeline for adding Gene Ontology for non model species. BMC Bioinformatics 2009. in press.
          21. Buza TJ, Kumar R, Gresham CR, Burgess SC, McCarthy FM. Facilitating functional annotation of chicken microarray data.. BMC Bioinformatics 2009 Oct 8;10 Suppl 11(Suppl 11):S2.
            pmc: PMC3226191pubmed: 19811685doi: 10.1186/1471-2105-10-s11-s2google scholar: lookup
          22. Buza TJ, McCarthy FM, Wang N, Bridges SM, Burgess SC. Gene Ontology annotation quality analysis in model eukaryotes.. Nucleic Acids Res 2008 Feb;36(2):e12.
            doi: 10.1093/nar/gkm1167pmc: PMC2241866pubmed: 18187504google scholar: lookup

          Citations

          This article has been cited 9 times.
          1. Bright LA, Dittmar W, Nanduri B, McCarthy FM, Mujahid N, Costa LR, Burgess SC, Swiderski CE. Modeling the pasture-associated severe equine asthma bronchoalveolar lavage fluid proteome identifies molecular events mediating neutrophilic airway inflammation. Vet Med (Auckl) 2019;10:43-63.
            doi: 10.2147/VMRR.S194427pubmed: 31119093google scholar: lookup
          2. Kachroo P, Ivanov I, Seabury AG, Liu M, Chowdhary BP, Cohen ND. Age-related changes following in vitro stimulation with Rhodococcus equi of peripheral blood leukocytes from neonatal foals. PLoS One 2013;8(5):e62879.
            doi: 10.1371/journal.pone.0062879pubmed: 23690962google scholar: lookup
          3. Das PJ, McCarthy F, Vishnoi M, Paria N, Gresham C, Li G, Kachroo P, Sudderth AK, Teague S, Love CC, Varner DD, Chowdhary BP, Raudsepp T. Stallion sperm transcriptome comprises functionally coherent coding and regulatory RNAs as revealed by microarray analysis and RNA-seq. PLoS One 2013;8(2):e56535.
            doi: 10.1371/journal.pone.0056535pubmed: 23409192google scholar: lookup
          4. Pareek CS, Smoczynski R, Tretyn A. Sequencing technologies and genome sequencing. J Appl Genet 2011 Nov;52(4):413-35.
            doi: 10.1007/s13353-011-0057-xpubmed: 21698376google scholar: lookup
          5. McCarthy FM, Gresham CR, Buza TJ, Chouvarine P, Pillai LR, Kumar R, Ozkan S, Wang H, Manda P, Arick T, Bridges SM, Burgess SC. AgBase: supporting functional modeling in agricultural organisms. Nucleic Acids Res 2011 Jan;39(Database issue):D497-506.
            doi: 10.1093/nar/gkq1115pubmed: 21075795google scholar: lookup
          6. Wren JD, Kupfer DM, Perkins EJ, Bridges S, Berleant D. Proceedings of the 2010 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) conference. BMC Bioinformatics 2010 Oct 7;11 Suppl 6(Suppl 6):S1.
            doi: 10.1186/1471-2105-11-S6-S1pubmed: 20946592google scholar: lookup
          7. Brosnahan MM, Brooks SA, Antczak DF. Equine clinical genomics: A clinician's primer. Equine Vet J 2010 Oct;42(7):658-70.
          8. McGivney BA, McGettigan PA, Browne JA, Evans AC, Fonseca RG, Loftus BJ, Lohan A, MacHugh DE, Murphy BA, Katz LM, Hill EW. Characterization of the equine skeletal muscle transcriptome identifies novel functional responses to exercise training. BMC Genomics 2010 Jun 23;11:398.
            doi: 10.1186/1471-2164-11-398pubmed: 20573200google scholar: lookup
          9. Wren JD, Gusev Y, Isokpehi RD, Berleant D, Braga-Neto U, Wilkins D, Bridges S. Proceedings of the 2009 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) conference. Introduction. BMC Bioinformatics 2009 Oct 8;10 Suppl 11(Suppl 11):S1.
            doi: 10.1186/1471-2105-10-S11-S1pubmed: 19811674google scholar: lookup