Whole genome sequence and analysis of the Marwari horse breed and its genetic origin.
- Journal Article
- Research Support
- Non-U.S. Gov't
Summary
This research article focuses on analyzing the genome of the Marwari horse breed, a unique breed originating from crossbreeding of local Indian ponies with Arabian horses. Genetic components from both Arabian and Mongolian breeds were identified in the Marwari genome. The study also identified variants related to olfactory functions and suggested a potential genetic variant linked to the breed’s signature inward-turning ear tip shape.
Characterizing the Marwari Horse Genome
The scientists in this study sought to examine the genome of the Marwari horse breed known for its distinctive inward-turned ear tips. This was done by generating about 101 Gb of whole genome sequences from a Marwari horse using the Illumina HiSeq2000 sequencer. They were able to:
- Map the sequences they generated to the reference genome of horses at an approximate mapping rate of 98%.
- Achieve a coverage of roughly 95% for the genome, with at least 10 times coverage.
Identified Genetic Variations in the Marwari Breed
The study successfully identified several genetic variations. These include:
- Approximately 5.9 million single nucleotide variations. These are changes in a single nucleotide, the basic building blocks of DNA.
- About 0.6 million small insertions or deletions. These are removals or additions of small chunks of DNA sequences.
- 2,569 copy number variation blocks. These are sections of the genome where the number of copies varies between individuals in a species.
Tracing the Genetic Origin of the Marwari Breed
The researchers reported that the Marwari genome displayed a strong Arabian and Mongolian genetic component, tracing back to its believed historical origins from crossbreeding of local Indian ponies with Arabian horses.
Annotation of Variants, Their Possible Functions and Traits
The team went further to annotate the novel variants they discovered in the Marwari sequences. This helped in understanding the roles these variations may play in the breed. They found out that the variants show enrichment in olfactory functions. The researchers also identified a potential functional genetic variant in the TSHZ1 gene, which they suggest could be associated with the characteristic inward-turning ear tip shape of the Marwari horses.
Significance of the Study
This study is the first to present genomic data for an Asian breed. The findings serve as a valuable resource for future research aiming to explore genetic variation associated with phenotypes and diseases in horses.
Cite This Article
Publication
Researcher Affiliations
MeSH Terms
- Amino Acid Sequence
- Animals
- Evolution, Molecular
- Genetic Variation
- Genome / genetics
- Genomics
- Genotype
- Horses / genetics
- Humans
- Hybridization, Genetic
- Male
- Molecular Sequence Data
- Phenotype
- Selection, Genetic
- Sequence Analysis, DNA
- Species Specificity
Grant Funding
- R01 HG006876 / NHGRI NIH HHS
References
- Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, Lear TL, Adelson DL, Bailey E, Bellone RR, Blocker H, Distl O, Edgar RC, Garber M, Leeb T, Mauceli E, MacLeod JN, Penedo MC, Raison JM, Sharpe T, Vogel J, Andersson L, Antczak DF, Biagi T, Binns MM, Chowdhary BP, Coleman SJ, Della Valle G, Fryc S, Guerin G. Genome sequence, comparative analysis, and population genetics of the domestic horse.. Science 2009;326:865–867.
- Warmuth V, Eriksson A, Bower MA, Barker G, Barrett E, Hanks BK, Li S, Lomitashvili D, Ochir-Goryaeva M, Sizonov GV, Soyonov V, Manica A. Reconstructing the origin and spread of horse domestication in the Eurasian steppe.. Proc Natl Acad Sci USA 2012;109:8202–8206.
- Warmuth V, Eriksson A, Bower MA, Cañon J, Cothran G, Distl O, Glowatzki-Mullis ML, Hunt H, Luís C, do Mar Oom M, Yupanqui IT, Ząbek T, Manica A. European Domestic Horses Originated in Two Holocene Refugia.. PLoS One 2011;6:e18194.
- Doan R, Cohen ND, Sawyer J, Ghaffari N, Johnson CD, Dindot SV. Whole-Genome Sequencing and Genetic Variant Analysis of a Quarter Horse Mare.. BMC Genomics 2012;13:78.
- Online Mendelian Inheritance in Animals. http://omia.angis.org.au/home
- Orlando L, Ginolhac A, Zhang G, Froese D, Albrechtsen A, Stiller M, Schubert M, Cappellini E, Petersen B, Moltke I, Johnson PL, Fumagalli M, Vilstrup JT, Raghavan M, Korneliussen T, Malaspinas AS, Vogt J, Szklarczyk D, Kelstrup CD, Vinther J, Dolocan A, Stenderup J, Velazquez AM, Cahill J, Rasmussen M, Wang X, Min J, Zazula GD, Seguin-Orlando A, Mortensen C. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse.. Nature 2013;499:74–81.
- Hendricks B. International Encyclopedia of Horse Breeds. Norman: University of Oklahoma Press; 1995.
- Elwyn Hartley Edwards. The Encyclopedia of the Horse. New York: Dorling Kindersley; 1994.
- Wendy Doniger. The Hindus: An Alternative History. New Delhi: Penguin Books; 2009.
- Behl R, Behl J, Gupta N, Gupta SC. Genetic relationships of five Indian horse breeds using microsatellite markers.. Animal 2007;4:483–488.
- Dutson Judith. Storey's Illustrated Guide to 96 Horse Breeds of North America. North adams: Storey Publishing; 2005.
- Gupta AK, Chauhan M, Tandon SN, Sonia. Genetic diversity and bottleneck studies in the Marwari horse breed.. J Genet 2005;84:295–301.
- Petersen JL, Mickelson JR, Rendahl AK, Valberg SJ, Andersson LS, Axelsson J, Bailey E, Bannasch D, Binns MM, Borges AS, Brama P, da Câmara Machado A, Capomaccio S, Cappelli K, Cothran EG, Distl O, Fox-Clipsham L, Graves KT, Guérin G, Haase B, Hasegawa T, Hemmann K, Hill EW, Leeb T, Lindgren G, Lohi H, Lopes MS, McGivney BA, Mikko S, Orr N. Genome-wide analysis reveals selection for important traits in domestic horse breeds.. PLoS Genet 2013;9:e1003211.
- IBM Corp. IBM SPSS Statistics for Windows, Version 22.0. NY: IBM; 2013.
- Huang da W, Sherman BT, Zheng X, Yang J, Imamichi T, Stephens R, Lempicki RA. Extracting biological meaning from large gene lists with DAVID.. Curr Protoc Bioinformatics 2009. Chapter 13:Unit 13.11.
- Miller CA, Hampton O, Coarfa C, Milosavljevic A. ReadDepth: a parallel R package for detecting copy number alterations from short sequencing reads.. PLoS One 2011;6:e16327.
- John F, Wall. Famous Running Horses: Their Forebears and Descendants. Whitefish: Literary Licensing; 2013.
- Robert Moorman Denhardt. The Quarter Horse Running: America's Oldest Breed. Norman: University of Oklahoma Press; 2003.
- Llamas. This is the Spanish Horse. London: J A Allen & Co Ltd; 1999.
- Milner. Godolphin Arabian: Story of the Matchem Line. London: J. A. Allen; 1990.
- Breed of Livestock. http://www.ansi.okstate.edu/breeds/horses/
- International Museum of the HORSE. http://www.imh.org/exhibits/online/breeds-of-the-world
- Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and1 correlated allele frequencies.. Genetics 2003;164:1567–1587.
- Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data.. Genetics 2000;155:945–959.
- Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations.. Nat Methods 2010;7:248–249.
- ALTMANN F. Congenital atresia of the ear in man and animals.. Ann Otol Rhinol Laryngol 1955;64:824–858.
- Yellon RF, Branstetter BF 4th. Prospective blinded study of computed tomography in congenital aural atresia.. Int J Pediatr Otorhinolaryngol 2010;74:1286–1291.
- Coré N, Caubit X, Metchat A, Boned A, Djabali M, Fasano L. Tshz1 is required for axial skeleton, soft palate and middle ear development in mice.. Dev Biol 2007;308:407–420.
- Hill EW, Gu J, McGivney BA, MacHugh DE. Targets of selection in the Thoroughbred genome contain exercise-relevant gene SNPs associated with elite racecourse performance.. Anim Genet 2010;41:56–63.
- Bellone RR, Forsyth G, Leeb T, Archer S, Sigurdsson S, Imsland F, Mauceli E, Engensteiner M, Bailey E, Sandmeyer L, Grahn B, Lindblad-Toh K, Wade CM. Fine-mapping and mutation analysis of TRPM1: a candidate gene for leopard complex (LP) spotting and congenital stationary night blindness in horses.. Brief Funct Genomics 2010;9:193–207.
- Tryon RC, White SD, Bannasch DL. Homozygosity mapping approach identifies a missense mutation in equine cyclophilin B (PPIB) associated with HERDA in the American Quarter Horse.. Genomics 2007;90:93–102.
- Brooks SA, Gabreski N, Miller D, Brisbin A, Brown HE, Streeter C, Mezey J, Cook D, Antczak DF. Whole-genome SNP association in the horse: identification of a deletion in myosin Va responsible for Lavender Foal Syndrome.. PLoS Genet 2010;6:e1000909.
- Marklund L, Moller MJ, Sandberg K, Andersson L. A missense mutation in the gene for melanocyte-stimulating hormone receptor (MC1R) is associated with the chestnut coat color in horses.. Mamm Genome 1996;7:895–899.
- Wagner HJ, Reissmann M. New polymorphism detected in the horse MC1R gene.. Anim Genet 2000;31:289–290.
- Brooks SA, Bailey E. Exon skipping in the KIT gene causes a Sabino spotting pattern in horses.. Mamm Genome 2005;16:893–902.
- Brooks SA, Lear TL, Adelson DL, Bailey E. A chromosome inversion near the KIT gene and the Tobiano spotting pattern in horses.. Cytogenet Genome Res 2007;119:225–230.
- Makvandi-Nejad S, Hoffman GE, Allen JJ, Chu E, Gu E, Chandler AM, Loredo AI, Bellone RR, Mezey JG, Brooks SA, Sutter NB. Four loci explain 83% of size variation in the horse.. PLoS One 2012;7:e39929.
- Signer-Hasler H, Flury C, Haase B, Burger D, Simianer H, Leeb T, Rieder S. A genome-wide association study reveals loci influencing height and other conformation traits in horses.. PLoS One 2012;7:e37282.
- Spirito F, Charlesworth A, Linder K, Ortonne JP, Baird J, Meneguzzi G. Animal models for skin blistering conditions: absence of laminin 5 causes hereditary junctional mechanobullous disease in the Belgian horse.. J Invest Dermatol 2002;119:684–691.
- Brunberg E, Andersson L, Cothran G, Sandberg K, Mikko S, Lindgren G. A missense mutation in PMEL17 is associated with the Silver coat color in the horse.. BMC Genet 2006;7:46.
- Graves KT, Henney PJ, Ennis RB. Partial deletion of the LAMA3 gene is responsible for hereditary junctional epidermolysis bullosa in the American Saddlebred Horse.. Anim Genet 2009;40:35–41.
- Shin EK, Perryman LE, Meek K. A kinase-negative mutation of DNA-PK(CS) in equine SCID results in defective coding and signal joint formation.. J Immunol 1997;158:3565–3569.
- Aleman M, Riehl J, Aldridge BM, Lecouteur RA, Stott JL, Pessah IN. Association of a mutation in the ryanodine receptor 1 gene with equine malignant hyperthermia.. Muscle Nerve 2004;30:356–365.
- Gu J, MacHugh DE, McGivney BA, Park SD, Katz LM, Hill EW. Association of sequence variants in CKM (creatine kinase, muscle) and COX4I2 (cytochrome c oxidase, subunit 4, isoform 2) genes with racing performance in Thoroughbred horses.. Equine Vet J 2010;42:569–75.
- McCue ME, Valberg SJ, Miller MB, Wade C, DiMauro S, Akman HO, Mickelson JR. Glycogen synthase (GYS1) mutation causes a novel skeletal muscle glycogenosis.. Genomics 2008;91:458–466.
- Cannon SC, Hayward LJ, Beech J, Brown RH Jr. Sodium channel inactivation is impaired in equine hyperkalemic periodic paralysis.. J Neurophysiol 1995;73:1892–1899.
- Orr N, Back W, Gu J, Leegwater P, Govindarajan P, Conroy J, Ducro B, Van Arendonk JA, MacHugh DE, Ennis S, Hill EW, Brama PA. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.. Anim Genet 2010;41:2–7.
- Cook D, Brooks S, Bellone R, Bailey E. Missense mutation in exon 2 of SLC36A1 responsible for champagne dilution in horses.. PLoS Genet 2008;4:e1000195.
- Hansen M, Knorr C, Hall AJ, Broad TE, Brenig B. Sequence analysis of the equine SLC26A2 gene locus on chromosome 14q15-->q21.. Cytogenet Genome Res 2007;118:55–62.
- Yang GC, Croaker D, Zhang AL, Manglick P, Cartmill T, Cass D. A dinucleotide mutation in the endothelin-B receptor gene is associated with lethal white foal syndrome (LWFS); a horse variant of Hirschsprung disease.. Hum Mol Gene 1998;7:1047–1052.
- Hill EW, McGivney BA, Gu J, Whiston R, Machugh DE. A genome-wide SNP association study confirms a sequence variant (g.66493737C > T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses.. BMC Genomics 2010;11:552.
- Mariat D, Taourit S, Guérin G. A mutation in the MATP gene causes the cream coat colour in the horse.. Genet Sel Evol 2003;35:119–133.
- Rieder S, Taourit S, Mariat D, Langlois B, Guérin G. Mutations in the agouti (ASIP), the extension (MC1R), and the brown (TYRP1) loci and their association to coat color phenotypes in horses (Equus caballus). Mamm Genome 2001;12:450–455.
- Andersson LS, Larhammar M, Memic F, Wootz H, Schwochow D, Rubin CJ, Patra K, Arnason T, Wellbring L, Hjälm G, Imsland F, Petersen JL, McCue ME, Mickelson JR, Cothran G, Ahituv N, Roepstorff L, Mikko S, Vallstedt A, Lindgren G, Andersson L, Kullander K. Mutations in DMRT3 affect locomotion in horses and spinal circuit function in mice.. Nature 2012;488:642–646.
- Rosengren Pielberg G, Golovko A, Sundström E, Curik I, Lennartsson J, Seltenhammer MH, Druml T, Binns M, Fitzsimmons C, Lindgren G, Sandberg K, Baumung R, Vetterlein M, Strömberg S, Grabherr M, Wade C, Lindblad-Toh K, Pontén F, Heldin CH, Sölkner J, Andersson L. A cis-acting regulatory mutation causes premature hair graying and susceptibility to melanoma in the horse.. Nat Genet 2008;40:1004–1009.
- Nielsen R, Bustamante C, Clark AG, Glanowski S, Sackton TB, Hubisz MJ, Fledel-Alon A, Tanenbaum DM, Civello D, White TJ, J Sninsky J, Adams MD, Cargill M. A scan for positively selected genes in the genomes of humans and chimpanzees.. PLoS Biol 2005;3:el70.
- Li L, Stoeckert CJ Jr, Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes.. Genome Res 2003;13:2178–2189.
- Macfadden BJ. Fossil Horses: Systematics, Paleobiology, and Evolution of the Family Equidae. Cambridge:Cambridge University Press; 1994.
- Macfadden BJ. Evolution. Fossil horses--evidence for evolution.. Science 2005;307:1728–1730.
- Patel RK, Jain M. NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data.. PLoS One 2012;7:e30619.
- Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler Transform.. Bioinformatics 2009;25:1754–1760.
- McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data.. Genome Res 2010;20:1297–1303.
- Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu SM, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam TW, Wang J. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.. Gigascience 2012;1:18.
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The Sequence alignment/map (SAM) format and SAMtools.. Bioinformatics 2009;25:2078–2079.
- Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.. Fly(Austin) 2012;6:80–92.
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC. PLINK: a toolset for whole-genome association and population-based linkage analysis.. Amer J Hum Genet 2007;81:559–575.
- Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.. Bioinformatics 2006;22:2688–2690.
- Stamatakis A, Aberer AJ, Goll C, Smith SA, Berger SA, Izquierdo-Carrasco F. RAxML-Light: a tool for computing terabyte phylogenies.. Bioinformatics 2012;28:2064–2066.
- Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0.. Mol Biol Evol 2013;30:2725–2729.
- Ihaka R, Gentleman R. R: A Language for Data Analysis and Graphics.. J Comput Graph Stat 1996;5:299–314.
- Earl DA, Vonholdt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method.. Conserv Genet Resour 2012;4:359–361.
- Rosenberg NA. DISTRUCT: a program for the graphical display of population structure.. Mol Ecol Notes 2004;4:137–138.
- Yim HS, Cho YS, Guang X, Kang SG, Jeong JY, Cha SS, Oh HM, Lee JH, Yang EC, Kwon KK, Kim YJ, Kim TW, Kim W, Jeon JH, Kim SJ, Choi DH, Jho S, Kim HM, Ko J, Kim H, Shin YA, Jung HJ, Zheng Y, Wang Z, Chen Y, Chen M, Jiang A, Li E, Zhang S, Hou H. Minke whale genome and aquatic adaptation in cetaceans.. Nat Genet 2014;46:88–92.
- Ji R, Cui P, Ding F, Geng J, Gao H, Zhang H, Yu J, Hu S, Meng H. Monophyletic origin of domestic bactrian camel (Camelus bactrianus) and its evolutionary relationship with the extant wild camel (Camelus bactrianus ferus). Anim Genet 2009;40:377–382.
- Yang Z. PAML 4: phylogenetic analysis by maximum likelihood.. Mol Biol Evol 2007;24:1586–1591.
- Zhang J, Nielsen R, Yang Z. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level.. Mol Biol Evol 2005;22:2472–2479.
Citations
This article has been cited 13 times.- Sharma M, Singh A, Kumar V, Olla N, Arora R, Sharma R, Mohan NH, Ahlawat S. Advances in Equine Genomics: Decoding the Genetic Architecture of Morphology, Performance, Behavior, and Adaptation. Mol Biotechnol 2025 Dec 19;.
- Bhardwaj A, Tandon G, Pal Y, Sharma NK, Nayan V, Soni S, Iquebal MA, Jaiswal S, Legha RA, Talluri TR, Bhattacharya TK, Kumar D, Rai A, Tripathi BN. Genome-Wide Single-Nucleotide Polymorphism-Based Genomic Diversity and Runs of Homozygosity for Selection Signatures in Equine Breeds. Genes (Basel) 2023 Aug 14;14(8).
- Vincelette A. The Characteristics, Distribution, Function, and Origin of Alternative Lateral Horse Gaits. Animals (Basel) 2023 Aug 8;13(16).
- Arslan M. Whole-genome sequencing and genomic analysis of Norduz goat (Capra hircus). Mamm Genome 2023 Sep;34(3):437-448.
- Polani S, Dean M, Lichter-Peled A, Hendrickson S, Tsang S, Fang X, Feng Y, Qiao W, Avni G, Kahila Bar-Gal G. Sequence Variant in the TRIM39-RPP21 Gene Readthrough is Shared Across a Cohort of Arabian Foals Diagnosed with Juvenile Idiopathic Epilepsy. J Genet Mutat Disord 2022 Jan;1(1).
- Li J, Fan Z, Shen F, Pendleton AL, Song Y, Xing J, Yue B, Kidd JM, Li J. Genomic Copy Number Variation Study of Nine Macaca Species Provides New Insights into Their Genetic Divergence, Adaptation, and Biomedical Application. Genome Biol Evol 2020 Dec 6;12(12):2211-2230.
- Li B, He X, Zhao Y, Bai D, Du M, Song L, Liu Z, Yin Z, Manglai D. Transcriptome profiling of developing testes and spermatogenesis in the Mongolian horse. BMC Genet 2020 Apr 28;21(1):46.
- Felkel S, Vogl C, Rigler D, Dobretsberger V, Chowdhary BP, Distl O, Fries R, Jagannathan V, Janečka JE, Leeb T, Lindgren G, McCue M, Metzger J, Neuditschko M, Rattei T, Raudsepp T, Rieder S, Rubin CJ, Schaefer R, Schlötterer C, Thaller G, Tetens J, Velie B, Brem G, Wallner B. The horse Y chromosome as an informative marker for tracing sire lines. Sci Rep 2019 Apr 15;9(1):6095.
- Seong HS, Kim NY, Kim DC, Hwang NH, Son DH, Shin JS, Lee JH, Chung WH, Choi JW. Whole genome sequencing analysis of horse populations inhabiting the Korean Peninsula and Przewalski's horse. Genes Genomics 2019 Jun;41(6):621-628.
- Li J, Fan Z, Sun T, Peng C, Yue B, Li J. Comparative genome-wide survey of single nucleotide variation uncovers the genetic diversity and potential biomedical applications among six Macaca species. Int J Mol Sci 2018 Oct 11;19(10).
- Schurink A, da Silva VH, Velie BD, Dibbits BW, Crooijmans RPMA, Franҫois L, Janssens S, Stinckens A, Blott S, Buys N, Lindgren G, Ducro BJ. Copy number variations in Friesian horses and genetic risk factors for insect bite hypersensitivity. BMC Genet 2018 Jul 30;19(1):49.
- Lee KH, Lim D, Greenhalgh D, Cho K. Highly Variable Genomic Landscape of Endogenous Retroviruses in the C57BL/6J Inbred Strain, Depending on Individual Mouse, Gender, Organ Type, and Organ Location. Int J Genomics 2017;2017:3152410.
- Schönbach C, Tan T, Ranganathan S. InCoB2014: mining biological data from genomics for transforming industry and health. BMC Genomics 2014;15 Suppl 9(Suppl 9):I1.