Short Insertion and Deletion Discoveries via Whole-Genome Sequencing of 101 Thoroughbred Racehorses.
Abstract: Thoroughbreds are some of the most famous racehorses worldwide and are currently animals of high economic value. To understand genomic variability in Thoroughbreds, we identified genome-wide insertions and deletions (INDELs) and obtained their allele frequencies in this study. INDELs were obtained from whole-genome sequencing data of 101 Thoroughbred racehorses by mapping sequence reads to the horse reference genome. By integrating individual data, 1,453,349 and 113,047 INDELs were identified in the autosomal (1-31) and X chromosomes, respectively, while 18 INDELs were identified on the mitochondrial genome, totaling 1,566,414 INDELs. Of those, 779,457 loci (49.8%) were novel INDELs, while 786,957 loci (50.2%) were already registered in Ensembl. The sizes of diallelic INDELs ranged from -286 to +476, and the majority, 717,736 (52.14%) and 220,672 (16.03%), were 1-bp and 2-bp variants, respectively. Numerous INDELs were found to have lower frequencies of alternative (Alt) alleles. Many rare variants with low Alt allele frequencies (<0.5%) were also detected. In addition, 5955 loci were genotyped as having a minor allele frequency of 0.5 and being heterogeneous genotypes in all the horses. While short-read sequencing and its mapping to reference genome is a simple way of detecting variants, fake variants may be detected. Therefore, our data help to identify true variants in Thoroughbred horses. The INDEL database we constructed will provide useful information for genetic studies and industrial applications in Thoroughbred horses, including a gene-editing test for gene-doping control and a parentage test using INDELs for horse registration and identification.
Publication Date: 2023-03-03 PubMed ID: 36980910PubMed Central: PMC10048024DOI: 10.3390/genes14030638Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
- Journal Article
- Research Support
- Non-U.S. Gov't
Summary
This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.
The research article discusses a study which maps the entire genome of 101 Thoroughbred racehorses to identify genetic variations, specifically insertions and deletions (INDELs), in order to construct a database that can assist in genetic studies and commercial applications.
Methodology
- The research team conducted whole-genome sequencing of 101 Thoroughbred racehorses. The aim was to understand the genetic diversity present within these horses by identifying genome-wide insertions and deletions (INDELs).
- The sequence reads were then mapped to a reference horse genome, which allowed the scientists to identify sections of the genome where sequences were either inserted or deleted.
Findings
- The researchers were able to identify approximately 1.5 million INDELs within the genomes. Some INDELs were identified on the mitochondrial genome, while the rest were identified in the autosomal and X chromosomes.
- About half of these INDELs were novel or not previously documented, and the other half were already known and registered in the Ensembl database.
- Many of the identified INDELs are of very small size i.e., 1- to 2-bp (base pair) variants. These represent substantial genetic variability among Thoroughbreds.
- It was observed that many INDELs have lower frequencies of alternative alleles, indicating that these genetic variations are not common among the population of Thoroughbreds.
- In addition to common variants, researchers also detected many rare variants with low alternate allele frequencies.
Implications
- The data obtained from the research could be useful for identifying true genetic variants within Thoroughbred horses. This can be critical in studies which aim to understand the genetic makeup of these horses and differentiate between genuine and fake variants.
- The INDELs identified form a database that could potentially be used for a range of applications – both scientific and corporate – involving Thoroughbred horses. For example, it could be useful in gene-editing tests for gene-doping control in horse racing. It could also assist with parentage tests using INDELs for horse registration and identification purposes.
Cite This Article
APA
Tozaki T, Ohnuma A, Kikuchi M, Ishige T, Kakoi H, Hirota KI, Takahashi Y, Nagata SI.
(2023).
Short Insertion and Deletion Discoveries via Whole-Genome Sequencing of 101 Thoroughbred Racehorses.
Genes (Basel), 14(3), 638.
https://doi.org/10.3390/genes14030638 Publication
Researcher Affiliations
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
- Equine Research Institute, Japan Racing Association, 1400-4 Shiba, Shimotsuke 329-0412, Tochigi, Japan.
- Genetic Analysis Department, Laboratory of Racing Chemistry, 1731-2 Tsurutamachi, Utsunomiya 320-0851, Tochigi, Japan.
MeSH Terms
- Horses / genetics
- Animals
- Genotype
- Sequence Analysis
- Genomics
- INDEL Mutation
- Genome, Mitochondrial
Conflict of Interest Statement
There are no competing interests, including patents, products in development, or marketed products, to declare concerning this study.
References
This article includes 34 references
- Bower MA, Campana MG, Whitten M, Edwards CJ, Jones H, Barrett E, Cassidy R, Nisbet RE, Hill EW, Howe CJ, Binns M. The cosmopolitan maternal heritage of the Thoroughbred racehorse breed shows a significant contribution from British and Irish native mares.. Biol Lett 2011 Apr 23;7(2):316-20.
- International Stud Book Committee (Resouces/Thoroughbred Horse Statistics) [(accessed on 2 February 2023)]. Available online: https://www.internationalstudbook.com/resources/
- Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, Lear TL, Adelson DL, Bailey E, Bellone RR, Blöcker H, Distl O, Edgar RC, Garber M, Leeb T, Mauceli E, MacLeod JN, Penedo MC, Raison JM, Sharpe T, Vogel J, Andersson L, Antczak DF, Biagi T, Binns MM, Chowdhary BP, Coleman SJ, Della Valle G, Fryc S, Guérin G, Hasegawa T, Hill EW, Jurka J, Kiialainen A, Lindgren G, Liu J, Magnani E, Mickelson JR, Murray J, Nergadze SG, Onofrio R, Pedroni S, Piras MF, Raudsepp T, Rocchi M, Røed KH, Ryder OA, Searle S, Skow L, Swinburne JE, Syvänen AC, Tozaki T, Valberg SJ, Vaudin M, White JR, Zody MC, Lander ES, Lindblad-Toh K. Genome sequence, comparative analysis, and population genetics of the domestic horse.. Science 2009 Nov 6;326(5954):865-7.
- Kalbfleisch TS, Rice ES, DePriest MS Jr, Walenz BP, Hestand MS, Vermeesch JR, O Connell BL, Fiddes IT, Vershinina AO, Saremi NF, Petersen JL, Finno CJ, Bellone RR, McCue ME, Brooks SA, Bailey E, Orlando L, Green RE, Miller DC, Antczak DF, MacLeod JN. Improved reference genome for the domestic horse increases assembly contiguity and composition.. Commun Biol 2018;1:197.
- Jagannathan V, Gerber V, Rieder S, Tetens J, Thaller G, Drögemüller C, Leeb T. Comprehensive characterization of horse genome variation by whole-genome sequencing of 88 horses.. Anim Genet 2019 Feb;50(1):74-77.
- Durward-Akhurst SA, Schaefer RJ, Grantham B, Carey WK, Mickelson JR, McCue ME. Genetic Variation and the Distribution of Variant Types in the Horse.. Front Genet 2021;12:758366.
- Tozaki T, Ohnuma A, Kikuchi M, Ishige T, Kakoi H, Hirota KI, Kusano K, Nagata SI. Rare and common variant discovery by whole-genome sequencing of 101 Thoroughbred racehorses.. Sci Rep 2021 Aug 6;11(1):16057.
- Tozaki T, Hamilton NA. Control of gene doping in human and horse sports.. Gene Ther 2022 Apr;29(3-4):107-112.
- Moro LN, Viale DL, Bastón JI, Arnold V, Suvá M, Wiedenmann E, Olguín M, Miriuka S, Vichera G. Generation of myostatin edited horse embryos using CRISPR/Cas9 technology and somatic cell nuclear transfer.. Sci Rep 2020 Sep 24;10(1):15587.
- Kim DE, Lee JH, Ji KB, Park KS, Kil TY, Koo O, Kim MK. Generation of genome-edited dogs by somatic cell nuclear transfer.. BMC Biotechnol 2022 Jul 13;22(1):19.
- Sheets TP, Park CH, Park KE, Powell A, Donovan DM, Telugu BP. Somatic Cell Nuclear Transfer Followed by CRIPSR/Cas9 Microinjection Results in Highly Efficient Genome Editing in Cloned Pigs.. Int J Mol Sci 2016 Dec 3;17(12).
- Hill EW, Gu J, Eivers SS, Fonseca RG, McGivney BA, Govindarajan P, Orr N, Katz LM, MacHugh DE. A sequence polymorphism in MSTN predicts sprinting ability and racing stamina in thoroughbred horses.. PLoS One 2010 Jan 20;5(1):e8645.
- Tozaki T, Sato F, Hill EW, Miyake T, Endo Y, Kakoi H, Gawahara H, Hirota K, Nakano Y, Nambo Y, Kurosawa M. Sequence variants at the myostatin gene locus influence the body composition of Thoroughbred horses.. J Vet Med Sci 2011 Dec;73(12):1617-24.
- Tozaki T, Ohnuma A, Nakamura K, Hano K, Takasu M, Takahashi Y, Tamura N, Sato F, Shimizu K, Kikuchi M, Ishige T, Kakoi H, Hirota KI, Hamilton NA, Nagata SI. Detection of Indiscriminate Genetic Manipulation in Thoroughbred Racehorses by Targeted Resequencing for Gene-Doping Control.. Genes (Basel) 2022 Sep 4;13(9).
- Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, Hall KP, Evers DJ, Barnes CL, Bignell HR, Boutell JM, Bryant J, Carter RJ, Keira Cheetham R, Cox AJ, Ellis DJ, Flatbush MR, Gormley NA, Humphray SJ, Irving LJ, Karbelashvili MS, Kirk SM, Li H, Liu X, Maisinger KS, Murray LJ, Obradovic B, Ost T, Parkinson ML, Pratt MR, Rasolonjatovo IM, Reed MT, Rigatti R, Rodighiero C, Ross MT, Sabot A, Sankar SV, Scally A, Schroth GP, Smith ME, Smith VP, Spiridou A, Torrance PE, Tzonev SS, Vermaas EH, Walter K, Wu X, Zhang L, Alam MD, Anastasi C, Aniebo IC, Bailey DM, Bancarz IR, Banerjee S, Barbour SG, Baybayan PA, Benoit VA, Benson KF, Bevis C, Black PJ, Boodhun A, Brennan JS, Bridgham JA, Brown RC, Brown AA, Buermann DH, Bundu AA, Burrows JC, Carter NP, Castillo N, Chiara E Catenazzi M, Chang S, Neil Cooley R, Crake NR, Dada OO, Diakoumakos KD, Dominguez-Fernandez B, Earnshaw DJ, Egbujor UC, Elmore DW, Etchin SS, Ewan MR, Fedurco M, Fraser LJ, Fuentes Fajardo KV, Scott Furey W, George D, Gietzen KJ, Goddard CP, Golda GS, Granieri PA, Green DE, Gustafson DL, Hansen NF, Harnish K, Haudenschild CD, Heyer NI, Hims MM, Ho JT, Horgan AM, Hoschler K, Hurwitz S, Ivanov DV, Johnson MQ, James T, Huw Jones TA, Kang GD, Kerelska TH, Kersey AD, Khrebtukova I, Kindwall AP, Kingsbury Z, Kokko-Gonzales PI, Kumar A, Laurent MA, Lawley CT, Lee SE, Lee X, Liao AK, Loch JA, Lok M, Luo S, Mammen RM, Martin JW, McCauley PG, McNitt P, Mehta P, Moon KW, Mullens JW, Newington T, Ning Z, Ling Ng B, Novo SM, O'Neill MJ, Osborne MA, Osnowski A, Ostadan O, Paraschos LL, Pickering L, Pike AC, Pike AC, Chris Pinkard D, Pliskin DP, Podhasky J, Quijano VJ, Raczy C, Rae VH, Rawlings SR, Chiva Rodriguez A, Roe PM, Rogers J, Rogert Bacigalupo MC, Romanov N, Romieu A, Roth RK, Rourke NJ, Ruediger ST, Rusman E, Sanches-Kuiper RM, Schenker MR, Seoane JM, Shaw RJ, Shiver MK, Short SW, Sizto NL, Sluis JP, Smith MA, Ernest Sohna Sohna J, Spence EJ, Stevens K, Sutton N, Szajkowski L, Tregidgo CL, Turcatti G, Vandevondele S, Verhovsky Y, Virk SM, Wakelin S, Walcott GC, Wang J, Worsley GJ, Yan J, Yau L, Zuerlein M, Rogers J, Mullikin JC, Hurles ME, McCooke NJ, West JS, Oaks FL, Lundberg PL, Klenerman D, Durbin R, Smith AJ. Accurate whole human genome sequencing using reversible terminator chemistry.. Nature 2008 Nov 6;456(7218):53-9.
- Tozaki T, Ohnuma A, Kikuchi M, Ishige T, Kakoi H, Hirota KI, Kusano K, Nagata SI. Identification of processed pseudogenes in the genome of Thoroughbred horses: Possibility of gene-doping detection considering the presence of pseudogenes.. Anim Genet 2022 Apr;53(2):183-192.
- Kloosterman WP, Francioli LC, Hormozdiari F, Marschall T, Hehir-Kwa JY, Abdellaoui A, Lameijer EW, Moed MH, Koval V, Renkens I, van Roosmalen MJ, Arp P, Karssen LC, Coe BP, Handsaker RE, Suchiman ED, Cuppen E, Thung DT, McVey M, Wendl MC, Uitterlinden A, van Duijn CM, Swertz MA, Wijmenga C, van Ommen GB, Slagboom PE, Boomsma DI, Schönhuth A, Eichler EE, de Bakker PI, Ye K, Guryev V. Characteristics of de novo structural changes in the human genome.. Genome Res 2015 Jun;25(6):792-801.
- Al Abri MA, Holl HM, Kalla SE, Sutter NB, Brooks SA. Whole genome detection of sequence and structural polymorphism in six diverse horses.. PLoS One 2020;15(4):e0230899.
- Miller D, Tallmadge RL, Binns M, Zhu B, Mohamoud YA, Ahmed A, Brooks SA, Antczak DF. Polymorphism at expressed DQ and DR loci in five common equine MHC haplotypes.. Immunogenetics 2017 Mar;69(3):145-156.
- Tallmadge RL, Lear TL, Antczak DF. Genomic characterization of MHC class I genes of the horse.. Immunogenetics 2005 Nov;57(10):763-74.
- Rieder S, Taourit S, Mariat D, Langlois B, Guérin G. Mutations in the agouti (ASIP), the extension (MC1R), and the brown (TYRP1) loci and their association to coat color phenotypes in horses (Equus caballus).. Mamm Genome 2001 Jun;12(6):450-5.
- Kakoi H, Tozaki T, Nagata S, Gawahara H, Kijima-Suda I. Development of a method for simultaneously genotyping multiple horse coat colour loci and genetic investigation of basic colour variation in Thoroughbred and Misaki horses in Japan.. J Anim Breed Genet 2009 Dec;126(6):425-31.
- Rooney MF, Hill EW, Kelly VP, Porter RK. The "speed gene" effect of myostatin arises in Thoroughbred horses due to a promoter proximal SINE insertion.. PLoS One 2018;13(10):e0205664.
- Petersen JL, Valberg SJ, Mickelson JR, McCue ME. Haplotype diversity in the equine myostatin gene with focus on variants associated with race distance propensity and muscle fiber type proportions.. Anim Genet 2014 Dec;45(6):827-35.
- Chen J, Guo JT. Structural and functional analysis of somatic coding and UTR indels in breast and lung cancer genomes.. Sci Rep 2021 Oct 27;11(1):21178.
- Song F, Lang M, Li L, Luo H, Hou Y. Forensic features and genetic background exploration of a new 47-autosomal InDel panel in five representative Han populations residing in Northern China.. Mol Genet Genomic Med 2020 May;8(5):e1224.
- Chen X, Nie S, Hu L, Fang Y, Cui W, Xu H, Zhao C, Zhu BF. Forensic efficacy evaluation and genetic structure exploration of the Yunnan Miao group by a multiplex InDel panel.. Electrophoresis 2022 Sep;43(16-17):1765-1773.
- Huang Y, Liu C, Xiao C, Chen X, Han X, Yi S, Huang D. Mutation analysis of 28 autosomal short tandem repeats in the Chinese Han population.. Mol Biol Rep 2021 Jun;48(6):5363-5369.
- Kakoi H, Nagata S, Kurosawa M. DNA Typing with 17 microsatellites for parentage verification of racehorses in Japan. Anim. Sci. J. 2001;72:453–460.
- Tozaki T, Kakoi H, Mashima S, Hirota K, Hasegawa T, Ishida N, Miura N, Choi-Miura NH, Tomita M. Population study and validation of paternity testing for Thoroughbred horses by 15 microsatellite loci.. J Vet Med Sci 2001 Nov;63(11):1191-7.
- Hirota K, Kakoi H, Gawahara H, Hasegawa T, Tozaki T. Construction and validation of parentage testing for thoroughbred horses by 53 single nucleotide polymorphisms.. J Vet Med Sci 2010 Jun;72(6):719-26.
- Holl HM, Vanhnasy J, Everts RE, Hoefs-Martin K, Cook D, Brooks SA, Carpenter ML, Bustamante CD, Lafayette C. Single nucleotide polymorphisms for DNA typing in the domestic horse.. Anim Genet 2017 Dec;48(6):669-676.
- Tozaki T, Ohnuma A, Kikuchi M, Ishige T, Kakoi H, Hirota KI, Kusano K, Nagata SI. Microfluidic Quantitative PCR Detection of 12 Transgenes from Horse Plasma for Gene Doping Control.. Genes (Basel) 2020 Apr 23;11(4).
- Tozaki T, Kwak HG, Nakamura K, Takasu M, Ishii H, Ohnuma A, Kikuchi M, Ishige T, Kakoi H, Hirota KI, Kusano K, Hirata M, Nirasawa T, Nagata SI. Sequence determination of phosphorothioated oligonucleotides using MALDI-TOF mass spectrometry for controlling gene doping in equestrian sports.. Drug Test Anal 2022 Jan;14(1):175-180.
Citations
This article has been cited 0 times.Use Nutrition Calculator
Check if your horse's diet meets their nutrition requirements with our easy-to-use tool Check your horse's diet with our easy-to-use tool
Talk to a Nutritionist
Discuss your horse's feeding plan with our experts over a free phone consultation Discuss your horse's diet over a phone consultation
Submit Diet Evaluation
Get a customized feeding plan for your horse formulated by our equine nutritionists Get a custom feeding plan formulated by our nutritionists