Abstract: Gene editing and genome manipulation offer great promise for treating diseases in both humans and animals. There is a danger, however, that this technology could be used for other purposes such as performance enhancement. To detect such 'gene doping' events, we evaluated a targeted enrichment panel and next-generation sequencing to assess its reproducibility, sensitivity, and capability of variant detection on a wide variety of samples and biological matrices. The panel was verified against existing data for the myostatin gene, a PCR-based SNP panel, and whole genome sequencing in a subset of samples. As successful detection of seamless edits will rely on a detailed understanding of the natural population, we also screened over 170 Thoroughbreds and catalogued numerous novel variants. These included several resulting in coding alterations, and a structural variant. Samples spiked with transgenic cDNA-based material to simulate gene doping events were detected down to 3.2% mosaicism, giving confidence that mosaic mutations resulting from embryonic introduction of gene editing reagents can be detected using these methods. The ability of software packages to detect gene doping events was also assessed, including multiple genome alignment tools, variant callers, and structural variant callers. Freebayes performed strongest at SNP-based editing detection, and Delly and Manta had complementary advantages depending on the mutation type. For routine testing, a multi-faceted approach to calling should be taken to maximise the detection capabilities.
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.
Plain Language Overview
This study evaluated a specialized genetic testing method to detect gene editing in Thoroughbred horses, focusing on identifying both natural genetic variation and artificially introduced changes that could be used for performance enhancement (“gene doping”).
The research demonstrated the effectiveness of the method in detecting small levels of edited genetic material and assessed various software tools to optimize reliable detection of gene edits in horse populations.
Introduction to the Research
Gene editing and genome manipulation have potential applications for treating diseases in humans and animals, but there is concern about misuse for non-therapeutic purposes such as enhancing athletic performance in horses.
Detecting unauthorized gene editing or “gene doping” in animals presents challenges because edits can be subtle and difficult to differentiate from natural genetic variation.
This research aimed to evaluate a targeted enrichment panel combined with next-generation sequencing (NGS) technology for reliably detecting gene editing in Thoroughbred horses.
Methods and Testing Approach
The targeted enrichment panel was designed to focus on specific gene regions, including the myostatin gene, which affects muscle growth and is often targeted for performance enhancement.
The panel’s performance was benchmarked against:
Existing genetic data for the myostatin gene
A PCR-based single nucleotide polymorphism (SNP) panel
Whole genome sequencing data from a subset of samples
The study included over 170 Thoroughbred horses to build a comprehensive catalog of natural genetic variation, including novel variants and structural variants—key for differentiating spontaneous from artificial edits.
To simulate gene doping events, horse DNA samples were spiked with transgenic cDNA to test the method’s sensitivity in detecting edited sequences, even when present at low levels (mosaicism as low as 3.2%).
Various bioinformatic tools were evaluated for their ability to detect gene editing from sequencing data, including:
Multiple genome alignment tools
Variant callers such as Freebayes
Structural variant callers like Delly and Manta
Key Findings
The targeted enrichment panel showed high reproducibility and sensitivity, effectively detecting gene edits in a variety of biological samples and matrices.
The extensive cataloging of variants in Thoroughbreds revealed numerous novel SNPs and at least one structural variant, highlighting the complexity of natural genetic diversity.
The detection limit for introduced gene edits was as low as 3.2% mosaicism, indicating that even minor edited cell populations (such as those arising from early embryonic gene editing) can be identified.
Freebayes outperformed other variant callers in detecting SNP-based gene edits, while Delly and Manta had complementary strengths in identifying structural variants, depending on mutation type.
For routine surveillance or testing, using a combination of variant calling software improves the probability of detecting different types of edits, suggesting a multi-faceted bioinformatics approach is optimal.
Implications and Conclusions
This research demonstrates the feasibility of detecting gene doping in horses using targeted sequencing panels combined with advanced computational analyses.
The comprehensive understanding of natural population genetics is critical to confidently identify artificial edits versus naturally occurring variants.
Multi-tool software pipelines enhance detection capabilities and should be implemented in monitoring programs for gene editing in animals involved in sports or breeding.
The approaches validated here can serve as a model for developing gene doping detection frameworks in other species or contexts where gene editing may be misused.
Cite This Article
APA
Maniego J, Swinburne J, Hincks P, Habershon-Butcher J, Given J, Ryder E.
(2025).
Evaluation of a targeted enrichment panel for gene editing detection and assessment of population variation in Thoroughbred horses.
Anim Genet, 56(5), e70047.
https://doi.org/10.1111/age.70047
Argentina breeds gene‐edited polo super ponies (2025). https://www.reuters.com/science/argentina‐breeds‐gene‐edited‐polo‐super‐ponies‐2025‐02‐04/. Accessed 28 April 2025.
Bafunno V, Divella C, Sessa F, Tiscia GL, Castellano G, Gesualdo L. De novo homozygous mutation of the C1 inhibitor gene in a patient with hereditary angioedema. Journal of Allergy and Clinical Immunology 2013;132(3):748–750.e3.
Boel A, de Saffel H, Steyaert W, Callewaert B, de Paepe A, Coucke PJ. CRISPR/Cas9‐mediated homology‐directed repair by ssODNs in zebrafish induces complex mutational patterns resulting from genomic integration of repair‐template fragments. Disease Models and Mechanisms 2018;11(10):dmm035352.
Cezard T, Cunningham F, Hunt SE, Koylass B, Kumar N, Saunders G. The European variation archive: a FAIR resource of genomic variation for all species. Nucleic Acids Research 2022;50(D1):D1216–D1220.
Chen X, Schulz-Trieglaff O, Shaw R, Barnes B, Schlesinger F, Källberg M. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 2016;32(8):1220–1222.
Conway JR, Lex A, Gehlenborg N. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics 2017;33(18):2938–2940.
El Refaey M, Xu L, Gao Y, Canan BD, Adesanya TMA, Warner SC. In vivo genome editing restores dystrophin expression and cardiac function in dystrophic mice. Circulation Research 2017;121(8):923–929.
Erbs V, Lorentz R, Eisenman B, Schaeffer L, Luppi L, Lindner L. Increased on‐target rate and risk of concatemerization after CRISPR‐enhanced targeting in ES cells. Genes 2023;14(2):401.
Garrison E, Marth G. Haplotype‐based variant detection from short‐read sequencing. .
Genovese G, Handsaker RE, Li H, Kenny EE, McCarroll SA. Mapping the human reference Genome's missing sequence by three‐way admixture in Latino genomes. American Journal of Human Genetics 93(3), 411–421.
Johnson PL, Dodds KG, Bain WE, Greer GJ, McLean NJ, McLaren RJ. Investigations into the GDF8 g+6723G‐A polymorphism in New Zealand Texel sheep. Journal of Animal Science 87(6), 1856–1864.
Kechin A, Borobova V, Boyarskikh U, Khrapov E, Subbotin S, Filipenko M. NGS‐PrimerPlex: high‐throughput primer design for multiplex polymerase chain reactions. PLoS Computational Biology 16(12), e1008468.
Lefouili M, Nam K. The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non‐human species. Scientific Reports 12(1), 11331.
Lord J, McMullan DJ, Eberhardt RY, Rinck G, Hamilton SJ, Quinlan-Jones E. Prenatal exome sequencing analysis in fetal structural anomalies detected by ultrasonography (PAGE): a cohort study. The Lancet 393(10173), 747–757.
Luqman MW, Jenjaroenpun P, Spathos J, Shingte N, Cummins M, Nimsamer P. Long read sequencing reveals transgene concatemerization and vector sequences integration following AAV-driven electroporation of CRISPR RNP complexes in mouse zygotes. Frontiers in Genome Editing 7, 1582097.
Mahmoud M, Gobet N, Cruz-Dávalos DI, Mounier N, Dessimoz C, Sedlazeck FJ. Structural variant calling: the long and the short of it. Genome Biology 20(1), 246.
Meynert AM, Ansari M, FitzPatrick DR, Taylor MS. Variant detection sensitivity and biases in whole genome and exome sequencing. BMC Bioinformatics 15(1), 247.
Mianné J, Codner GF, Caulder A, Fell R, Hutchison M, King R. Analysing the outcome of CRISPR-aided genome editing in embryos: screening, genotyping and quality control. Methods 121, 68–76.
Monteys AM, Ebanks SA, Keiser MS, Davidson BL. CRISPR/Cas9 editing of the mutant huntingtin allele in vitro and in vivo. Molecular Therapy 25(1), 12–23.
Mu W, Lu HM, Chen J, Li S, Elliott AM. Sanger confirmation is required to achieve optimal sensitivity and specificity in next-generation sequencing panel testing. Journal of Molecular Diagnostics 18(6), 923–932.
Munté E, Roca C, del Valle J, Feliubadaló L, Pineda M, Gel B. Detection of germline CNVs from gene panel data: benchmarking the state of the art. Briefings in Bioinformatics 26(1), 645.
Musunuru K, Grandinette SA, Wang X, Hudson TR, Briseno K, Berry AM. Patient-specific in vivo gene editing to treat a rare genetic disease. New England Journal of Medicine 392, 2235–2243.
Poplin R, Ruano-Rubio V, DePristo MA, Fennell TJ, Carneiro MO, Van der Auwera GA. Scaling accurate genetic variant discovery to tens of thousands of samples. bioRxiv p. 201178.
Rajaby R, Liu DX, Au CH, Cheung YT, Lau AYT, Yang QY. INSurVeyor: improving insertion calling from short read sequencing data. Nature Communications 14(1), 3243.
Ryder E, Gleeson D, Sethi D, Vyas S, Miklejewska E, Dalvi P. Molecular characterization of mutant mouse strains generated from the EUCOMM/KOMP-CSD ES cell resource. Mammalian Genome 24(7–8), 286–294.
Sasani TA, Pedersen BS, Gao Z, Baird L, Przeworski M, Jorde LB. Large, three-generation human families reveal post-zygotic mosaicism and variability in germline mutation accumulation. eLife 8, e46922.
Schuelke M, Wagner KR, Stolz LE, Hübner C, Riebel T, Kömen W. Myostatin mutation associated with gross muscle hypertrophy in a child. New England Journal of Medicine 350(26), 2682–2688.
Tourvas N, Ganopoulos I, Koubouris G, Kostelenos G, Manthos I, Bazakos C. Wild and cultivated olive tree genetic diversity in Greece: a diverse resource in danger of erosion. Frontiers in Genetics 14, 1298565.
Van der Auwera, G., O'Connor, B. & Safari. (2020) Genomics in the cloud: using docker, GATK, and WDL in Terra. O'Reilly Media, p. 300. https://www.oreilly.com/library/view/genomics‐in‐the/9781491975183/
Van LTK, Hien HTD, Kieu HTT, Hieu NLT, Vinh LS, Hoa G. De novo homozygous variant of the SCN1A gene in a patient with severe Dravet syndrome complicated by acute encephalopathy. Neurogenetics 22(2), 133–136.
Van Nieuwerburgh F, Goetghebeur E, Vandewoestyne M, Deforce D. Impact of allelic dropout on evidential value of forensic DNA profiles using RMNE. Bioinformatics 25(2), 225–229.
Wright CF, Campbell P, Eberhardt RY, Aitken S, Perrett D, Brent S. Genomic diagnosis of rare pediatric disease in the United Kingdom and Ireland. New England Journal of Medicine 388(17), 1559–1571.
Young AE, Mansour TA, McNabb BR, Owen JR, Trott JF, Brown CT. Genomic and phenotypic analyses of six offspring of a genome-edited hornless bull. Nature Biotechnology 38(2), 225–232.
Zhang X, Qiu M, Han B, Liao L, Peng X, Lin J. Generation and propagation of high fecundity gene edited fine wool sheep by CRISPR/Cas9. Scientific Reports 15(1), 2557.
Zhang ZD, du J, Lam H, Abyzov A, Urban AE, Snyder M. Identification of genomic indels and structural variations using split reads. BMC Genomics 12, 375.