Abstract: Osteochondritis dissecans (OCD) is a common developmental orthopedic condition in Thoroughbred racehorses and although arthroscopic surgery is widely used for treatment, its long-term effects on race performance remain unclear. This retrospective study evaluated the effect of OCD surgery on race performance, compared the predictive power of pedigree and management variables and applied interpretable machine learning methods for forecasting race performance outcomes. Data were collected from 75 Thoroughbreds that underwent OCD surgery between 2015 and 2017 and 257 maternal siblings without recorded OCD surgery (controls). Variables included biometric, pedigree, surgical and race performance measures. We also derived additional predictors, including earnings per start and sale-price ratios and quantified racing performance using the field-adjusted percentile metric 'race_pts_avg'. We trained gradient-boosting models (XGBoost and CatBoost) and evaluated predictive performance using R² scores and Shapley Additive Explanations (SHAP). Models that included derived features consistently outperformed those without and the best model (CatBoost) achieved R² = 0.7983. Variables related to surgical history, including age at surgery, showed limited predictive value and lesion severity did not rank among the dominant predictors. In contrast, pedigree, particularly the group-mean encoded family identifier 'h_family', ranked highest. These results indicate that OCD surgery does not significantly impair long-term race performance, while pedigree is the strongest predictor in this cohort. Although this observational study is limited by its regional scope and by unmeasured factors that may influence both surgery status and performance, the findings provide objective context for breeding, treatment and sales decisions involving Thoroughbreds with a history of OCD surgery.
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.
Overview
This study used machine learning to investigate how arthroscopic surgery for osteochondritis dissecans (OCD) affects the long-term race performance of Thoroughbred horses.
It also compared the influence of pedigree versus surgery and management factors on predicting race outcomes, finding that pedigree was the strongest predictor and surgery had limited impact.
Background and Objectives
Osteochondritis dissecans (OCD) is a developmental orthopedic condition frequently seen in Thoroughbred racehorses that can potentially impact their athletic performance.
Arthroscopic surgery is commonly performed to treat OCD lesions, but there is uncertainty about how this surgery affects a horse’s long-term racing success.
The study aimed to:
Evaluate the long-term impact of OCD surgery on race performance.
Compare the predictive strength of pedigree data and management factors for race results.
Apply interpretable machine learning techniques to forecast race outcomes and understand contributing variables.
Data and Methods
Retrospective data were collected for 75 Thoroughbreds that underwent OCD surgery between 2015 and 2017, along with 257 maternal siblings without surgery to serve as controls.
Variables included:
Biometric data (e.g., horse measurements)
Pedigree information (family identifiers and relationships)
Additional derived features included earnings per start and sale-price ratios to provide richer predictors of performance.
Race performance was quantified using a field-adjusted metric called ‘race_pts_avg’, which normalizes performance relative to competition.
Two gradient boosting algorithms, XGBoost and CatBoost, were trained and compared for predictive accuracy using R² scores.
Interpretability was achieved using SHapley Additive exPlanations (SHAP), which quantify how each feature influences the output of the machine learning model.
Main Findings
Models that incorporated the derived performance features (like earnings per start) consistently performed better than models that did not include them.
The best-performing model, CatBoost, achieved a high R² value of 0.7983, indicating strong predictive power.
Variables related directly to OCD surgery, such as age at surgery and lesion severity, showed limited importance in predicting race performance.
Pedigree information, particularly a feature called ‘h_family’ which encoded maternal family lineage, was the most influential predictor of performance.
This suggests that genetic lineage outweighs the impact of OCD surgery on long-term racing success in this cohort.
Implications
The study provides evidence that undergoing arthroscopic surgery for OCD does not significantly reduce a Thoroughbred’s future race performance potential.
Pedigree remains the most critical factor in forecasting performance, reinforcing the importance of breeding decisions.
These insights may guide breeders, trainers, veterinarians, and buyers when making treatment, management, and sales decisions for horses with a history of OCD surgery.
Additionally, the use of interpretable machine learning models allows stakeholders to understand which factors drive performance prediction, increasing trust in the results.
Limitations
The study was limited geographically and used a relatively small sample size which may affect generalizability.
As an observational study, there could be unmeasured confounding factors that influence both the likelihood of surgery and race performance outcomes.
Future research with larger, more diverse populations and prospective designs would be beneficial to confirm these results.
Cite This Article
APA
An SJ, Sohn Y, Forbes E, Ryu SH.
(2026).
Machine learning-based prediction and quantification of OCD surgery and pedigree effects on racehorse performance.
Vet J, 316, 106607.
https://doi.org/10.1016/j.tvjl.2026.106607
Department of Artificial Intelligence, Cheju Halla University, Jeju 63092, South Korea.
Sohn, Y
Veterinary Department, Korea Racing Authority, Gwacheon 13822, South Korea.
Forbes, E
Racing Integrity Board, Private Bag 17902, Greenlane, Auckland 1546, New Zealand.
Ryu, S-H
Department of Equine Resources Science, Cheju Halla University, Jeju 63092, South Korea. Electronic address: batmanryu@hanmail.net.
MeSH Terms
Animals
Horses
Pedigree
Retrospective Studies
Machine Learning
Horse Diseases / surgery
Horse Diseases / genetics
Male
Female
Conflict of Interest Statement
Declaration of Competing Interest None of the authors has any other financial or personal relationships that could inappropriately influence or bias the content of the paper.