Evaluation of videoendoscopic examinations of arytenoid function in the 2-year-old Thoroughbred: Can we all agree?
Abstract: Upper respiratory tract (URT) endoscopic examination is a routine part of prepurchase examinations. Discrepancies have been documented in the assessment of arytenoid function grades (AFG) between veterinarians. Objective: To document intra- and interobserver agreement for a population of multi-experience level veterinarians for assessment of AFG of 2-year-old Thoroughbreds. Methods: Observational cohort study. Methods: One-hundred and fourteen URT videoendoscopic examinations were evaluated by 10 veterinarians. Veterinarians were categorised based on experience, into five groups, each group with two veterinarians. Arytenoid function was graded using the Havemeyer ordinal scale and then reclassified by the authors dichotomously into 'meets conditions of sale' (MCS) or 'does not meet conditions of sale'(DNMCS). Interobserver agreement of arytenoid function was assessed across all 10 veterinarians using Fleiss' kappa and between veterinarians of similar experience levels in the five subgroups using Cohen's unweighted (k) and Cohen's linear weighted kappa (Ck). Intraobserver agreement was similarly calculated for each reviewer using 22 repeated video clips. Results: Overall interobserver agreement using ordinal scales was fair (k = 0.27, 95% CI 0.22-0.31) to moderate (mean weighted Ck = 0.57, 95% CI 0.46-0.69) depending on statistical methodology used. Using the dichotomous classification, interobserver agreement was good (k = 0.7, 95% CI 0.63-0.77). Overall intraobserver agreement using ordinal scales was fair (mean k = 0.26, 95% CI 0.14-0.38) to good (mean Ck = 0.61, 95% CI 0.50-0.71); and for the dichotomous reclassification it was good (k = 0.73, 95% CI 0.59-0.87). Experience level differences were identified. Conclusions: The low number of veterinarians in each experience subgroup. Conclusions: Subjectivity exists in arytenoid function grading, despite the existence of a well-defined scale. Agreement variation exists depending on the grading scales and statistical methods used for analysis. Future studies pre- and post veterinarian training are indicated to determine if agreement can be improved.
© 2018 EVJ Ltd.
Publication Date: 2018-10-27 PubMed ID: 30281841DOI: 10.1111/evj.13030Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
- Journal Article
- Observational Study
- Veterinary
- Clinical Examination
- Clinical Study
- Comparative Study
- Diagnosis
- Diagnostic Technique
- Disease Diagnosis
- Epidemiology
- Equine Health
- Horses
- Laryngeal Dysfunction
- Observational Study
- Respiratory Health
- Statistical Analysis
- Thoroughbreds
- Veterinarians
- Veterinary Medicine
- Veterinary Practice
- Veterinary Procedure
- Veterinary Research
- Veterinary Science
Summary
This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.
This study evaluates how variously experienced veterinarians agree when assessing the arytenoid function (part of the horse’s larynx) in two-year-old Thoroughbreds, and highlights discrepancies in these assessments. The study utilized videoendoscopy, a routine procedure in prepurchase horse examinations. It suggests that the grading scale for the arytenoid function and the statistical methods used for analysis could have influenced disagreements among the veterinarians in their final assessments.
Study Design and Methods
- The study was an observational cohort design, focusing on the reliability and agreement in assessing the arytenoid function in horses.
- A total of 114 Upper Respiratory Tract (URT) videoendoscopic examinations of two-year-old Thoroughbreds were evaluated by 10 veterinarians, grouped based on their professional experience.
- The arytenoid function was graded using the widely known Havemeyer ordinal scale, and later these grades were classified dichotomously into two categories: ‘meets conditions of sale’ (MCS) or ‘does not meet conditions of sale'(DNMCS).
- The purpose of the study was to identify the interobserver and intraobserver agreement in these evaluations by measuring them using Fleiss’ kappa (for the interobserver agreement across all 10 veterinarians) and Cohen’s unweighted (k) and Cohen’s linear weighted kappa (Ck) for the intraobserver and interobserver agreement within the five experience-based subgroups.
Results and Findings
- The overall interobserver agreement, when using ordinal scales, ranged from fair to average depending on the methodology used to gauge it.
- However, the agreement improved notably when the results were classified into the two categories (MCS and DNMCS), indicating that the dichotomous classification resulted in a high level of agreement among vets.
- Meanwhile, the intraobserver agreement (agreement within individual vets looking at the same case multiple times) was also within the fair to good range on both criteria- ordinal and dichotomous scales.
- The study did identify that a veterinarian’s experience level appears to impact their assessments, hinting at a need for standardized training to achieve more uniform evaluations.
Conclusions
- The study observes that despite having a well-defined arytenoid function grading scale, there’s a degree of subjectivity in assessments among veterinarians. This subjectivity creates a possibility for inconsistencies especially critical during pre-purchase equine inspections.
- These variations in the agreement might depend more on the grading scales used and the statistical methods employed for analysis rather than individual interpreters.
- The researchers recommend conducting future studies pre- and post-veterinarian training to verify if the level of agreement in evaluations can be improved through standardized training.
Cite This Article
APA
McLellan J, Plevin S.
(2018).
Evaluation of videoendoscopic examinations of arytenoid function in the 2-year-old Thoroughbred: Can we all agree?
Equine Vet J, 51(3), 364-369.
https://doi.org/10.1111/evj.13030 Publication
Researcher Affiliations
- Florida Equine Veterinary Associates, Ocala, Florida, USA.
- Florida Equine Veterinary Associates, Ocala, Florida, USA.
MeSH Terms
- Animals
- Arytenoid Cartilage / anatomy & histology
- Arytenoid Cartilage / diagnostic imaging
- Cohort Studies
- Endoscopy / methods
- Endoscopy / veterinary
- Horses / anatomy & histology
- Observer Variation
- Video Recording
Citations
This article has been cited 2 times.- Hardwick JL, Ahern BJ, Crawford KL, Allen KJ, Anderson BH, Rose KJ, Franklin SH. Yearling laryngeal function grades II.2 and below are not associated with reduced performance. Equine Vet J 2025 Jul;57(4):953-966.
- Hardwick JL, Ahern BJ, Crawford KL, Allen KJ, Franklin SH. Yearling laryngeal function in Thoroughbreds that underwent a laryngoplasty differs from controls. Equine Vet J 2025 Mar;57(2):431-440.
Use Nutrition Calculator
Check if your horse's diet meets their nutrition requirements with our easy-to-use tool Check your horse's diet with our easy-to-use tool
Talk to a Nutritionist
Discuss your horse's feeding plan with our experts over a free phone consultation Discuss your horse's diet over a phone consultation
Submit Diet Evaluation
Get a customized feeding plan for your horse formulated by our equine nutritionists Get a custom feeding plan formulated by our nutritionists