GWAS by GBLUP: Single and Multimarker EMMAX and Bayes Factors, with an Example in Detection of a Major Gene for Horse Gait.
Abstract: Bayesian models for genomic prediction and association mapping are being increasingly used in genetics analysis of quantitative traits. Given a point estimate of variance components, the popular methods SNP-BLUP and GBLUP result in joint estimates of the effect of all markers on the analyzed trait; single and multiple marker frequentist tests (EMMAX) can be constructed from these estimates. Indeed, BLUP methods can be seen simultaneously as Bayesian or frequentist methods. So far there is no formal method to produce Bayesian statistics from GBLUP. Here we show that the Bayes Factor, a commonly admitted statistical procedure, can be computed as the ratio of two normal densities: the first, of the estimate of the marker effect over its posterior standard deviation; the second of the null hypothesis (a value of 0 over the prior standard deviation). We extend the BF to pool evidence from several markers and of several traits. A real data set that we analyze, with ours and existing methods, analyzes 630 horses genotyped for 41711 polymorphic SNPs for the trait "outcome of the qualification test" (which addresses gait, or ambling, of horses) for which a known major gene exists. In the horse data, single marker EMMAX shows a significant effect at the right place at Bonferroni level. The BF points to the same location although with low numerical values. The strength of evidence combining information from several consecutive markers increases using the BF and decreases using EMMAX, which comes from a fundamental difference in the Bayesian and frequentist schools of hypothesis testing. We conclude that our BF method complements frequentist EMMAX analyses because it provides a better pooling of evidence across markers, although its use for primary detection is unclear due to the lack of defined rejection thresholds.
Copyright © 2018 Legarra et al.
Publication Date: 2018-07-02 PubMed ID: 29748199PubMed Central: PMC6027892DOI: 10.1534/g3.118.200336Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
- Journal Article
- Research Support
- Non-U.S. Gov't
Summary
This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.
The research article presents a method for producing Bayesian statistics from a Genomic Best Linear Unbiased Predictors (GBLUP) model with an emphasis on highlighting its applicability in the genetics analysis of quantitative traits. The method designed to examine a major gene for horse gait uses Bayes Factor to quantify evidence from multiple markers and traits.
Introduction to Bayesian Models in Genetic Analysis
- The study emphasizes the growing use of Bayesian models in genetic analysis of quantitative traits. These models bear a unique approach – it holds certain parameters as random variables backed with a probability distribution.
- It revolves around SNP-BLUP (Single Nucleotide Polymorphism Best Linear Unbiased Predictor) and GBLUP, both of which offer combined effect estimates of all analyzed markers on a specific trait.
- The study establishes the dual characteristic of BLUP methods – their ability to adapt to both Bayesian and frequentist statistical techniques.
Bayes Factor Computation
- The researchers introduced a fresh approach that permits generating Bayesian statistics from GBLUP models. The method involves calculating the Bayes Factor as per the ratio of two normal densities.
- The mathematical aspect involves the estimation of the marker effect over its subsequent standard deviation and value 0 over the prior standard deviation in regard to the null hypothesis.
- The researchers also broadened the Bayes Factor to merge evidence obtained from several markers and more than one trait which is not possible through conventional methods.
Empirical Evidence from Horse Data
- This study employs real data from horse genotypes and exhibits the practical implementation of the introduced method using EMMAX (Efficient Mixed-Model Association Expedited).
- The data comprised of 630 horses genotyped for 41711 polymorphic SNPs concerning the trait “outcome of the qualification test” (which pertains to the gait, or ambling of horses).
- With the help of EMMAX, a significant effect was recorded correctly at the Bonferroni level. Furthermore, the Bayes Factor was also able to point to the same location although with a lesser numerical value.
Comparison of Bayesian and Frequentist Methods
- There’s a fundamental difference in how the Bayesian and frequentist schools view hypothesis testing. The strength of evidence combining information from several markers increased using the Bayes Factor and decreased with EMMAX.
- However, it is important to note that although BF method offers a better evidence pooling across markers, its use for primary detection remains unclear due to undefined rejection thresholds.
- Thus, it is concluded that the presented BF method can serve as a complementary tool to frequentist EMMAX analysis in genetic studies.
Cite This Article
APA
Legarra A, Ricard A, Varona L.
(2018).
GWAS by GBLUP: Single and Multimarker EMMAX and Bayes Factors, with an Example in Detection of a Major Gene for Horse Gait.
G3 (Bethesda), 8(7), 2301-2308.
https://doi.org/10.1534/g3.118.200336 Publication
Researcher Affiliations
- INRA (Institut National de la Recherche Agronomique), UMR 1388 GenPhySE, F-31326 Castanet-Tolosan, France andres.legarra@inra.fr.
- INRA (Institut National de la Recherche Agronomique), UMR 1313 GABI, 78352 Jouy-en-Josas, France.
- IFCE (Institut Francais du Cheval et de l'Equitation), Recherche et Innovation, 61310 Exmes, France.
- Departamento de Anatomía, Embriología y Genética, Universidad de Zaragoza, 50013 Zaragoza, Spain.
- Instituto Agroalimentario de Aragón (IA2), 50013 Zaragoza, Spain.
MeSH Terms
- Algorithms
- Animals
- Bayes Theorem
- Databases, Genetic
- Gait
- Genetic Markers
- Genome-Wide Association Study / methods
- Genomics / methods
- Horses
- Models, Genetic
- Quantitative Trait Loci
- Quantitative Trait, Heritable
- Selection, Genetic
References
This article includes 45 references
- Aguilar I, Misztal I, Johnson D, Legarra A, Tsuruta S. Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score1. J. Dairy Sci. 93: 743–752.
- Andersson L S, Larhammar M, Memic F, Wootz H, Schwochow D. Mutations in DMRT3 affect locomotion in horses and spinal circuit function in mice. Nature 488: 642–646.
- Bernal Rubio Y L, Gualdrón Duarte J L, Bates R O, Ernst C W, Nonneman D. Meta-analysis of genome-wide association from genomic prediction models. Anim. Genet. 47: 36–48.
- Brard S, Ricard A. Should we use the single nucleotide polymorphism linked to in genomic evaluation of French trotter?. J. Anim. Sci. 93: 4651–4659.
- Casiró S, Velez-Irizarry D, Ernst C W, Raney N E, Bates R O. Genome-wide association study in an F2 Duroc x Pietrain resource population for economically important meat quality and carcass traits. J. Anim. Sci. 95: 545–558.
- Chen C, Steibel J P, Tempelman R J. Genome-Wide Association Analyses Based on Broadly Different Specifications for Prior Distributions, Genomic Windows, and Estimation Methods. Genetics 206: 1791–1806.
- Christensen O F. Compatibility of pedigree-based and marker-based relationship matrices for single-step genetic evaluation. Genet. Sel. Evol. 44: 37.
- Dikmen S, Cole J B, Null D J, Hansen P J. Genome-wide association mapping for identification of quantitative trait loci for rectal temperature during heat stress in Holstein cattle. PLoS One 8: e69202.
- Endelman J B. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4: 250–255.
- Fernando R L, Habier D, Stricker C, Dekkers J C M, Totir L R. Genomic selection. Acta Agric. Scand. A 57: 192–195.
- Fernando R L, Garrick D. Bayesian methods applied to GWAS. Genome-Wide Assoc. Stud. Genomic Predict. 237–274.
- Gualdrón Duarte J L, Cantet R J, Bates R O, Ernst C W, Raney N E. Rapid screening for phenotype-genotype associations by linear transformations of genomic evaluations. BMC Bioinformatics 15: 246.
- Habier D, Fernando R L, Dekkers J C M. The impact of genetic relationship information on genome-assisted breeding values. Genetics 177: 2389–2397.
- Habier D, Fernando R L, Kizilkaya K, Garrick D J. Extension of the Bayesian alphabet for genomic selection. BMC Bioinformatics 12: 186.
- Hayes B J, Pryce J, Chamberlain A J, Bowman P J, Goddard M E. Genetic Architecture of Complex Traits and Accuracy of Genomic Prediction: Coat Colour, Milk-Fat Percentage, and Type in Holstein Cattle as Contrasting Model Traits. PLoS Genet. 6: e1001139.
- Heath S C. Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. Am. J. Hum. Genet. 61: 748–760.
- Jarquín D, Crossa J, Lacaze X, Cheyron P D, Daucourt J. A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theor. Appl. Genet. 127: 595–607.
- Jia Y, Jannink J-L. Multiple-Trait Genomic Selection Methods Increase Genetic Value Prediction Accuracy. Genetics 192: 1513–1522.
- Kang H M, Sul J H, Zaitlen N A, Kong S, Freimer N B. Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42: 348–354.
- Kass R E, Raftery A E. Bayes factors. J. Am. Stat. Assoc. 90: 773–795.
- Kennedy B, Quinton M, Van Arendonk J. Estimation of effects of single genes on quantitative traits. J. Anim. Sci. 70: 2000–2012.
- Kruglyak L. Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat. Genet. 22: 139–144.
- Legarra A, Misztal I. Technical note: Computing strategies in genome-wide selection. J. Dairy Sci. 91: 360–366.
- Legarra A, Ricardi A, Filangi O. GS3: Genomic Selection, Gibbs Sampling, Gauss-Seidel (and BayesCp). .
- Legarra A, Croiseau P, Sanchez M P, Teyssèdre S, Sallé G. A comparison of methods for whole-genome QTL mapping using dense markers in four livestock species. Genet. Sel. Evol. 47: 6.
- Legarra A, Vitezica Z G. Genetic evaluation with major genes and polygenic inheritance when some animals are not genotyped using gene content multiple-trait BLUP. Genet. Sel. Evol. 47: 89.
- Lourenco D L, Tsuruta S, Fragomeni B O, Masuda Y, Aguilar I. Genetic evaluation using single-step genomic best linear unbiased predictor in American Angus. J. Anim. Sci. 93: 2653–2662.
- Maier R, Moser G, Chen G-B, Ripke S, Absher D. Joint Analysis of Psychiatric Disorders Increases Accuracy of Risk Prediction for Schizophrenia, Bipolar Disorder, and Major Depressive Disorder. Am. J. Hum. Genet. 96: 283–294.
- Misztal I, Tsuruta S, Strabel T, Auvray B, Druet T. BLUPF90 and related programs (BGF90), pp. 28–07 in: 7th World Congress on Genetics Applied to Livestock Production, CD-ROM Communication, Montpellier, France.. .
- Moser G, Lee S H, Hayes B J, Goddard M E, Wray N R. Simultaneous Discovery, Estimation and Prediction Analysis of Complex Traits Using a Bayesian Mixture Model. PLoS Genet. 11: e1004969.
- Nagamine Y, Pong-Wong R, Navarro P, Vitart V, Hayward C. Localising Loci underlying Complex Trait Variation Using Regional Genomic Relationship Mapping. PLoS One 7: e46501.
- Parker Gaddis K L, Cole J B, Clay J S, Maltecca C. Genomic selection for producer-recorded health event data in US dairy cattle. J. Dairy Sci. 97: 3190–3199.
- Pérez-Enciso M, Varona L. Quantitative trait loci mapping in F2 crosses between outbred lines. Genetics 155: 391–405.
- Ricard A. Does heterozygosity at the DMRT3 gene make French trotters better racers?. Genet. Sel. Evol. 47: 10.
- Strandén I, Garrick D J. Technical note: Derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal merit. J. Dairy Sci. 92: 2971–2975.
- Teyssèdre S, Elsen J-M, Ricard A. Statistical distributions of test statistics used for quantitative trait association mapping in structured populations. Genet. Sel. Evol. 44: 32.
- Tsuruta S, Misztal I, Aguilar I, Lawlor T. Multiple-trait genomic evaluation of linear type traits using genomic and phenotypic data in US Holsteins. J. Dairy Sci. 94: 4198–4204.
- VanRaden P M. Efficient Methods to Compute Genomic Predictions. J. Dairy Sci. 91: 4414–4423.
- Varona L, García-Cortés L A, Pérez-Enciso M. Bayes factors for detection of quantitative trait loci. Genet. Sel. Evol. 33: 133–152.
- Varona L. Understanding the use of Bayes factor for testing candidate genes. J. Anim. Breed. Genet. 127: 16–25.
- Wakefield J. Bayes factors for genome‐wide association studies: comparison with P‐values. Genet. Epidemiol. 33: 79–86.
- Wakefield J. Commentary: Genome-wide significance thresholds via Bayes factors. Int. J. Epidemiol. 41: 286–291.
- Wang H, Misztal I, Aguilar I, Legarra A, Muir W. Genome-wide association mapping including phenotypes from relatives without genotypes. Genet. Res. 94: 73–83.
- Wang T, Chen Y-P P, Bowman P J, Goddard M E, Hayes B J. A hybrid expectation maximisation and MCMC sampling algorithm to implement Bayesian mixture model based genomic prediction and QTL mapping. BMC Genomics 17: 744.
- Yang J, Benyamin B, McEvoy B P, Gordon S, Henders A K. Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42: 565–569.
Use Nutrition Calculator
Check if your horse's diet meets their nutrition requirements with our easy-to-use tool Check your horse's diet with our easy-to-use tool
Talk to a Nutritionist
Discuss your horse's feeding plan with our experts over a free phone consultation Discuss your horse's diet over a phone consultation
Submit Diet Evaluation
Get a customized feeding plan for your horse formulated by our equine nutritionists Get a custom feeding plan formulated by our nutritionists