IDEAS home Printed from https://ideas.repec.org/a/eee/thpobi/v132y2020icp47-59.html
   My bibliography  Save this article

Inferring trait-specific similarity among individuals from molecular markers and phenotypes with Bayesian regression

Author

Listed:
  • Gianola, Daniel
  • Fernando, Rohan L.
  • Schön, Chris-Carolin

Abstract

Modeling covariance structure based on genetic similarity between pairs of relatives plays an important role in evolutionary, quantitative and statistical genetics. Historically, genetic similarity between individuals has been quantified from pedigrees via the probability that randomly chosen homologous alleles between individuals are identical by descent (IBD). At present, however, many genetic analyses rely on molecular markers, with realized measures of genomic similarity replacing IBD-based expected similarities. Animal and plant breeders, for example, now employ marker-based genomic relationship matrices between individuals in prediction models and in estimation of genome-based heritability coefficients. Phenotypes convey information about genetic similarity as well. For instance, if phenotypic values are at least partially the result of the action of quantitative trait loci, one would expect the former to inform about the latter, as in genome-wide association studies. Statistically, a non-trivial conditional distribution of unknown genetic similarities, given phenotypes, is to be expected. A Bayesian formalism is presented here that applies to whole-genome regression methods where some genetic similarity matrix, e.g., a genomic relationship matrix, can be defined. Our Bayesian approach, based on phenotypes and markers, converts prior (markers only) expected similarity into trait-specific posterior similarity. A simulation illustrates situations under which effective Bayesian learning from phenotypes occurs. Pinus and wheat data sets were used to demonstrate applicability of the concept in practice. The methodology applies to a wide class of Bayesian linear regression models, it extends to the multiple-trait domain, and can also be used to develop phenotype-guided similarity kernels in prediction problems.

Suggested Citation

  • Gianola, Daniel & Fernando, Rohan L. & Schön, Chris-Carolin, 2020. "Inferring trait-specific similarity among individuals from molecular markers and phenotypes with Bayesian regression," Theoretical Population Biology, Elsevier, vol. 132(C), pages 47-59.
  • Handle: RePEc:eee:thpobi:v:132:y:2020:i:c:p:47-59
    DOI: 10.1016/j.tpb.2019.11.008
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580919301972
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2019.11.008?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Xiaochen Sun & Long Qu & Dorian J Garrick & Jack C M Dekkers & Rohan L Fernando, 2012. "A Fast EM Algorithm for BayesA-Like Prediction of Genomic Breeding Values," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-9, November.
    2. Xiaolei Liu & Meng Huang & Bin Fan & Edward S Buckler & Zhiwu Zhang, 2016. "Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies," PLOS Genetics, Public Library of Science, vol. 12(2), pages 1-24, February.
    3. Park, Trevor & Casella, George, 2008. "The Bayesian Lasso," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 681-686, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ganwen Zhang & Jianini Zhao & Jieru Wang & Guo Lin & Lin Li & Fengfei Ban & Meiting Zhu & Yangjun Wen & Jin Zhang, 2024. "An Improved Expectation–Maximization Bayesian Algorithm for GWAS," Mathematics, MDPI, vol. 12(13), pages 1-14, June.
    2. Cox Lwaka Tamba & Yuan-Li Ni & Yuan-Ming Zhang, 2017. "Iterative sure independence screening EM-Bayesian LASSO algorithm for multi-locus genome-wide association studies," PLOS Computational Biology, Public Library of Science, vol. 13(1), pages 1-20, January.
    3. Niloy Biswas & Anirban Bhattacharya & Pierre E. Jacob & James E. Johndrow, 2022. "Coupling‐based convergence assessment of some Gibbs samplers for high‐dimensional Bayesian regression with shrinkage priors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 973-996, July.
    4. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    5. Armagan, Artin & Dunson, David, 2011. "Sparse variational analysis of linear mixed models for large data sets," Statistics & Probability Letters, Elsevier, vol. 81(8), pages 1056-1062, August.
    6. Martin Feldkircher & Florian Huber & Gary Koop & Michael Pfarrhofer, 2022. "APPROXIMATE BAYESIAN INFERENCE AND FORECASTING IN HUGE‐DIMENSIONAL MULTICOUNTRY VARs," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 63(4), pages 1625-1658, November.
    7. Eliaz, Kfir & Spiegler, Ran, 2022. "On incentive-compatible estimators," Games and Economic Behavior, Elsevier, vol. 132(C), pages 204-220.
    8. Oguzhan Cepni & I. Ethem Guney & Norman R. Swanson, 2020. "Forecasting and nowcasting emerging market GDP growth rates: The role of latent global economic policy uncertainty and macroeconomic data surprise factors," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(1), pages 18-36, January.
    9. Hauzenberger, Niko, 2021. "Flexible Mixture Priors for Large Time-varying Parameter Models," Econometrics and Statistics, Elsevier, vol. 20(C), pages 87-108.
    10. Korobilis, Dimitris, 2015. "Quantile forecasts of inflation under model uncertainty," MPRA Paper 64341, University Library of Munich, Germany.
    11. Bernardi, Mauro & Costola, Michele, 2019. "High-dimensional sparse financial networks through a regularised regression model," SAFE Working Paper Series 244, Leibniz Institute for Financial Research SAFE.
    12. Damien Rousselière, 2019. "A Flexible Approach to Age Dependence in Organizational Mortality: Comparing the Life Duration for Cooperative and Non-Cooperative Enterprises Using a Bayesian Generalized Additive Discrete Time Survi," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 17(4), pages 829-855, December.
    13. Chakraborty, Sounak, 2012. "Bayesian multiple response kernel regression model for high dimensional data and its practical applications in near infrared spectroscopy," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2742-2755.
    14. Virginia X. He & Matt P. Wand, 2024. "Bayesian generalized additive model selection including a fast variational option," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 108(3), pages 639-668, September.
    15. Ji, Yonggang & Lin, Nan & Zhang, Baoxue, 2012. "Model selection in binary and tobit quantile regression using the Gibbs sampler," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 827-839.
    16. Hu, Guanyu, 2021. "Spatially varying sparsity in dynamic regression models," Econometrics and Statistics, Elsevier, vol. 17(C), pages 23-34.
    17. Lee Anthony & Caron Francois & Doucet Arnaud & Holmes Chris, 2012. "Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-31, January.
    18. Jan Prüser, 2021. "Forecasting US inflation using Markov dimension switching," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(3), pages 481-499, April.
    19. Michael Pfarrhofer & Anna Stelzer, 2019. "High-frequency and heteroskedasticity identification in multicountry models: Revisiting spillovers of monetary shocks," Papers 1912.03158, arXiv.org, revised Dec 2024.
    20. Luis Castro-Martín & Maria del Mar Rueda & Ramón Ferri-García, 2020. "Inference from Non-Probability Surveys with Statistical Matching and Propensity Score Adjustment Using Modern Prediction Techniques," Mathematics, MDPI, vol. 8(6), pages 1-19, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:thpobi:v:132:y:2020:i:c:p:47-59. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/intelligence .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.