IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0091693.html
   My bibliography  Save this article

Enhancing Genome-Enabled Prediction by Bagging Genomic BLUP

Author

Listed:
  • Daniel Gianola
  • Kent A Weigel
  • Nicole Krämer
  • Alessandra Stella
  • Chris-Carolin Schön

Abstract

We examined whether or not the predictive ability of genomic best linear unbiased prediction (GBLUP) could be improved via a resampling method used in machine learning: bootstrap aggregating sampling (“bagging”). In theory, bagging can be useful when the predictor has large variance or when the number of markers is much larger than sample size, preventing effective regularization. After presenting a brief review of GBLUP, bagging was adapted to the context of GBLUP, both at the level of the genetic signal and of marker effects. The performance of bagging was evaluated with four simulated case studies including known or unknown quantitative trait loci, and an application was made to real data on grain yield in wheat planted in four environments. A metric aimed to quantify candidate-specific cross-validation uncertainty was proposed and assessed; as expected, model derived theoretical reliabilities bore no relationship with cross-validation accuracy. It was found that bagging can ameliorate predictive performance of GBLUP and make it more robust against over-fitting. Seemingly, 25–50 bootstrap samples was enough to attain reasonable predictions as well as stable measures of individual predictive mean squared errors.

Suggested Citation

  • Daniel Gianola & Kent A Weigel & Nicole Krämer & Alessandra Stella & Chris-Carolin Schön, 2014. "Enhancing Genome-Enabled Prediction by Bagging Genomic BLUP," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-18, April.
  • Handle: RePEc:plo:pone00:0091693
    DOI: 10.1371/journal.pone.0091693
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0091693
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0091693&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0091693?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Inoue, Atsushi & Kilian, Lutz, 2008. "How Useful Is Bagging in Forecasting Economic Time Series? A Case Study of U.S. Consumer Price Inflation," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 511-522, June.
    2. Gustavo de los Campos & Ana I Vazquez & Rohan Fernando & Yann C Klimentidis & Daniel Sorensen, 2013. "Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor," PLOS Genetics, Public Library of Science, vol. 9(7), pages 1-15, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eric Hillebrand & Huiyu Huang & Tae-Hwy Lee & Canlin Li, 2018. "Using the Entire Yield Curve in Forecasting Output and Inflation," Econometrics, MDPI, vol. 6(3), pages 1-27, August.
    2. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    3. Julieta Fuentes & Pilar Poncela & Julio Rodríguez, 2015. "Sparse Partial Least Squares in Time Series for Macroeconomic Forecasting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(4), pages 576-595, June.
    4. Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
    5. Rama K. Malladi, 2024. "Benchmark Analysis of Machine Learning Methods to Forecast the U.S. Annual Inflation Rate During a High-Decile Inflation Period," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 335-375, July.
    6. Medeiros, Marcelo C. & Vasconcelos, Gabriel F.R., 2016. "Forecasting macroeconomic variables in data-rich environments," Economics Letters, Elsevier, vol. 138(C), pages 50-52.
    7. Szafranek, Karol, 2019. "Bagged neural networks for forecasting Polish (low) inflation," International Journal of Forecasting, Elsevier, vol. 35(3), pages 1042-1059.
    8. Tommaso Proietti, 2016. "On the Selection of Common Factors for Macroeconomic Forecasting," Advances in Econometrics, in: Dynamic Factor Models, volume 35, pages 593-628, Emerald Group Publishing Limited.
    9. Antonello Loddo & Shawn Ni & Dongchu Sun, 2011. "Selection of Multivariate Stochastic Volatility Models via Bayesian Stochastic Search," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(3), pages 342-355, July.
    10. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    11. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    12. Samuels, Jon D. & Sekkel, Rodrigo M., 2017. "Model Confidence Sets and forecast combination," International Journal of Forecasting, Elsevier, vol. 33(1), pages 48-60.
    13. Korobilis, Dimitris, 2013. "Hierarchical shrinkage priors for dynamic regressions with many predictors," International Journal of Forecasting, Elsevier, vol. 29(1), pages 43-59.
    14. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers 2019-4, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    15. Francesco Audrino & Marcelo C. Medeiros, 2011. "Modeling and forecasting short‐term interest rates: The benefits of smooth regimes, macroeconomic variables, and bagging," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 26(6), pages 999-1022, September.
    16. Barbara Rossi, 2019. "Forecasting in the presence of instabilities: How do we know whether models predict well and how to improve them," Economics Working Papers 1711, Department of Economics and Business, Universitat Pompeu Fabra, revised Jul 2021.
    17. Luo, Qin & Bu, Jinfeng & Xu, Weiju & Huang, Dengshi, 2023. "Stock market volatility prediction: Evidence from a new bagging model," International Review of Economics & Finance, Elsevier, vol. 87(C), pages 445-456.
    18. Granziera, Eleonora & Sekhposyan, Tatevik, 2019. "Predicting relative forecasting performance: An empirical investigation," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1636-1657.
    19. Lenza, Michele & Moutachaker, Inès & Paredes, Joan, 2023. "Density forecasts of inflation: a quantile regression forest approach," CEPR Discussion Papers 18298, C.E.P.R. Discussion Papers.
    20. Gang Cheng & Sicong Wang & Yuhong Yang, 2015. "Forecast Combination under Heavy-Tailed Errors," Econometrics, MDPI, vol. 3(4), pages 1-28, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0091693. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.