IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v9y2009i4p547-570.html
   My bibliography  Save this article

Bootstrap assessment of the stability of multivariable models

Author

Listed:
  • Patrick Royston

    (MRC Clinical Trials Unit)

  • Willi Sauerbrei

    (Institute for Medical Biometry and Medical Informatics, Freiburg University Medical Center)

Abstract

Assessing the instability of a multivariable model is important but is rarely done in practice. Model instability occurs when selected predictors--and for multivariable fractional polynomial modeling, selected functions of continuous predictors--are sensitive to small changes in the data. Bootstrap analysis is a useful technique for investigating variations among selected models in samples drawn at random with replacement. Such samples mimic datasets that are structurally similar to that under study and that could plausibly have arisen instead. The bootstrap inclusion fraction of a candidate variable usefully indicates the importance of the variable. We describe Stata tools for stability analysis in the context of the mfp command for multivariable model building. We offer practical guidance and illustrate the application of the tools to a study in prostate cancer. Copyright 2009 by StataCorp LP.

Suggested Citation

  • Patrick Royston & Willi Sauerbrei, 2009. "Bootstrap assessment of the stability of multivariable models," Stata Journal, StataCorp LP, vol. 9(4), pages 547-570, December.
  • Handle: RePEc:tsj:stataj:v:9:y:2009:i:4:p:547-570
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=st0177
    Download Restriction: no

    File URL: http://www.stata-journal.com/software/sj9-4/st0177/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Willi Sauerbrei, 1999. "The Use of Resampling Methods to Simplify Regression Models in Medical Statistics," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(3), pages 313-329.
    2. Harald Binder & Willi Sauerbrei, 2009. "Stability analysis of an additive spline model for respiratory health data by using knot removal," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 58(5), pages 577-600, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ivana Blažková & Ondřej Dvouletý, 2022. "Zombies: Who are they and how do firms become zombies?," Journal of Small Business Management, Taylor & Francis Journals, vol. 60(1), pages 119-145, January.
    2. Liu, Zicheng & Lesselier, Dominique & Sudret, Bruno & Wiart, Joe, 2020. "Surrogate modeling based on resampled polynomial chaos expansions," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    3. Patrick Royston, 2012. "Tools to simulate realistic censored survival-time distributions," Stata Journal, StataCorp LP, vol. 12(4), pages 639-654, December.
    4. Dvouletý, Ondřej, 2018. "How to analyse determinants of entrepreneurship and self-employment at the country level? A methodological contribution," Journal of Business Venturing Insights, Elsevier, vol. 9(C), pages 92-99.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hapfelmeier Alexander & Hothorn Torsten & Riediger Carina & Ulm Kurt, 2014. "Estimation of a Predictor’s Importance by Random Forests When There Is Missing Data: RISK Prediction in Liver Surgery using Laboratory Data," The International Journal of Biostatistics, De Gruyter, vol. 10(2), pages 165-183, November.
    2. Sauerbrei, W. & Meier-Hirmer, C. & Benner, A. & Royston, P., 2006. "Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs," Computational Statistics & Data Analysis, Elsevier, vol. 50(12), pages 3464-3485, August.
    3. Hapfelmeier, A. & Ulm, K., 2013. "A new variable selection approach using Random Forests," Computational Statistics & Data Analysis, Elsevier, vol. 60(C), pages 50-69.
    4. Toshiki Doi & Suguru Yamamoto & Takatoshi Morinaga & Ken-ei Sada & Noriaki Kurita & Yoshihiro Onishi, 2015. "Risk Score to Predict 1-Year Mortality after Haemodialysis Initiation in Patients with Stage 5 Chronic Kidney Disease under Predialysis Nephrology Care," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-14, June.
    5. Patrick Royston & Willi Sauerbrei, 2007. "Multivariable modeling with cubic regression splines: A principled approach," Stata Journal, StataCorp LP, vol. 7(1), pages 45-70, February.
    6. Dunkler, Daniela & Sauerbrei, Willi & Heinze, Georg, 2016. "Global, Parameterwise and Joint Shrinkage Factor Estimation," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 69(i08).
    7. Harald Binder & Willi Sauerbrei, 2009. "Stability analysis of an additive spline model for respiratory health data by using knot removal," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 58(5), pages 577-600, December.
    8. Simone P Rauh & Martijn W Heymans & Anitra D M Koopman & Giel Nijpels & Coen D Stehouwer & Barbara Thorand & Wolfgang Rathmann & Christa Meisinger & Annette Peters & Tonia de las Heras Gala & Charlott, 2017. "Predicting glycated hemoglobin levels in the non-diabetic general population: Development and validation of the DIRECT-DETECT prediction model - a DIRECT study," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-13, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:9:y:2009:i:4:p:547-570. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.