IDEAS home Printed from https://ideas.repec.org/a/sae/medema/v21y2001i1p45-56.html
   My bibliography  Save this article

Prognostic Modeling with Logistic Regression Analysis

Author

Listed:
  • Ewout W. Steyerberg

    (Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands)

  • Marinus J. C. Eijkemans

    (Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands)

  • Frank E. Harrell Jr

    (Division of Biostatistics and Epidemiology, Department of Health Evaluation Sciences, University of Virginia, Charlottesville, Virginia)

  • J. Dik F. Habbema

    (Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands)

Abstract

Clinical decision making often requires estimates of the likelihood of a dichotomous outcome in individual patients. When empirical data are available, these estimates may well be obtained from a logistic regression model. Several strategies may be followed in the development of such a model. In this study, the authors compare alternative strategies in 23 small subsamples from a large data set of patients with an acute myocardial infarction, where they developed predictive models for 30-day mortality. Evaluations were performed in an independent part of the data set. Specifically, the authors studied the effect of coding of covariables and stepwise selection on discriminative ability of the resulting model, and the effect of statistical “shrinkage†techniques on calibration. As expected, dichotomization of continuous covariables implied a loss of information. Remarkably, stepwise selection resulted in less discriminating models compared to full models including all available covariables, even when more than half of these were randomly associated with the outcome. Using qualitative information on the sign of the effect of predictors slightly improved the predictive ability. Calibration improved when shrinkage was applied on the standard maximum likelihood estimates of the regression coefficients. In conclusion, a sensible strategy in small data sets is to apply shrinkage methods in full models that include well-coded predictors that are selected based on external information.

Suggested Citation

  • Ewout W. Steyerberg & Marinus J. C. Eijkemans & Frank E. Harrell Jr & J. Dik F. Habbema, 2001. "Prognostic Modeling with Logistic Regression Analysis," Medical Decision Making, , vol. 21(1), pages 45-56, February.
  • Handle: RePEc:sae:medema:v:21:y:2001:i:1:p:45-56
    DOI: 10.1177/0272989X0102100106
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0272989X0102100106
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0272989X0102100106?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Chris Chatfield, 1995. "Model Uncertainty, Data Mining and Statistical Inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 158(3), pages 419-444, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Phung Khanh Lam & Dong Thi Hoai Tam & Nguyen Minh Dung & Nguyen Thi Hanh Tien & Nguyen Tan Thanh Kieu & Cameron Simmons & Jeremy Farrar & Bridget Wills & Marcel Wolbers, 2015. "A Prognostic Model for Development of Profound Shock among Children Presenting with Dengue Shock Syndrome," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-13, May.
    2. Mohammad Mojtahedi & Sidney Newton & Jason Meding, 2017. "Predicting the resilience of transport infrastructure to a natural disaster using Cox’s proportional hazards regression model," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 85(2), pages 1119-1133, January.
    3. Areti Kontogianni & Dimitris Damigos & Michail Skourtos & Christos Tourkolias & Eleanor Denny & Ibon Galarraga & Steffen Kallbekken & Edin Lakić, 2021. "Model Validity and Transferability Informing Behavioral Energy Policies," Energies, MDPI, vol. 14(11), pages 1-20, May.
    4. Haridarshan Patel & Robert J DiDomenico & Katie J Suda & Glen T Schumock & Gregory S Calip & Todd A Lee, 2020. "Risk of cardiac events with azithromycin—A prediction model," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-11, October.
    5. Beatrice Asenso Barnieh & Li Jia & Massimo Menenti & Min Jiang & Jie Zhou & Yelong Zeng & Ali Bennour, 2021. "Modeling the Underlying Drivers of Natural Vegetation Occurrence in West Africa with Binary Logistic Regression Method," Sustainability, MDPI, vol. 13(9), pages 1-37, April.
    6. Fraser S Brown & Stella A Glasmacher & Patrick K A Kearns & Niall MacDougall & David Hunt & Peter Connick & Siddharthan Chandran, 2020. "Systematic review of prediction models in relapsing remitting multiple sclerosis," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-13, May.
    7. Tri-Long Nguyen & Géraldine Leguelinel-Blache & Jean-Marie Kinowski & Clarisse Roux-Marson & Marion Rougier & Jessica Spence & Yannick Le Manach & Paul Landais, 2017. "Improving medication safety: Development and impact of a multivariate model-based strategy to target high-risk patients," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-13, February.
    8. Layla Khoja & Maxwell Chipulu & Ranadeva Jayasekera, 2016. "Analysing corporate insolvency in the Gulf Cooperation Council using logistic regression and multidimensional scaling," Review of Quantitative Finance and Accounting, Springer, vol. 46(3), pages 483-518, April.
    9. Lisa Cherry & Darren Mollendor & Bill Eisenstein & Terri S. Hogue & Katharyn Peterman & John E. McCray, 2019. "Predicting Parcel-Scale Redevelopment Using Linear and Logistic Regression—the Berkeley Neighborhood Denver, Colorado Case Study," Sustainability, MDPI, vol. 11(7), pages 1-16, March.
    10. Luuk Wieske & Esther Witteveen & Camiel Verhamme & Daniela S Dettling-Ihnenfeldt & Marike van der Schaaf & Marcus J Schultz & Ivo N van Schaik & Janneke Horn, 2014. "Early Prediction of Intensive Care Unit–Acquired Weakness Using Easily Available Parameters: A Prospective Observational Study," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-8, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    2. Riccardo (Jack) Lucchetti & Luca Pedini, 2020. "ParMA: Parallelised Bayesian Model Averaging for Generalised Linear Models," Working Papers 2020:28, Department of Economics, University of Venice "Ca' Foscari".
    3. Robert Lehmann & Antje Weyh, 2016. "Forecasting Employment in Europe: Are Survey Results Helpful?," Journal of Business Cycle Research, Springer;Centre for International Research on Economic Tendency Surveys (CIRET), vol. 12(1), pages 81-117, September.
    4. Castle Jennifer L. & Doornik Jurgen A & Hendry David F., 2011. "Evaluating Automatic Model Selection," Journal of Time Series Econometrics, De Gruyter, vol. 3(1), pages 1-33, February.
    5. Lee, Yun Shin & Scholtes, Stefan, 2014. "Empirical prediction intervals revisited," International Journal of Forecasting, Elsevier, vol. 30(2), pages 217-234.
    6. Coleman, Stephen, 2005. "Testing Theories with Qualitative and Quantitative Predictions," MPRA Paper 105171, University Library of Munich, Germany.
    7. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    8. Brooks, Jeremy S., 2010. "The Buddha mushroom: Conservation behavior and the development of institutions in Bhutan," Ecological Economics, Elsevier, vol. 69(4), pages 779-795, February.
    9. Brian Knaeble & Seth Dutter, 2017. "Reversals of Least-Square Estimates and Model-Invariant Estimation for Directions of Unique Effects," The American Statistician, Taylor & Francis Journals, vol. 71(2), pages 97-105, April.
    10. Steven M. Shugan, 2002. "In Search of Data: An Editorial," Marketing Science, INFORMS, vol. 21(4), pages 369-377.
    11. Michael Schomaker & Christian Heumann, 2020. "When and when not to use optimal model averaging," Statistical Papers, Springer, vol. 61(5), pages 2221-2240, October.
    12. Clark, Todd & McCracken, Michael, 2013. "Advances in Forecast Evaluation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1107-1201, Elsevier.
    13. W. Robert Reed, 2009. "The Determinants Of U.S. State Economic Growth: A Less Extreme Bounds Analysis," Economic Inquiry, Western Economic Association International, vol. 47(4), pages 685-700, October.
    14. Nicholas Weller & Jeb Barnes, 2016. "Pathway Analysis and the Search for Causal Mechanisms," Sociological Methods & Research, , vol. 45(3), pages 424-457, August.
    15. S. P. Brooks & E. A. Catchpole & B. J. T. Morgan & M. P. Harris, 2002. "Bayesian methods for analysing ringing data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 29(1-4), pages 187-206.
    16. Nick Inglis & Bruce Vanstone & Tobias Hahn, 2019. "Modelling momentum winner/loser asymmetry: the sources of winner and loser returns in the ASX200 and S&P500," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 59(S1), pages 657-684, April.
    17. Ormerod, Paul, 2015. "The economics of radical uncertainty," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 9, pages 1-20.
    18. Sai Ding & John Knight, 2011. "Why has China Grown So Fast? The Role of Physical and Human Capital Formation," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 73(2), pages 141-174, April.
    19. M. Pir Bavaghar, 2015. "Deforestation modelling using logistic regression and GIS," Journal of Forest Science, Czech Academy of Agricultural Sciences, vol. 61(5), pages 193-199.
    20. Ke Yang & Langnan Chen & Fengping Tian, 2015. "Realized Volatility Forecast of Stock Index Under Structural Breaks," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 34(1), pages 57-82, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:medema:v:21:y:2001:i:1:p:45-56. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.