Prediction of Attrition in Large Longitudinal Studies: Tree-based methods versus Multinomial Logistic Models

Authors

  • Best, Katherine Laura
  • Speyer, Lydia Gabriela
  • Murray, Aja Louise
  • Ushakova, Anastasia

Abstract

Identifying predictors of attrition is essential for designing longitudinal studies so that attrition bias can be minimised, and for identifying the variables that can be used as auxiliary variables in statistical techniques that help correct for non-random drop-out. This paper provides a comparative overview of predictive techniques that can be used to model attrition and to identify important risk factors that help in its prediction. Logistic regression and several tree-based machine learning methods were applied to Wave 2 dropout in an illustrative sample of 5000 individuals from a large UK longitudinal study, Understanding Society. Each method was evaluated based on accuracy, AUC-ROC, plausibility of key assumptions and interpretability. Our results suggest a 10% improvement in accuracy for random forest compared to logistic regression methods. However, given the differences in estimation procedures, we suggest that both models could be used in conjunction to provide the most comprehensive understanding of attrition predictors.

Suggested Citation

  • Best, Katherine Laura & Speyer, Lydia Gabriela & Murray, Aja Louise & Ushakova, Anastasia, 2021. "Prediction of Attrition in Large Longitudinal Studies: Tree-based methods versus Multinomial Logistic Models," SocArXiv tyszr, Center for Open Science.
  • Handle: RePEc:osf:socarx:tyszr
    DOI: 10.31219/osf.io/tyszr

    Download full text from publisher

    File URL: https://osf.io/download/603d4b59035cf702bfc831d3/
    Download Restriction: no

