Author
Listed:
- Lee, Chaewon
- Gates, Kathleen
Abstract
Machine learning (ML) has extended the scope of psychological research by enabling data-driven discovery of patterns in complex datasets, complementing traditional hypothesis-driven approaches and enriching individual-level prediction. As a principal subfield, supervised ML has advanced mental health diagnostics and behavior prediction through classification and regression tasks. However, the complexity of ML methodologies and the absence of established norms and standardized pipelines often limit its adoption among psychologists. Furthermore, the black-box nature of advanced ML algorithms obscures how decisions are made, making it difficult to identify the most influential variables. Automated ML (AutoML) addresses these challenges by automating key steps such as model selection and hyperparameter optimization, while enhancing interpretability through explainable AI. By streamlining workflows and improving efficiency, AutoML empowers users of all technical levels to implement advanced ML methods effectively. Despite its transformative potential, AutoML remains underutilized in psychological research, with no dedicated educational material available. This tutorial aims to bridge the gap by introducing AutoML to psychologists. We cover advanced AutoML methods, including combined algorithm selection and hyperparameter optimization (CASH), stacked ensemble generalization, and explainable AI. The utility of AutoML is demonstrated using the ‘H2O AutoML’ R package with publicly available psychological datasets, performing regression on multi-individual cross-sectional data and classification on single-individual time-series data. We also provide practical workarounds for ML methods currently unavailable in the package, allowing researchers to use alternative approaches when needed. These examples illustrate how AutoML democratizes ML, making it more accessible while providing advanced methodologies for psychological research.
Suggested Citation
Lee, Chaewon & Gates, Kathleen, 2025.
"Automated Machine Learning for Classification and Regression: A Tutorial for Psychologists,"
OSF Preprints
j4xuq_v1, Center for Open Science.
Handle:
RePEc:osf:osfxxx:j4xuq_v1
DOI: 10.31219/osf.io/j4xuq_v1
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:j4xuq_v1. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.