IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i9p4943-d549537.html
   My bibliography  Save this article

The Comprehensive Machine Learning Analytics for Heart Failure

Author

Listed:
  • Chao-Yu Guo

    (Institute of Public Health, School of Medicine, National Yang-Ming University, Taipei 112, Taiwan
    Institute of Public Health, School of Medicine, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan)

  • Min-Yang Wu

    (Institute of Public Health, School of Medicine, National Yang-Ming University, Taipei 112, Taiwan
    Institute of Public Health, School of Medicine, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan)

  • Hao-Min Cheng

    (Institute of Public Health, School of Medicine, National Yang-Ming University, Taipei 112, Taiwan
    Institute of Public Health, School of Medicine, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
    Center for Evidence-Based Medicine, Veteran General Hospital, Taipei 112, Taiwan
    Department of Medicine, School of Medicine, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan)

Abstract

Background : Early detection of heart failure is the basis for better medical treatment and prognosis. Over the last decades, both prevalence and incidence rates of heart failure have increased worldwide, resulting in a significant global public health issue. However, an early diagnosis is not an easy task because symptoms of heart failure are usually non-specific. Therefore, this study aims to develop a risk prediction model for incident heart failure through a machine learning-based predictive model. Although African Americans have a higher risk of incident heart failure among all populations, few studies have developed a heart failure risk prediction model for African Americans. Methods : This research implemented the Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression, support vector machine, random forest, and Extreme Gradient Boosting (XGBoost) to establish the Jackson Heart Study’s predictive model. In the analysis of real data, missing data are problematic when building a predictive model. Here, we evaluate predictors’ inclusion with various missing rates and different missing imputation strategies to discover the optimal analytics. Results : According to hundreds of models that we examined, the best predictive model was the XGBoost that included variables with a missing rate of less than 30 percent, and we imputed missing values by non-parametric random forest imputation. The optimal XGBoost machine demonstrated an Area Under Curve (AUC) of 0.8409 to predict heart failure for the Jackson Heart Study. Conclusion : This research identifies variations of diabetes medication as the most crucial risk factor for heart failure compared to the complete cases approach that failed to discover this phenomenon.

Suggested Citation

  • Chao-Yu Guo & Min-Yang Wu & Hao-Min Cheng, 2021. "The Comprehensive Machine Learning Analytics for Heart Failure," IJERPH, MDPI, vol. 18(9), pages 1-17, May.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:9:p:4943-:d:549537
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/9/4943/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/9/4943/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zvjezdana Gvozdanović & Nikolina Farčić & Hrvoje Šimić & Vikica Buljanović & Lea Gvozdanović & Sven Katalinić & Stana Pačarić & Domagoj Gvozdanović & Željka Dujmić & Blaženka Miškić & Ivana Barać & Na, 2021. "The Impact of Education, COVID-19 and Risk Factors on the Quality of Life in Patients with Type 2 Diabetes," IJERPH, MDPI, vol. 18(5), pages 1-14, February.
    2. Ramon Casanova & Santiago Saldana & Sean L Simpson & Mary E Lacy & Angela R Subauste & Chad Blackshear & Lynne Wagenknecht & Alain G Bertoni, 2016. "Prediction of Incident Diabetes in the Jackson Heart Study Using High-Dimensional Machine Learning," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-12, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Micheal O. Olusanya & Ropo Ebenezer Ogunsakin & Meenu Ghai & Matthew Adekunle Adeleke, 2022. "Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach," IJERPH, MDPI, vol. 19(21), pages 1-19, November.
    2. Božica Lovrić & Harolt Placento & Nikolina Farčić & Metka Lipič Baligač & Štefica Mikšić & Marin Mamić & Tihomir Jovanović & Hrvoje Vidić & Sandra Karabatić & Sabina Cviljević & Lada Zibar & Ivan Vuko, 2022. "Association between Health Literacy and Prevalence of Obesity, Arterial Hypertension, and Diabetes Mellitus," IJERPH, MDPI, vol. 19(15), pages 1-14, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:9:p:4943-:d:549537. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.