IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i6p3317-d522643.html
   My bibliography  Save this article

Prediction of Type 2 Diabetes Based on Machine Learning Algorithm

Author

Listed:
  • Henock M. Deberneh

    (Department of Information and Communications Engineering, Myongji University, 116 Myongji-ro, Yongin, Gyeonggi 17058, Korea)

  • Intaek Kim

    (Department of Information and Communications Engineering, Myongji University, 116 Myongji-ro, Yongin, Gyeonggi 17058, Korea)

Abstract

Prediction of type 2 diabetes (T2D) occurrence allows a person at risk to take actions that can prevent onset or delay the progression of the disease. In this study, we developed a machine learning (ML) model to predict T2D occurrence in the following year (Y + 1) using variables in the current year (Y). The dataset for this study was collected at a private medical institute as electronic health records from 2013 to 2018. To construct the prediction model, key features were first selected using ANOVA tests, chi-squared tests, and recursive feature elimination methods. The resultant features were fasting plasma glucose (FPG), HbA1c, triglycerides, BMI, gamma-GTP, age, uric acid, sex, smoking, drinking, physical activity, and family history. We then employed logistic regression, random forest, support vector machine, XGBoost, and ensemble machine learning algorithms based on these variables to predict the outcome as normal (non-diabetic), prediabetes, or diabetes. Based on the experimental results, the performance of the prediction model proved to be reasonably good at forecasting the occurrence of T2D in the Korean population. The model can provide clinicians and patients with valuable predictive information on the likelihood of developing T2D. The cross-validation (CV) results showed that the ensemble models had a superior performance to that of the single models. The CV performance of the prediction models was improved by incorporating more medical history from the dataset.

Suggested Citation

  • Henock M. Deberneh & Intaek Kim, 2021. "Prediction of Type 2 Diabetes Based on Machine Learning Algorithm," IJERPH, MDPI, vol. 18(6), pages 1-14, March.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:6:p:3317-:d:522643
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/6/3317/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/6/3317/
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rosy Oh & Hong Kyu Lee & Youngmi Kim Pak & Man-Suk Oh, 2022. "An Interactive Online App for Predicting Diabetes via Machine Learning from Environment-Polluting Chemical Exposure Data," IJERPH, MDPI, vol. 19(10), pages 1-17, May.
    2. Yan Gao & Min Wang & Guogang Zhang & Lingjun Zhou & Jingming Luo & Lijue Liu, 2022. "Cluster-Based Ensemble Learning Model for Aortic Dissection Screening," IJERPH, MDPI, vol. 19(9), pages 1-14, May.
    3. Norma Latif Fitriyani & Muhammad Syafrudin & Siti Maghfirotul Ulyah & Ganjar Alfian & Syifa Latif Qolbiyani & Chuan-Kai Yang & Jongtae Rhee & Muhammad Anshari, 2023. "Performance Analysis and Assessment of Type 2 Diabetes Screening Scores in Patients with Non-Alcoholic Fatty Liver Disease," Mathematics, MDPI, vol. 11(10), pages 1-25, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:6:p:3317-:d:522643. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.