IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v21y2024i12p1694-d1547386.html
   My bibliography  Save this article

A Machine Learning Classification Model for Gastrointestinal Health in Cancer Survivors: Roles of Telomere Length and Social Determinants of Health

Author

Listed:
  • Claire J. Han

    (Center for Healthy Aging, Self-Management and Complex Care, College of Nursing, The Ohio State University, Columbus, OH 43210, USA
    The James: Cancer Treatment and Research Center, The Ohio State University, Columbus, OH 43210, USA)

  • Xia Ning

    (Clinical Informatics and Implementation Science, Biomedical Informatics (BMI), College of Medicine, The Ohio State University, Columbus, OH 43210, USA
    Computer Science and Engineering (CSE), College of Engineering, The Ohio State University, Columbus, OH 43210, USA)

  • Christin E. Burd

    (Departments of Molecular Genetics, Cancer Biology, and Genetics, The Ohio State University, Columbus, OH 43210, USA)

  • Fode Tounkara

    (The James: Cancer Treatment and Research Center, The Ohio State University, Columbus, OH 43210, USA
    Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, USA)

  • Matthew F. Kalady

    (Division of Colon and Rectal Surgery, Clinical Cancer Genetics Program, The James: Cancer Treatment and Research Center, The Ohio State University, Columbus, OH 43210, USA)

  • Anne M. Noonan

    (GI Medical Oncology Section, The James: Cancer Treatment and Research Center, The Ohio State University, Columbus, OH 43210, USA)

  • Diane Von Ah

    (Center for Healthy Aging, Self-Management and Complex Care, College of Nursing, The Ohio State University, Columbus, OH 43210, USA
    The James: Cancer Treatment and Research Center, The Ohio State University, Columbus, OH 43210, USA)

Abstract

Background: Gastrointestinal (GI) distress is prevalent and often persistent among cancer survivors, impacting their quality of life, nutrition, daily function, and mortality. GI health screening is crucial for preventing and managing this distress. However, accurate classification methods for GI health remain unexplored. We aimed to develop machine learning (ML) models to classify GI health status (better vs. worse) by incorporating biological aging and social determinants of health (SDOH) indicators in cancer survivors. Methods: We included 645 adult cancer survivors from the 1999–2002 NHANES survey. Using training and test datasets, we employed six ML models to classify GI health conditions (better vs. worse). These models incorporated leukocyte telomere length (TL), SDOH, and demographic/clinical data. Results: Among the ML models, the random forest (RF) performed the best, achieving a high area under the curve (AUC = 0.98) in the training dataset. The gradient boosting machine (GBM) demonstrated excellent classification performance with a high AUC (0.80) in the test dataset. TL, several socio-economic factors, cancer risk behaviors (including lifestyle choices), and inflammatory markers were associated with GI health. The most significant input features for better GI health in our ML models were longer TL and an annual household income above the poverty level, followed by routine physical activity, low white blood cell counts, and food security. Conclusions: Our findings provide valuable insights into classifying and identifying risk factors related to GI health, including biological aging and SDOH indicators. To enhance model predictability, further longitudinal studies and external clinical validations are necessary.

Suggested Citation

  • Claire J. Han & Xia Ning & Christin E. Burd & Fode Tounkara & Matthew F. Kalady & Anne M. Noonan & Diane Von Ah, 2024. "A Machine Learning Classification Model for Gastrointestinal Health in Cancer Survivors: Roles of Telomere Length and Social Determinants of Health," IJERPH, MDPI, vol. 21(12), pages 1-20, December.
  • Handle: RePEc:gam:jijerp:v:21:y:2024:i:12:p:1694-:d:1547386
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/21/12/1694/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/21/12/1694/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Claire J. Han & Fode Tounkara & Matthew Kalady & Anne M. Noonan & Natasha R. Burse & Electra D. Paskett & Diane Von Ah, 2023. "Risk Factors of Health-Related Quality of Life among Gastrointestinal Cancer Survivors in the U.S.: With a Focus on Social and Behavioral Determinants of Health (SBDH)," IJERPH, MDPI, vol. 20(17), pages 1-15, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:21:y:2024:i:12:p:1694-:d:1547386. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.