IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i15p9756-d882899.html
   My bibliography  Save this article

An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors

Author

Listed:
  • Cheuk-Kay Sun

    (Division of Hepatology and Gastroenterology, Department of Internal Medicine, Shin Kong Wu Ho-Su Memorial Hospital, Taipei 11101, Taiwan
    Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 24205, Taiwan
    School of Medicine, Fu Jen Catholic University, New Taipei City 24205, Taiwan
    School of Medicine, Taipei Medical University, Taipei 11031, Taiwan)

  • Yun-Xuan Tang

    (Department of Radiology, Shin Kong Wu Ho-Su Memorial Hospital, Taipei 11101, Taiwan
    Department of Medical Imaging and Radiological Technology, Yuanpei University of Medical Technology, Hsinchu 30015, Taiwan)

  • Tzu-Chi Liu

    (Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 24205, Taiwan)

  • Chi-Jie Lu

    (Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 24205, Taiwan
    Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City 24205, Taiwan
    Department of Information Management, Fu Jen Catholic University, New Taipei City 24205, Taiwan)

Abstract

This study aimed to investigate the important predictors related to predicting positive mammographic findings based on questionnaire-based demographic and obstetric/gynecological parameters using the proposed integrated machine learning (ML) scheme. The scheme combines the benefits of two well-known ML algorithms, namely, least absolute shrinkage and selection operator (Lasso) logistic regression and extreme gradient boosting (XGB), to provide adequate prediction for mammographic anomalies in high-risk individuals and the identification of significant risk factors. We collected questionnaire data on 18 breast-cancer-related risk factors from women who participated in a national mammographic screening program between January 2017 and December 2020 at a single tertiary referral hospital to correlate with their mammographic findings. The acquired data were retrospectively analyzed using the proposed integrated ML scheme. Based on the data from 21,107 valid questionnaires, the results showed that the Lasso logistic regression models with variable combinations generated by XGB could provide more effective prediction results. The top five significant predictors for positive mammography results were younger age, breast self-examination, older age at first childbirth, nulliparity, and history of mammography within 2 years, suggesting a need for timely mammographic screening for women with these risk factors.

Suggested Citation

  • Cheuk-Kay Sun & Yun-Xuan Tang & Tzu-Chi Liu & Chi-Jie Lu, 2022. "An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors," IJERPH, MDPI, vol. 19(15), pages 1-17, August.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:15:p:9756-:d:882899
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/15/9756/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/15/9756/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Khishigsuren Davagdorj & Van Huy Pham & Nipon Theera-Umpon & Keun Ho Ryu, 2020. "XGBoost-Based Framework for Smoking-Induced Noncommunicable Disease Prediction," IJERPH, MDPI, vol. 17(18), pages 1-22, September.
    2. John Tomkinson, 2019. "Age at first birth and subsequent fertility: The case of adolescent mothers in France and England and Wales," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 40(27), pages 761-798.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yung-Chuan Huang & Yu-Chen Cheng & Mao-Jhen Jhou & Mingchih Chen & Chi-Jie Lu, 2023. "Integrated Machine Learning Decision Tree Model for Risk Evaluation in Patients with Non-Valvular Atrial Fibrillation When Taking Different Doses of Dabigatran," IJERPH, MDPI, vol. 20(3), pages 1-15, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jet Wildeman & Jeroen Smits & Sandor Schrijner, 2023. "Ethnic Variation in Fertility Preferences in Sub-Saharan Africa," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(4), pages 1-23, August.
    2. Ann Garbett & Brienna Perelli‐Harris & Sarah Neal, 2021. "The Untold Story of 50 Years of Adolescent Fertility in West Africa: A Cohort Perspective on the Quantum, Timing, and Spacing of Adolescent Childbearing," Population and Development Review, The Population Council, Inc., vol. 47(1), pages 7-40, March.
    3. Marie-Caroline Compans, 2021. "Late motherhood, late fatherhood, and permanent childlessness: Trends by educational level and cohorts (1950–1970) in France," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 45(10), pages 329-344.
    4. Kwang Ho Park & Erdenebileg Batbaatar & Yongjun Piao & Nipon Theera-Umpon & Keun Ho Ryu, 2021. "Deep Learning Feature Extraction Approach for Hematopoietic Cancer Subtype Classification," IJERPH, MDPI, vol. 18(4), pages 1-24, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:15:p:9756-:d:882899. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.