IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i22p15171-d975548.html
   My bibliography  Save this article

Risk Association of Liver Cancer and Hepatitis B with Tree Ensemble and Lifestyle Features

Author

Listed:
  • Eunji Koh

    (School of Industrial and Management Engineering, Korea University, 145 Anamro, Seongbuk-gu, Seoul 02841, Republic of Korea)

  • Younghoon Kim

    (Department of Industrial and Management Systems Engineering, Kyung Hee University, 1732, Deogyeong-daero, Giheung-gu, Yongin-si 17104, Republic of Korea)

Abstract

The second-largest cause of death by cancer in Korea is liver cancer, which leads to acute morbidity and mortality. Hepatitis B is the most common cause of liver cancer. About 70% of liver cancer patients suffer from hepatitis B. Early risk association of liver cancer and hepatitis B can help prevent fatal conditions. We propose a risk association method for liver cancer and hepatitis B with only lifestyle features. The diagnostic features were excluded to reduce the cost of gathering medical data. The data source is the Korea National Health and Nutrition Examination Survey (KNHANES) from 2007 to 2019. We use 3872 and 4640 subjects for liver cancer and hepatitis B model, respectively. Random forest is employed to determine functional relationships between liver diseases and lifestyle features. The performance of our proposed method was compared with six machine learning methods. The results showed the proposed method outperformed the other methods in the area under the receiver operator characteristic curve of 0.8367. The promising results confirm the superior performance of the proposed method and show that the proposed method with only lifestyle features provides significant advantages, potentially reducing the cost of detecting patients who require liver health care in advance.

Suggested Citation

  • Eunji Koh & Younghoon Kim, 2022. "Risk Association of Liver Cancer and Hepatitis B with Tree Ensemble and Lifestyle Features," IJERPH, MDPI, vol. 19(22), pages 1-16, November.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:22:p:15171-:d:975548
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/22/15171/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/22/15171/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stephen F Weng & Jenna Reps & Joe Kai & Jonathan M Garibaldi & Nadeem Qureshi, 2017. "Can machine-learning improve cardiovascular risk prediction using routine clinical data?," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-14, April.
    2. Andrea Saltelli, 2002. "Sensitivity Analysis for Importance Assessment," Risk Analysis, John Wiley & Sons, vol. 22(3), pages 579-590, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mirza Rizwan Sajid & Bader A. Almehmadi & Waqas Sami & Mansour K. Alzahrani & Noryanti Muhammad & Christophe Chesneau & Asif Hanif & Arshad Ali Khan & Ahmad Shahbaz, 2021. "Development of Nonlaboratory-Based Risk Prediction Models for Cardiovascular Diseases Using Conventional and Machine Learning Approaches," IJERPH, MDPI, vol. 18(23), pages 1-16, November.
    2. Makam, Vaishno Devi & Millossovich, Pietro & Tsanakas, Andreas, 2021. "Sensitivity analysis with χ2-divergences," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 372-383.
    3. S. Cucurachi & E. Borgonovo & R. Heijungs, 2016. "A Protocol for the Global Sensitivity Analysis of Impact Assessment Models in Life Cycle Assessment," Risk Analysis, John Wiley & Sons, vol. 36(2), pages 357-377, February.
    4. Marco Percoco, 2006. "A Note on the Inoperability Input‐Output Model," Risk Analysis, John Wiley & Sons, vol. 26(3), pages 589-594, June.
    5. Wenbin Ruan & Zhenzhou Lu & Longfei Tian, 2013. "A modified variance-based importance measure and its solution by state dependent parameter," Journal of Risk and Reliability, , vol. 227(1), pages 3-15, February.
    6. Salvatore Tedesco & Martina Andrulli & Markus Åkerlund Larsson & Daniel Kelly & Antti Alamäki & Suzanne Timmons & John Barton & Joan Condell & Brendan O’Flynn & Anna Nordström, 2021. "Comparison of Machine Learning Techniques for Mortality Prediction in a Prospective Cohort of Older Adults," IJERPH, MDPI, vol. 18(23), pages 1-18, December.
    7. Ajay Dev & Sanjay Kumar Malik, 2021. "Artificial Bee Colony Optimized Deep Neural Network Model for Handling Imbalanced Stroke Data: ABC-DNN for Prediction of Stroke," International Journal of E-Health and Medical Communications (IJEHMC), IGI Global, vol. 12(5), pages 67-83, September.
    8. Kunz, Nathan & Chesney, Thomas & Trautrims, Alexander & Gold, Stefan, 2023. "Adoption and transferability of joint interventions to fight modern slavery in food supply chains," International Journal of Production Economics, Elsevier, vol. 258(C).
    9. Yun, Wanying & Lu, Zhenzhou & Feng, Kaixuan & Li, Luyi, 2019. "An elaborate algorithm for analyzing the Borgonovo moment-independent sensitivity by replacing the probability density function estimation with the probability estimation," Reliability Engineering and System Safety, Elsevier, vol. 189(C), pages 99-108.
    10. Feihan Lu & Yao Zheng & Harrington Cleveland & Chris Burton & David Madigan, 2018. "Bayesian hierarchical vector autoregressive models for patient-level predictive modeling," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-27, December.
    11. Gonnet, Gaston H. & Stewart, John & Lafleur, Joseph & Keith, Stephen & McLellan, Mark & Jiang-Gorsline, David & Snider, Tim, 2021. "Analysis of feature influence on Covid-19 Death Rate Per Country Using a Novel Orthogonalization Technique," MetaArXiv 4kw2n, Center for Open Science.
    12. Abdur Rahim Hamidi & Jiangwei Wang & Shiyao Guo & Zhongping Zeng, 2020. "Flood vulnerability assessment using MOVE framework: a case study of the northern part of district Peshawar, Pakistan," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 101(2), pages 385-408, March.
    13. Kamoonpuri, Sana Zehra & Sengar, Anita, 2023. "Hi, May AI help you? An analysis of the barriers impeding the implementation and use of artificial intelligence-enabled virtual assistants in retail," Journal of Retailing and Consumer Services, Elsevier, vol. 72(C).
    14. Wenbin Ruan & Zhenzhou Lu & Pengfei Wei, 2013. "Estimation of conditional moment by moving least squares and its application for importance analysis," Journal of Risk and Reliability, , vol. 227(6), pages 641-650, December.
    15. Pesenti, Silvana M. & Millossovich, Pietro & Tsanakas, Andreas, 2019. "Reverse sensitivity testing: What does it take to break the model?," European Journal of Operational Research, Elsevier, vol. 274(2), pages 654-670.
    16. Shinya Suzuki & Takeshi Yamashita & Tsuyoshi Sakama & Takuto Arita & Naoharu Yagi & Takayuki Otsuka & Hiroaki Semba & Hiroto Kano & Shunsuke Matsuno & Yuko Kato & Tokuhisa Uejima & Yuji Oikawa & Minor, 2019. "Comparison of risk models for mortality and cardiovascular events between machine learning and conventional logistic regression analysis," PLOS ONE, Public Library of Science, vol. 14(9), pages 1-14, September.
    17. Li, Haihe & Wang, Pan & Huang, Xiaoyu & Zhang, Zheng & Zhou, Changcong & Yue, Zhufeng, 2021. "Vine copula-based parametric sensitivity analysis of failure probability-based importance measure in the presence of multidimensional dependencies," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    18. Emanuele Borgonovo, 2006. "Measuring Uncertainty Importance: Investigation and Comparison of Alternative Approaches," Risk Analysis, John Wiley & Sons, vol. 26(5), pages 1349-1361, October.
    19. Jung, WoongHee & Taflanidis, Alexandros A., 2023. "Efficient global sensitivity analysis for high-dimensional outputs combining data-driven probability models and dimensionality reduction," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    20. C. L. Smith & E. Borgonovo, 2007. "Decision Making During Nuclear Power Plant Incidents—A New Approach to the Evaluation of Precursor Events," Risk Analysis, John Wiley & Sons, vol. 27(4), pages 1027-1042, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:22:p:15171-:d:975548. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.