IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v21y2024i7p867-d1427726.html
   My bibliography  Save this article

Random Forest and Feature Importance Measures for Discriminating the Most Influential Environmental Factors in Predicting Cardiovascular and Respiratory Diseases

Author

Listed:
  • Francesco Cappelli

    (DIBAF Department, University of Tuscia, 01100 Viterbo, Italy)

  • Gianfranco Castronuovo

    (School of Engineering, University of Basilicata, Viale dell’Ateneo Lucano 10, 85100 Potenza, Italy)

  • Salvatore Grimaldi

    (DIBAF Department, University of Tuscia, 01100 Viterbo, Italy)

  • Vito Telesca

    (School of Engineering, University of Basilicata, Viale dell’Ateneo Lucano 10, 85100 Potenza, Italy)

Abstract

Background: Several studies suggest that environmental and climatic factors are linked to the risk of mortality due to cardiovascular and respiratory diseases; however, it is still unclear which are the most influential ones. This study sheds light on the potentiality of a data-driven statistical approach by providing a case study analysis. Methods: Daily admissions to the emergency room for cardiovascular and respiratory diseases are jointly analyzed with daily environmental and climatic parameter values (temperature, atmospheric pressure, relative humidity, carbon monoxide, ozone, particulate matter, and nitrogen dioxide). The Random Forest (RF) model and feature importance measure (FMI) techniques (permutation feature importance (PFI), Shapley Additive exPlanations (SHAP) feature importance, and the derivative-based importance measure ( κ A L E )) are applied for discriminating the role of each environmental and climatic parameter. Data are pre-processed to remove trend and seasonal behavior using the Seasonal Trend Decomposition (STL) method and preliminary analyzed to avoid redundancy of information. Results: The RF performance is encouraging, being able to predict cardiovascular and respiratory disease admissions with a mean absolute relative error of 0.04 and 0.05 cases per day, respectively. Feature importance measures discriminate parameter behaviors providing importance rankings. Indeed, only three parameters (temperature, atmospheric pressure, and carbon monoxide) were responsible for most of the total prediction accuracy. Conclusions: Data-driven and statistical tools, like the feature importance measure, are promising for discriminating the role of environmental and climatic factors in predicting the risk related to cardiovascular and respiratory diseases. Our results reveal the potential of employing these tools in public health policy applications for the development of early warning systems that address health risks associated with climate change, and improving disease prevention strategies.

Suggested Citation

  • Francesco Cappelli & Gianfranco Castronuovo & Salvatore Grimaldi & Vito Telesca, 2024. "Random Forest and Feature Importance Measures for Discriminating the Most Influential Environmental Factors in Predicting Cardiovascular and Respiratory Diseases," IJERPH, MDPI, vol. 21(7), pages 1-21, July.
  • Handle: RePEc:gam:jijerp:v:21:y:2024:i:7:p:867-:d:1427726
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/21/7/867/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/21/7/867/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stephen F Weng & Jenna Reps & Joe Kai & Jonathan M Garibaldi & Nadeem Qureshi, 2017. "Can machine-learning improve cardiovascular risk prediction using routine clinical data?," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-14, April.
    2. Jonathan A. Patz & Diarmid Campbell-Lendrum & Tracey Holloway & Jonathan A. Foley, 2005. "Impact of regional climate change on human health," Nature, Nature, vol. 438(7066), pages 310-317, November.
    3. Matteo Scortichini & Manuela De Sario & Francesca K. De’Donato & Marina Davoli & Paola Michelozzi & Massimo Stafoggia, 2018. "Short-Term Effects of Heat on Mortality and Effect Modification by Air Pollution in 25 Italian Cities," IJERPH, MDPI, vol. 15(8), pages 1-12, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Molini, A. & Talkner, P. & Katul, G.G. & Porporato, A., 2011. "First passage time statistics of Brownian motion with purely time dependent drift and diffusion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(11), pages 1841-1852.
    2. Denis Maragno & Michele Dalla Fontana & Francesco Musco, 2020. "Mapping Heat Stress Vulnerability and Risk Assessment at the Neighborhood Scale to Drive Urban Adaptation Planning," Sustainability, MDPI, vol. 12(3), pages 1-16, February.
    3. Bing Li & Zhifeng Liu & Ying Nan & Shengnan Li & Yanmin Yang, 2018. "Comparative Analysis of Urban Heat Island Intensities in Chinese, Russian, and DPRK Regions across the Transnational Urban Agglomeration of the Tumen River in Northeast Asia," Sustainability, MDPI, vol. 10(8), pages 1-16, July.
    4. Mirza Rizwan Sajid & Bader A. Almehmadi & Waqas Sami & Mansour K. Alzahrani & Noryanti Muhammad & Christophe Chesneau & Asif Hanif & Arshad Ali Khan & Ahmad Shahbaz, 2021. "Development of Nonlaboratory-Based Risk Prediction Models for Cardiovascular Diseases Using Conventional and Machine Learning Approaches," IJERPH, MDPI, vol. 18(23), pages 1-16, November.
    5. Michael Tong & Berhanu Wondmagegn & Jianjun Xiang & Alana Hansen & Keith Dear & Dino Pisaniello & Blesson Varghese & Jianguo Xiao & Le Jian & Benjamin Scalley & Monika Nitschke & John Nairn & Hilary B, 2022. "Hospitalization Costs of Respiratory Diseases Attributable to Temperature in Australia and Projections for Future Costs in the 2030s and 2050s under Climate Change," IJERPH, MDPI, vol. 19(15), pages 1-16, August.
    6. Nicolas Taconet & Aurélie Méjean & Céline Guivarch, 2020. "Influence of climate change impacts and mitigation costs on inequality between countries," Climatic Change, Springer, vol. 160(1), pages 15-34, May.
    7. Jaewon Kwak & Huiseong Noh & Soojun Kim & Vijay P. Singh & Seung Jin Hong & Duckgil Kim & Keonhaeng Lee & Narae Kang & Hung Soo Kim, 2014. "Future Climate Data from RCP 4.5 and Occurrence of Malaria in Korea," IJERPH, MDPI, vol. 11(10), pages 1-19, October.
    8. Mariani, Fabio & Pérez-Barahona, Agustín & Raffin, Natacha, 2010. "Life expectancy and the environment," Journal of Economic Dynamics and Control, Elsevier, vol. 34(4), pages 798-815, April.
    9. Louise Bedsworth, 2012. "California’s local health agencies and the state’s climate adaptation strategy," Climatic Change, Springer, vol. 111(1), pages 119-133, March.
    10. Salvatore Tedesco & Martina Andrulli & Markus Åkerlund Larsson & Daniel Kelly & Antti Alamäki & Suzanne Timmons & John Barton & Joan Condell & Brendan O’Flynn & Anna Nordström, 2021. "Comparison of Machine Learning Techniques for Mortality Prediction in a Prospective Cohort of Older Adults," IJERPH, MDPI, vol. 18(23), pages 1-18, December.
    11. Ajay Dev & Sanjay Kumar Malik, 2021. "Artificial Bee Colony Optimized Deep Neural Network Model for Handling Imbalanced Stroke Data: ABC-DNN for Prediction of Stroke," International Journal of E-Health and Medical Communications (IJEHMC), IGI Global, vol. 12(5), pages 67-83, September.
    12. Menconi, M.E. & Giordano, S. & Grohmann, D., 2022. "Revisiting global food production and consumption patterns by developing resilient food systems for local communities," Land Use Policy, Elsevier, vol. 119(C).
    13. Xiaoguang Chen & Madhu Khanna & Lu Yang, 2022. "The impacts of temperature on Chinese food processing firms," Australian Journal of Agricultural and Resource Economics, Australian Agricultural and Resource Economics Society, vol. 66(2), pages 256-279, April.
    14. Alper Ozpinar, 2023. "A Hyper-Integrated Mobility as a Service (MaaS) to Gamification and Carbon Market Enterprise Architecture Framework for Sustainable Environment," Energies, MDPI, vol. 16(5), pages 1-22, March.
    15. Flückiger, Matthias & Ludwig, Markus, 2022. "Temperature and risk of diarrhoea among children in Sub-Saharan Africa," World Development, Elsevier, vol. 160(C).
    16. Feihan Lu & Yao Zheng & Harrington Cleveland & Chris Burton & David Madigan, 2018. "Bayesian hierarchical vector autoregressive models for patient-level predictive modeling," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-27, December.
    17. Nicholas A. Mailloux & Colleen P. Henegan & Dorothy Lsoto & Kristen P. Patterson & Paul C. West & Jonathan A. Foley & Jonathan A. Patz, 2021. "Climate Solutions Double as Health Interventions," IJERPH, MDPI, vol. 18(24), pages 1-15, December.
    18. SangHyeok Lee & Donghyun Kim, 2022. "Multidisciplinary Understanding of the Urban Heating Problem and Mitigation: A Conceptual Framework for Urban Planning," IJERPH, MDPI, vol. 19(16), pages 1-15, August.
    19. Shinji Otani & Satomi Funaki Ishizu & Toshio Masumoto & Hiroki Amano & Youichi Kurozawa, 2021. "The Effect of Minimum and Maximum Air Temperatures in the Summer on Heat Stroke in Japan: A Time-Stratified Case-Crossover Study," IJERPH, MDPI, vol. 18(4), pages 1-12, February.
    20. Laetitia H. M. Schmitt & Hilary M. Graham & Piran C. L. White, 2016. "Economic Evaluations of the Health Impacts of Weather-Related Extreme Events: A Scoping Review," IJERPH, MDPI, vol. 13(11), pages 1-19, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:21:y:2024:i:7:p:867-:d:1427726. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.