IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v17y2020i21p7923-d436380.html
   My bibliography  Save this article

Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance

Author

Listed:
  • Davide Barbieri

    (Department of Biomedical and Specialty Surgical Sciences, Faculty of Medicine, Pharmacy and Prevention, University of Ferrara, 44121 Ferrara, Italy
    From 1 November 2020 the name of Department will be Neuroscience and Rehabilitation.)

  • Nitesh Chawla

    (Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USA)

  • Luciana Zaccagni

    (Department of Biomedical and Specialty Surgical Sciences, Faculty of Medicine, Pharmacy and Prevention, University of Ferrara, 44121 Ferrara, Italy
    Biomedical Sport Studies Center, University of Ferrara, 44123 Ferrara, Italy)

  • Tonći Grgurinović

    (Polyclinic for Occupational Health and Sports of Zagreb Sports Association with Laboratory of Medical Biochemistry, 10000 Zagreb, Croatia)

  • Jelena Šarac

    (Centre for Applied Bioanthropology, Institute for Anthropological Research, 10000 Zagreb, Croatia)

  • Miran Čoklo

    (Centre for Applied Bioanthropology, Institute for Anthropological Research, 10000 Zagreb, Croatia)

  • Saša Missoni

    (Institute for Anthropological Research, 10000 Zagreb, Croatia
    School of Medicine, Josip Juraj Strossmayer University of Osijek, 31000 Osijek, Croatia)

Abstract

Cardiovascular diseases are the main cause of death worldwide. The aim of the present study is to verify the performances of a data mining methodology in the evaluation of cardiovascular risk in athletes, and whether the results may be used to support clinical decision making. Anthropometric (height and weight), demographic (age and sex) and biomedical (blood pressure and pulse rate) data of 26,002 athletes were collected in 2012 during routine sport medical examinations, which included electrocardiography at rest. Subjects were involved in competitive sport practice, for which medical clearance was needed. Outcomes were negative for the largest majority, as expected in an active population. Resampling was applied to balance positive/negative class ratio. A decision tree and logistic regression were used to classify individuals as either at risk or not. The receiver operating characteristic curve was used to assess classification performances. Data mining and resampling improved cardiovascular risk assessment in terms of increased area under the curve. The proposed methodology can be effectively applied to biomedical data in order to optimize clinical decision making, and—at the same time—minimize the amount of unnecessary examinations.

Suggested Citation

  • Davide Barbieri & Nitesh Chawla & Luciana Zaccagni & Tonći Grgurinović & Jelena Šarac & Miran Čoklo & Saša Missoni, 2020. "Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance," IJERPH, MDPI, vol. 17(21), pages 1-9, October.
  • Handle: RePEc:gam:jijerp:v:17:y:2020:i:21:p:7923-:d:436380
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/17/21/7923/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/17/21/7923/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Lukas Tanner & Mark Schreiber & Jenny G H Low & Adrian Ong & Thomas Tolfvenstam & Yee Ling Lai & Lee Ching Ng & Yee Sin Leo & Le Thi Puong & Subhash G Vasudevan & Cameron P Simmons & Martin L Hibberd , 2008. "Decision Tree Algorithms Predict the Diagnosis and Outcome of Dengue Fever in the Early Phase of Illness," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 2(3), pages 1-9, March.
    2. Azra Ramezankhani & Omid Pournik & Jamal Shahrabi & Fereidoun Azizi & Farzad Hadaegh & Davood Khalili, 2016. "The Impact of Oversampling with SMOTE on the Performance of 3 Classifiers in Prediction of Type 2 Diabetes," Medical Decision Making, , vol. 36(1), pages 137-144, January.
    3. Paul H. Lee, 2014. "Resampling Methods Improve the Predictive Power of Modeling in Class-Imbalanced Datasets," IJERPH, MDPI, vol. 11(9), pages 1-14, September.
    4. Shaoyan Zhang & Christos Tjortjis & Xiaojun Zeng & Hong Qiao & Iain Buchan & John Keane, 2009. "Comparing data mining methods with logistic regression in childhood obesity prediction," Information Systems Frontiers, Springer, vol. 11(4), pages 449-460, September.
    5. Catherine K. Murphy, 2001. "Identifying Diagnostic Errors with Induced Decision Trees," Medical Decision Making, , vol. 21(5), pages 368-375, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Emanuela Gualdi-Russo & Luciana Zaccagni, 2021. "Physical Activity for Health and Wellness," IJERPH, MDPI, vol. 18(15), pages 1-6, July.
    2. Davide Barbieri & Enrico Giuliani & Anna Del Prete & Amanda Losi & Matteo Villani & Alberto Barbieri, 2021. "How Artificial Intelligence and New Technologies Can Help the Management of the COVID-19 Pandemic," IJERPH, MDPI, vol. 18(14), pages 1-10, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Songul Cinaroglu, 2020. "Modelling unbalanced catastrophic health expenditure data by using machine‐learning methods," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 27(4), pages 168-181, October.
    2. Tzong-Shiann Ho & Ting-Chia Weng & Jung-Der Wang & Hsieh-Cheng Han & Hao-Chien Cheng & Chun-Chieh Yang & Chih-Hen Yu & Yen-Jung Liu & Chien Hsiang Hu & Chun-Yu Huang & Ming-Hong Chen & Chwan-Chuen Kin, 2020. "Comparing machine learning with case-control models to identify confirmed dengue cases," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 14(11), pages 1-21, November.
    3. Sangshin Park & Anon Srikiatkhachorn & Siripen Kalayanarooj & Louis Macareo & Sharone Green & Jennifer F Friedman & Alan L Rothman, 2018. "Use of structural equation models to predict dengue illness phenotype," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 12(10), pages 1-14, October.
    4. Pierfrancesco Alaimo Di Loro & Daria Scacciatelli & Giovanna Tagliaferri, 2023. "2-step Gradient Boosting approach to selectivity bias correction in tax audit: an application to the VAT gap in Italy," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(1), pages 237-270, March.
    5. Carlos Magno Sousa & Ewaldo Santana & Marcus Vinicius Lopes & Guilherme Lima & Luana Azoubel & Érika Carneiro & Allan Kardec Barros & Nilviane Pires, 2019. "Development of a Computational Model to Predict Excess Body Fat in Adolescents through Low Cost Variables," IJERPH, MDPI, vol. 16(16), pages 1-12, August.
    6. Marco Vriens & Nathan Bosch & Chad Vidden & Jason Talwar, 2022. "Prediction and profitability in market segmentation typing tools," Journal of Marketing Analytics, Palgrave Macmillan, vol. 10(4), pages 360-389, December.
    7. Nida Shahid & Tim Rappon & Whitney Berta, 2019. "Applications of artificial neural networks in health care organizational decision-making: A scoping review," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-22, February.
    8. Cheong Kim & Francis Joseph Costello & Kun Chang Lee & Yuan Li & Chenyao Li, 2019. "Predicting Factors Affecting Adolescent Obesity Using General Bayesian Network and What-If Analysis," IJERPH, MDPI, vol. 16(23), pages 1-18, November.
    9. Phung Khanh Lam & Tran Van Ngoc & Truong Thi Thu Thuy & Nguyen Thi Hong Van & Tran Thi Nhu Thuy & Dong Thi Hoai Tam & Nguyen Minh Dung & Nguyen Thi Hanh Tien & Nguyen Tan Thanh Kieu & Cameron Simmons , 2017. "The value of daily platelet counts for predicting dengue shock syndrome: Results from a prospective observational study of 2301 Vietnamese children with dengue," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 11(4), pages 1-20, April.
    10. Yanchao Liu, 2022. "bsnsing: A Decision Tree Induction Method Based on Recursive Optimal Boolean Rule Composition," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 2908-2929, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:17:y:2020:i:21:p:7923-:d:436380. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.