IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v15y2023i3p2754-d1056138.html
   My bibliography  Save this article

A Comparative Analysis of Machine Learning Models: A Case Study in Predicting Chronic Kidney Disease

Author

Listed:
  • Hasnain Iftikhar

    (Department of Mathematics, City University of Science and Information Technology Peshawar, Peshawar 25000, Pakistan
    Department of Statistics, Quaid-i-Azam University, Islamabad 44000, Pakistan)

  • Murad Khan

    (Department of Statistics, Abdul Wali Khan University Mardan, Mardan 23200, Pakistan)

  • Zardad Khan

    (Department of Statistics, Abdul Wali Khan University Mardan, Mardan 23200, Pakistan)

  • Faridoon Khan

    (Pakistan Institute of Development Economics, Islamabad 44000, Pakistan)

  • Huda M Alshanbari

    (Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia)

  • Zubair Ahmad

    (Department of Statistics, Quaid-i-Azam University, Islamabad 44000, Pakistan)

Abstract

In the modern world, chronic kidney disease is one of the most severe diseases that negatively affects human life. It is becoming a growing problem in both developed and underdeveloped countries. An accurate and timely diagnosis of chronic kidney disease is vital in preventing and treating kidney failure. The diagnosis of chronic kidney disease through history has been considered unreliable in many respects. To classify healthy people and people with chronic kidney disease, non-invasive methods like machine learning models are reliable and efficient. In our current work, we predict chronic kidney disease using different machine learning models, including logistic, probit, random forest, decision tree, k-nearest neighbor, and support vector machine with four kernel functions (linear, Laplacian, Bessel, and radial basis kernels). The dataset is a record taken as a case–control study containing chronic kidney disease patients from district Buner, Khyber Pakhtunkhwa, Pakistan. To compare the models in terms of classification and accuracy, we calculated different performance measures, including accuracy, Brier score, sensitivity, Youdent, specificity, and F1 score. The Diebold and Mariano test of comparable prediction accuracy was also conducted to determine whether there is a substantial difference in the accuracy measures of different predictive models. As confirmed by the results, the support vector machine with the Laplace kernel function outperforms all other models, while the random forest is competitive.

Suggested Citation

  • Hasnain Iftikhar & Murad Khan & Zardad Khan & Faridoon Khan & Huda M Alshanbari & Zubair Ahmad, 2023. "A Comparative Analysis of Machine Learning Models: A Case Study in Predicting Chronic Kidney Disease," Sustainability, MDPI, vol. 15(3), pages 1-13, February.
  • Handle: RePEc:gam:jsusta:v:15:y:2023:i:3:p:2754-:d:1056138
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/15/3/2754/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/15/3/2754/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ismail Shah & Hasnain Iftikhar & Sajid Ali, 2020. "Modeling and Forecasting Medium-Term Electricity Consumption Using Component Estimation Technique," Forecasting, MDPI, vol. 2(2), pages 1-17, May.
    2. Ismail Shah & Hasnain Iftikhar & Sajid Ali & Depeng Wang, 2019. "Short-Term Electricity Demand Forecasting Using Components Estimation Technique," Energies, MDPI, vol. 12(13), pages 1-17, July.
    3. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    4. Channa Jayasumana & Sarath Gunatilake & Priyantha Senanayake, 2014. "Glyphosate, Hard Water and Nephrotoxic Metals: Are They the Culprits Behind the Epidemic of Chronic Kidney Disease of Unknown Etiology in Sri Lanka?," IJERPH, MDPI, vol. 11(2), pages 1-23, February.
    5. Viet-Ha Nhu & Ataollah Shirzadi & Himan Shahabi & Sushant K. Singh & Nadhir Al-Ansari & John J. Clague & Abolfazl Jaafari & Wei Chen & Shaghayegh Miraki & Jie Dou & Chinh Luu & Krzysztof Górski & Binh, 2020. "Shallow Landslide Susceptibility Mapping: A Comparison between Logistic Model Tree, Logistic Regression, Naïve Bayes Tree, Artificial Neural Network, and Support Vector Machine Algorithms," IJERPH, MDPI, vol. 17(8), pages 1-30, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hasnain Iftikhar & Josue E. Turpo-Chaparro & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Forecasting Day-Ahead Electricity Prices for the Italian Electricity Market Using a New Decomposition—Combination Technique," Energies, MDPI, vol. 16(18), pages 1-23, September.
    2. Hasnain Iftikhar & Aimel Zafar & Josue E. Turpo-Chaparro & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Forecasting Day-Ahead Brent Crude Oil Prices Using Hybrid Combinations of Time Series Models," Mathematics, MDPI, vol. 11(16), pages 1-19, August.
    3. Hasnain Iftikhar & Josue E. Turpo-Chaparro & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Day-Ahead Electricity Demand Forecasting Using a Novel Decomposition Combination Method," Energies, MDPI, vol. 16(18), pages 1-22, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hasnain Iftikhar & Josue E. Turpo-Chaparro & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Forecasting Day-Ahead Electricity Prices for the Italian Electricity Market Using a New Decomposition—Combination Technique," Energies, MDPI, vol. 16(18), pages 1-23, September.
    2. Maciej Slowik & Wieslaw Urban, 2022. "Machine Learning Short-Term Energy Consumption Forecasting for Microgrids in a Manufacturing Plant," Energies, MDPI, vol. 15(9), pages 1-16, May.
    3. Hasnain Iftikhar & Nadeela Bibi & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Multiple Novel Decomposition Techniques for Time Series Forecasting: Application to Monthly Forecasting of Electricity Consumption in Pakistan," Energies, MDPI, vol. 16(6), pages 1-17, March.
    4. Muhammad Ahmar & Fahad Ali & Yuexiang Jiang & Mamdooh Alwetaishi & Sherif S. M. Ghoneim, 2022. "Households’ Energy Choices in Rural Pakistan," Energies, MDPI, vol. 15(9), pages 1-23, April.
    5. Nazila Pourhaji & Mohammad Asadpour & Ali Ahmadian & Ali Elkamel, 2022. "The Investigation of Monthly/Seasonal Data Clustering Impact on Short-Term Electricity Price Forecasting Accuracy: Ontario Province Case Study," Sustainability, MDPI, vol. 14(5), pages 1-14, March.
    6. Chatum Sankalpa & Somsak Kittipiyakul & Seksan Laitrakun, 2022. "Forecasting Short-Term Electricity Load Using Validated Ensemble Learning," Energies, MDPI, vol. 15(22), pages 1-30, November.
    7. Roozbeh Sadeghian Broujeny & Safa Ben Ayed & Mouadh Matalah, 2023. "Energy Consumption Forecasting in a University Office by Artificial Intelligence Techniques: An Analysis of the Exogenous Data Effect on the Modeling," Energies, MDPI, vol. 16(10), pages 1-21, May.
    8. Nahid Sultana & S. M. Zakir Hossain & Salma Hamad Almuhaini & Dilek Düştegör, 2022. "Bayesian Optimization Algorithm-Based Statistical and Machine Learning Approaches for Forecasting Short-Term Electricity Demand," Energies, MDPI, vol. 15(9), pages 1-26, May.
    9. Jasiński, Tomasz, 2022. "A new approach to modeling cycles with summer and winter demand peaks as input variables for deep neural networks," Renewable and Sustainable Energy Reviews, Elsevier, vol. 159(C).
    10. Hasnain Iftikhar & Aimel Zafar & Josue E. Turpo-Chaparro & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Forecasting Day-Ahead Brent Crude Oil Prices Using Hybrid Combinations of Time Series Models," Mathematics, MDPI, vol. 11(16), pages 1-19, August.
    11. Fang Guo & Shangyun Deng & Weijia Zheng & An Wen & Jinfeng Du & Guangshan Huang & Ruiyang Wang, 2022. "Short-Term Electricity Price Forecasting Based on the Two-Layer VMD Decomposition Technique and SSA-LSTM," Energies, MDPI, vol. 15(22), pages 1-20, November.
    12. Hasnain Iftikhar & Josue E. Turpo-Chaparro & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2023. "Day-Ahead Electricity Demand Forecasting Using a Novel Decomposition Combination Method," Energies, MDPI, vol. 16(18), pages 1-22, September.
    13. Anatoly A. Peresetsky & Ruslan I. Yakubov, 2017. "Autocorrelation in an unobservable global trend: does it help to forecast market returns?," International Journal of Computational Economics and Econometrics, Inderscience Enterprises Ltd, vol. 7(1/2), pages 152-169.
    14. Kelly Burns & Imad Moosa, 2017. "Demystifying the Meese–Rogoff puzzle: structural breaks or measures of forecasting accuracy?," Applied Economics, Taylor & Francis Journals, vol. 49(48), pages 4897-4910, October.
    15. João C. Claudio & Katja Heinisch & Oliver Holtemöller, 2020. "Nowcasting East German GDP growth: a MIDAS approach," Empirical Economics, Springer, vol. 58(1), pages 29-54, January.
    16. Antonio Rubia & Trino-Manuel Ñíguez, 2006. "Forecasting the conditional covariance matrix of a portfolio under long-run temporal dependence," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 25(6), pages 439-458.
    17. Athanasopoulos, George & de Carvalho Guillén, Osmani Teixeira & Issler, João Victor & Vahid, Farshid, 2011. "Model selection, estimation and forecasting in VAR models with short-run and long-run restrictions," Journal of Econometrics, Elsevier, vol. 164(1), pages 116-129, September.
    18. Hao Chen & Qiulan Wan & Yurong Wang, 2014. "Refined Diebold-Mariano Test Methods for the Evaluation of Wind Power Forecasting Models," Energies, MDPI, vol. 7(7), pages 1-14, July.
    19. Antonello D’Agostino & Kieran Mcquinn & Karl Whelan, 2012. "Are Some Forecasters Really Better Than Others?," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 44(4), pages 715-732, June.
    20. Cavit Pakel & Neil Shephard & Kevin Sheppard, 2009. "Nuisance parameters, composite likelihoods and a panel of GARCH models," Economics Papers 2009-W12, Economics Group, Nuffield College, University of Oxford.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:15:y:2023:i:3:p:2754-:d:1056138. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.