IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0240867.html
   My bibliography  Save this article

Selecting the best machine learning algorithm to support the diagnosis of Non-Alcoholic Fatty Liver Disease: A meta learner study

Author

Listed:
  • Paolo Sorino
  • Maria Gabriella Caruso
  • Giovanni Misciagna
  • Caterina Bonfiglio
  • Angelo Campanella
  • Antonella Mirizzi
  • Isabella Franco
  • Antonella Bianco
  • Claudia Buongiorno
  • Rosalba Liuzzi
  • Anna Maria Cisternino
  • Maria Notarnicola
  • Marisa Chiloiro
  • Giovanni Pascoschi
  • Alberto Rubén Osella
  • MICOL Group

Abstract

Background & aims: Liver ultrasound scan (US) use in diagnosing Non-Alcoholic Fatty Liver Disease (NAFLD) causes costs and waiting lists overloads. We aimed to compare various Machine learning algorithms with a Meta learner approach to find the best of these as a predictor of NAFLD. Methods: The study included 2970 subjects, 2920 constituting the training set and 50, randomly selected, used in the test phase, performing cross-validation. The best predictors were combined to create three models: 1) FLI plus GLUCOSE plus SEX plus AGE, 2) AVI plus GLUCOSE plus GGT plus SEX plus AGE, 3) BRI plus GLUCOSE plus GGT plus SEX plus AGE. Eight machine learning algorithms were trained with the predictors of each of the three models created. For these algorithms, the percent accuracy, variance and percent weight were compared. Results: The SVM algorithm performed better with all models. Model 1 had 68% accuracy, with 1% variance and an algorithm weight of 27.35; Model 2 had 68% accuracy, with 1% variance and an algorithm weight of 33.62 and Model 3 had 77% accuracy, with 1% variance and an algorithm weight of 34.70. Model 2 was the most performing, composed of AVI plus GLUCOSE plus GGT plus SEX plus AGE, despite a lower percentage of accuracy. Conclusion: A Machine Learning approach can support NAFLD diagnosis and reduce health costs. The SVM algorithm is easy to apply and the necessary parameters are easily retrieved in databases.

Suggested Citation

  • Paolo Sorino & Maria Gabriella Caruso & Giovanni Misciagna & Caterina Bonfiglio & Angelo Campanella & Antonella Mirizzi & Isabella Franco & Antonella Bianco & Claudia Buongiorno & Rosalba Liuzzi & Ann, 2020. "Selecting the best machine learning algorithm to support the diagnosis of Non-Alcoholic Fatty Liver Disease: A meta learner study," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-15, October.
  • Handle: RePEc:plo:pone00:0240867
    DOI: 10.1371/journal.pone.0240867
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0240867
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0240867&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0240867?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Karatzoglou, Alexandros & Meyer, David & Hornik, Kurt, 2006. "Support Vector Machines in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 15(i09).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benítez-Peña, Sandra & Blanquero, Rafael & Carrizosa, Emilio & Ramírez-Cobo, Pepa, 2024. "Cost-sensitive probabilistic predictions for support vector machines," European Journal of Operational Research, Elsevier, vol. 314(1), pages 268-279.
    2. Na Tang & Maoxiang Yuan & Zhijun Chen & Jian Ma & Rui Sun & Yide Yang & Quanyuan He & Xiaowei Guo & Shixiong Hu & Junhua Zhou, 2023. "Machine Learning Prediction Model of Tuberculosis Incidence Based on Meteorological Factors and Air Pollutants," IJERPH, MDPI, vol. 20(5), pages 1-17, February.
    3. Moro Russ A. & Härdle Wolfgang K. & Schäfer Dorothea, 2017. "Company rating with support vector machines," Statistics & Risk Modeling, De Gruyter, vol. 34(1-2), pages 55-67, June.
    4. Ana Patrícia Rocha & Hugo Miguel Pereira Choupina & Maria do Carmo Vilas-Boas & José Maria Fernandes & João Paulo Silva Cunha, 2018. "System for automatic gait analysis based on a single RGB-D camera," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-24, August.
    5. Phichhang Ou & Hengshan Wang, 2009. "Prediction of Stock Market Index Movement by Ten Data Mining Techniques," Modern Applied Science, Canadian Center of Science and Education, vol. 3(12), pages 1-28, December.
    6. Luca Longo, 2018. "Experienced mental workload, perception of usability, their interaction and impact on task performance," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-36, August.
    7. Courage Kamusoko & Jonah Gamba & Hitomi Murakami, 2014. "Mapping Woodland Cover in the Miombo Ecosystem: A Comparison of Machine Learning Classifiers," Land, MDPI, vol. 3(2), pages 1-17, June.
    8. Sotiropoulou, Kalliopi F. & Vavatsikos, Athanasios P. & Botsaris, Pantelis N., 2024. "A hybrid AHP-PROMETHEE II onshore wind farms multicriteria suitability analysis using kNN and SVM regression models in northeastern Greece," Renewable Energy, Elsevier, vol. 221(C).
    9. Perthame, Emeline & Forbes, Florence & Deleforge, Antoine, 2018. "Inverse regression approach to robust nonlinear high-to-low dimensional mapping," Journal of Multivariate Analysis, Elsevier, vol. 163(C), pages 1-14.
    10. Jindřich Špička, 2018. "How Do Agricultural Biogas Investments Affect Czech Farms?," Central European Business Review, Prague University of Economics and Business, vol. 2018(4), pages 34-60.
    11. Santiago José Elías Velazco & Franklin Galvão & Fabricio Villalobos & Paulo De Marco Júnior, 2017. "Using worldwide edaphic data to model plant species niches: An assessment at a continental extent," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-24, October.
    12. Christian Bunn & Peter Läderach & Oriana Ovalle Rivera & Dieter Kirschke, 2015. "A bitter cup: climate change profile of global production of Arabica and Robusta coffee," Climatic Change, Springer, vol. 129(1), pages 89-101, March.
    13. Samia Zaoui & Clovis Foguem & Dieudonné Tchuente & Samuel Fosso-Wamba & Bernard Kamsu-Foguem, 2023. "The Viability of Supply Chains with Interpretable Learning Systems: The Case of COVID-19 Vaccine Deliveries," Global Journal of Flexible Systems Management, Springer;Global Institute of Flexible Systems Management, vol. 24(4), pages 633-657, December.
    14. Akın, Melda, 2015. "A novel approach to model selection in tourism demand modeling," Tourism Management, Elsevier, vol. 48(C), pages 64-72.
    15. Li, Xinyi & Yao, Runming, 2020. "A machine-learning-based approach to predict residential annual space heating and cooling loads considering occupant behaviour," Energy, Elsevier, vol. 212(C).
    16. Oosterlinck, Dieter & Benoit, Dries F. & Baecke, Philippe, 2020. "From one-class to two-class classification by incorporating expert knowledge: Novelty detection in human behaviour," European Journal of Operational Research, Elsevier, vol. 282(3), pages 1011-1024.
    17. The-Hanh Pham & Jahmunah Vicnesh & Joel Koh En Wei & Shu Lih Oh & N. Arunkumar & Enas. W. Abdulhay & Edward J. Ciaccio & U. Rajendra Acharya, 2020. "Autism Spectrum Disorder Diagnostic System Using HOS Bispectrum with EEG Signals," IJERPH, MDPI, vol. 17(3), pages 1-15, February.
    18. repec:cup:judgdm:v:17:y:2022:i:3:p:598-627 is not listed on IDEAS
    19. Claudio Conversano & Elise Dusseldorp, 2017. "Modeling Threshold Interaction Effects Through the Logistic Classification Trunk," Journal of Classification, Springer;The Classification Society, vol. 34(3), pages 399-426, October.
    20. Liu, Zhen Jia, 2015. "Estudo cross-country sobre os fatores determinantes da crise financeira bancária," RAE - Revista de Administração de Empresas, FGV-EAESP Escola de Administração de Empresas de São Paulo (Brazil), vol. 55(5), September.
    21. Gregory Gadzinski & Alessio Castello, 2022. "Combining white box models, black box machines and human interventions for interpretable decision strategies," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 17(3), pages 598-627, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0240867. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.