IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v11y2009i4d10.1007_s10796-009-9157-0.html
   My bibliography  Save this article

Comparing data mining methods with logistic regression in childhood obesity prediction

Author

Listed:
  • Shaoyan Zhang

    (University of Manchester)

  • Christos Tjortjis

    (University of Western Macedonia
    University of Ioannina)

  • Xiaojun Zeng

    (University of Manchester)

  • Hong Qiao

    (University of Manchester)

  • Iain Buchan

    (University of Manchester)

  • John Keane

    (University of Manchester)

Abstract

The epidemiological question of concern here is “can young children at risk of obesity be identified from their early growth records?” Pilot work using logistic regression to predict overweight and obese children demonstrated relatively limited success. Hence we investigate the incorporation of non-linear interactions to help improve accuracy of prediction; by comparing the result of logistic regression with those of six mature data mining techniques. The contributions of this paper are as follows: a) a comparison of logistic regression with six data mining techniques: specifically, for the prediction of overweight and obese children at 3 years using data recorded at birth, 6 weeks, 8 months and 2 years respectively; b) improved accuracy of prediction: prediction at 8 months accuracy is improved very slightly, in this case by using neural networks, whereas for prediction at 2 years obtained accuracy is improved by over 10%, in this case by using Bayesian methods. It has also been shown that incorporation of non-linear interactions could be important in epidemiological prediction, and that data mining techniques are becoming sufficiently well established to offer the medical research community a valid alternative to logistic regression.

Suggested Citation

  • Shaoyan Zhang & Christos Tjortjis & Xiaojun Zeng & Hong Qiao & Iain Buchan & John Keane, 2009. "Comparing data mining methods with logistic regression in childhood obesity prediction," Information Systems Frontiers, Springer, vol. 11(4), pages 449-460, September.
  • Handle: RePEc:spr:infosf:v:11:y:2009:i:4:d:10.1007_s10796-009-9157-0
    DOI: 10.1007/s10796-009-9157-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-009-9157-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-009-9157-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kweku-Muata Osei-Bryson & Kendall Giles, 2006. "Splitting methods for decision tree induction: An exploration of the relative performance of two entropy-based families," Information Systems Frontiers, Springer, vol. 8(3), pages 195-209, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nida Shahid & Tim Rappon & Whitney Berta, 2019. "Applications of artificial neural networks in health care organizational decision-making: A scoping review," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-22, February.
    2. Carlos Magno Sousa & Ewaldo Santana & Marcus Vinicius Lopes & Guilherme Lima & Luana Azoubel & Érika Carneiro & Allan Kardec Barros & Nilviane Pires, 2019. "Development of a Computational Model to Predict Excess Body Fat in Adolescents through Low Cost Variables," IJERPH, MDPI, vol. 16(16), pages 1-12, August.
    3. Cheong Kim & Francis Joseph Costello & Kun Chang Lee & Yuan Li & Chenyao Li, 2019. "Predicting Factors Affecting Adolescent Obesity Using General Bayesian Network and What-If Analysis," IJERPH, MDPI, vol. 16(23), pages 1-18, November.
    4. Davide Barbieri & Nitesh Chawla & Luciana Zaccagni & Tonći Grgurinović & Jelena Šarac & Miran Čoklo & Saša Missoni, 2020. "Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance," IJERPH, MDPI, vol. 17(21), pages 1-9, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yunus Atalan & Emirhan Hacıoğlu & Müzeyyen Ertürk & Faik Gürsoy & Gradimir V. Milovanović, 2024. "Novel algorithms based on forward-backward splitting technique: effective methods for regression and classification," Journal of Global Optimization, Springer, vol. 90(4), pages 869-890, December.
    2. Francis Kofi Andoh-Baidoo & Kweku-Muata Osei-Bryson & Kwasi Amoako-Gyampah, 2012. "Effects of firm and IT characteristics on the value of e-commerce initiatives: An inductive theoretical framework," Information Systems Frontiers, Springer, vol. 14(2), pages 237-259, April.
    3. Chulhwan Chris Bang, 2015. "Information systems frontiers: Keyword analysis and classification," Information Systems Frontiers, Springer, vol. 17(1), pages 217-237, February.
    4. Gunjan Mansingh & Lila Rao & Kweku-Muata Osei-Bryson & Annette Mills, 2015. "Profiling internet banking users: A knowledge discovery in data mining process model based approach," Information Systems Frontiers, Springer, vol. 17(1), pages 193-215, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:11:y:2009:i:4:d:10.1007_s10796-009-9157-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.