IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v11y2020i1d10.1038_s41467-020-18037-z.html
   My bibliography  Save this article

Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets

Author

Listed:
  • Marc-Andre Schulz

    (Aachen University)

  • B. T. Thomas Yeo

    (National University of Singapore
    National University of Singapore
    National University of Singapore)

  • Joshua T. Vogelstein

    (Johns Hopkins University
    Johns Hopkins University)

  • Janaina Mourao-Miranada

    (University College London
    University College London)

  • Jakob N. Kather

    (University Hospital RWTH Aachen
    German Cancer Research Center (DKFZ)
    German Cancer Research Center (DKFZ))

  • Konrad Kording

    (University of Pennsylvania)

  • Blake Richards

    (McGill University
    McGill University
    Canadian Institute for Advanced Research
    Mila - Quebec Artificial Intelligence Institute)

  • Danilo Bzdok

    (Mila - Quebec Artificial Intelligence Institute
    Commissariat à l’Energie Atomique (CEA) Saclay
    Institut National de Recherche en Informatique et en Automatique (INRIA)
    McGill University)

Abstract

Recently, deep learning has unlocked unprecedented success in various domains, especially using images, text, and speech. However, deep learning is only beneficial if the data have nonlinear relationships and if they are exploitable at available sample sizes. We systematically profiled the performance of deep, kernel, and linear models as a function of sample size on UKBiobank brain images against established machine learning references. On MNIST and Zalando Fashion, prediction accuracy consistently improves when escalating from linear models to shallow-nonlinear models, and further improves with deep-nonlinear models. In contrast, using structural or functional brain scans, simple linear models perform on par with more complex, highly parameterized models in age/sex prediction across increasing sample sizes. In sum, linear models keep improving as the sample size approaches ~10,000 subjects. Yet, nonlinearities for predicting common phenotypes from typical brain scans remain largely inaccessible to the examined kernel and deep learning methods.

Suggested Citation

  • Marc-Andre Schulz & B. T. Thomas Yeo & Joshua T. Vogelstein & Janaina Mourao-Miranada & Jakob N. Kather & Konrad Kording & Blake Richards & Danilo Bzdok, 2020. "Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets," Nature Communications, Nature, vol. 11(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-18037-z
    DOI: 10.1038/s41467-020-18037-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-020-18037-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-020-18037-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Andrea I. Luppi & Helena M. Gellersen & Zhen-Qi Liu & Alexander R. D. Peattie & Anne E. Manktelow & Ram Adapa & Adrian M. Owen & Lorina Naci & David K. Menon & Stavros I. Dimitriadis & Emmanuel A. Sta, 2024. "Systematic evaluation of fMRI data-processing pipelines for consistent functional connectomics," Nature Communications, Nature, vol. 15(1), pages 1-24, December.
    2. Winn-Nuñez, Emily T. & Griffin, Maryclare & Crawford, Lorin, 2024. "A simple approach for local and global variable importance in nonlinear regression models," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
    3. Jessica Dafflon & Pedro F. Da Costa & František Váša & Ricardo Pio Monti & Danilo Bzdok & Peter J. Hellyer & Federico Turkheimer & Jonathan Smallwood & Emily Jones & Robert Leech, 2022. "A guided multiverse study of neuroimaging analyses," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    4. Vieira, Bruno Hebling & Pamplona, Gustavo Santo Pedro & Fachinello, Karim & Silva, Alice Kamensek & Foss, Maria Paula & Salmon, Carlos Ernesto Garrido, 2022. "On the prediction of human intelligence from neuroimaging: A systematic review of methods and reporting," Intelligence, Elsevier, vol. 93(C).
    5. Jianzhong Chen & Angela Tam & Valeria Kebets & Csaba Orban & Leon Qi Rong Ooi & Christopher L. Asplund & Scott Marek & Nico U. F. Dosenbach & Simon B. Eickhoff & Danilo Bzdok & Avram J. Holmes & B. T., 2022. "Shared and unique brain network features predict cognitive, personality, and mental health scores in the ABCD study," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    6. S. Parker Singleton & Andrea I. Luppi & Robin L. Carhart-Harris & Josephine Cruzat & Leor Roseman & David J. Nutt & Gustavo Deco & Morten L. Kringelbach & Emmanuel A. Stamatakis & Amy Kuceyeski, 2022. "Receptor-informed network control theory links LSD and psilocybin to a flattening of the brain’s control energy landscape," Nature Communications, Nature, vol. 13(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-18037-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.