IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v11y2020i1d10.1038_s41467-020-19354-z.html
   My bibliography  Save this article

Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification

Author

Listed:
  • Khawla Seddiki

    (Computational Biology Laboratory, CHU de Québec - Université Laval Research Center, Québec City, Québec, Canada.
    U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM)

  • Philippe Saudemont

    (U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM)

  • Frédéric Precioso

    (INRIA, I3S)

  • Nina Ogrinc

    (U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM)

  • Maxence Wisztorski

    (U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM)

  • Michel Salzet

    (U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM)

  • Isabelle Fournier

    (U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM)

  • Arnaud Droit

    (Computational Biology Laboratory, CHU de Québec - Université Laval Research Center, Québec City, Québec, Canada.)

Abstract

Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid analysis. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the number of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clinical MS-data. We show the use of cumulative learning using datasets generated in different biological contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small numbers of samples are available.

Suggested Citation

  • Khawla Seddiki & Philippe Saudemont & Frédéric Precioso & Nina Ogrinc & Maxence Wisztorski & Michel Salzet & Isabelle Fournier & Arnaud Droit, 2020. "Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification," Nature Communications, Nature, vol. 11(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-19354-z
    DOI: 10.1038/s41467-020-19354-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-020-19354-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-020-19354-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jyh-Woei Lin, 2022. "Generalized two-dimensional principal component analysis and two artificial neural network models to detect traveling ionospheric disturbances," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 111(2), pages 1245-1270, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-19354-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.