Semi-Supervised Approach for EGFR Mutation Prediction on CT Images

Author

Listed:
  • Cláudia Pinheiro

    (INESC TEC—Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal
    FEUP—Faculty of Engineering, University of Porto, 4200-465 Porto, Portugal)

  • Francisco Silva

    (INESC TEC—Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal
    FCUP—Faculty of Science, University of Porto, 4169-007 Porto, Portugal)

  • Tania Pereira

    (INESC TEC—Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal)

  • Hélder P. Oliveira

    (INESC TEC—Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal
    FCUP—Faculty of Science, University of Porto, 4169-007 Porto, Portugal)

Abstract

Deep learning methods have delivered promising results in medical imaging; however, the success of such models relies heavily on large, properly annotated datasets. Annotating medical images is a laborious, expensive, and time-consuming process. The difficulty is greater for mutation status labels, since obtaining them requires additional exams (usually biopsies). Raw images without annotations, on the other hand, are collected extensively as part of the clinical routine. This work investigated methods that could mitigate the scarcity of labelled data by using both labelled and unlabelled data to improve the efficiency of predictive models. A semi-supervised learning (SSL) approach was developed to predict epidermal growth factor receptor (EGFR) mutation status in lung cancer in a less invasive manner using 3D CT scans. The proposed approach combines a variational autoencoder (VAE) with adversarial training, with the intention that features extracted from unlabelled data to discriminate between images can help in the classification task. Adversarial training, extending a traditional variational autoencoder, was used to incorporate both labelled and unlabelled images. The best-performing model achieved a mean AUC of 0.701 with only 14% of the training data labelled. This SSL approach improved the discrimination ability by nearly 7 percentage points over a fully supervised model developed with the same amount of labelled data, confirming the advantage of such methods when few annotated examples are available.
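As a rough illustration of how labelled and unlabelled images can share one training objective, the sketch below computes a generic semi-supervised VAE loss: a reconstruction term and a KL term for every sample, plus a classification term only for samples that carry a label. This is not the authors' implementation. The adversarial component described in the abstract is omitted, and the function names, the `beta` and `gamma` weights, and the dictionary layout of each sample are all assumptions made for this example.

```python
import math


def kl_to_standard_normal(mu, log_var):
    """KL divergence from N(mu, exp(log_var)) to N(0, I), summed over latent dims."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, log_var))


def mse(x, x_hat):
    """Mean squared reconstruction error for one flattened sample."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)


def bce(y, p, eps=1e-7):
    """Binary cross-entropy for one sample; p is the predicted P(mutant)."""
    p = min(max(p, eps), 1.0 - eps)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


def semi_supervised_loss(batch, beta=1.0, gamma=1.0):
    """Combine unsupervised and supervised terms over a mixed batch.

    Each sample is a dict with keys (hypothetical layout):
      x, x_hat    : input and reconstruction (flattened lists of floats)
      mu, log_var : encoder outputs for the latent Gaussian
      p           : classifier output in (0, 1)
      y           : 0 or 1 if labelled, None if unlabelled
    Returns (total, unsupervised_term, supervised_term).
    """
    unsup, sup, n_labelled = 0.0, 0.0, 0
    for s in batch:
        # Every sample contributes to the VAE terms, labelled or not.
        unsup += mse(s["x"], s["x_hat"]) + beta * kl_to_standard_normal(s["mu"], s["log_var"])
        # Only labelled samples contribute to the classification term.
        if s["y"] is not None:
            sup += bce(s["y"], s["p"])
            n_labelled += 1
    unsup /= len(batch)
    sup = sup / n_labelled if n_labelled else 0.0
    return unsup + gamma * sup, unsup, sup


if __name__ == "__main__":
    batch = [
        {"x": [0.2, 0.4], "x_hat": [0.1, 0.5], "mu": [0.0], "log_var": [0.0], "p": 0.8, "y": 1},
        {"x": [0.9, 0.1], "x_hat": [0.7, 0.2], "mu": [0.3], "log_var": [-0.1], "p": 0.4, "y": None},
    ]
    total, unsup, sup = semi_supervised_loss(batch)
    print(f"total={total:.4f} unsupervised={unsup:.4f} supervised={sup:.4f}")
```

Because the supervised term is averaged over labelled samples only, a batch containing no labels still produces a valid loss from the reconstruction and KL terms alone, which is what lets unlabelled scans take part in training.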

Suggested Citation

  • Cláudia Pinheiro & Francisco Silva & Tania Pereira & Hélder P. Oliveira, 2022. "Semi-Supervised Approach for EGFR Mutation Prediction on CT Images," Mathematics, MDPI, vol. 10(22), pages 1-18, November.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:22:p:4225-:d:970614
    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/22/4225/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/22/4225/
    Download Restriction: no