IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-024-54970-z.html
   My bibliography  Save this article

A deep multiple instance learning framework improves microsatellite instability detection from tumor next generation sequencing

Author

Listed:
  • John Ziegler

    (Memorial Sloan Kettering Cancer Center
    MongoDB)

  • Jaclyn F. Hechtman

    (Memorial Sloan Kettering Cancer Center
    Caris Life Sciences)

  • Satshil Rana

    (Memorial Sloan Kettering Cancer Center)

  • Ryan N. Ptashkin

    (Memorial Sloan Kettering Cancer Center
    Natera)

  • Gowtham Jayakumaran

    (Memorial Sloan Kettering Cancer Center
    Guardant Health)

  • Sumit Middha

    (Memorial Sloan Kettering Cancer Center
    Adaptimmune)

  • Shweta S. Chavan

    (Memorial Sloan Kettering Cancer Center
    Repertoire Immune Medicines)

  • Chad Vanderbilt

    (Memorial Sloan Kettering Cancer Center)

  • Deborah DeLair

    (Memorial Sloan Kettering Cancer Center
    Northwell Health)

  • Jacklyn Casanova

    (Memorial Sloan Kettering Cancer Center)

  • Jinru Shia

    (Memorial Sloan Kettering Cancer Center)

  • Nicole DeGroat

    (Memorial Sloan Kettering Cancer Center
    Regeneron Pharmaceuticals)

  • Ryma Benayed

    (Memorial Sloan Kettering Cancer Center
    AstraZeneca)

  • Marc Ladanyi

    (Memorial Sloan Kettering Cancer Center)

  • Michael F. Berger

    (Memorial Sloan Kettering Cancer Center
    Memorial Sloan Kettering Cancer Center)

  • Thomas J. Fuchs

    (Memorial Sloan Kettering Cancer Center
    Weill Cornell Graduate School of Medical Sciences
    Elli Lilly and Company)

  • A. Rose Brannon

    (Memorial Sloan Kettering Cancer Center)

  • Ahmet Zehir

    (Memorial Sloan Kettering Cancer Center
    Natera)

Abstract

Microsatellite instability (MSI) is a critical phenotype of cancer genomes and an FDA-recognized biomarker that can guide treatment with immune checkpoint inhibitors. Previous work has demonstrated that next-generation sequencing data can be used to identify samples with MSI-high phenotype. However, low tumor purity, as frequently observed in routine clinical samples, poses a challenge to the sensitivity of existing algorithms. To overcome this critical issue, we developed MiMSI, an MSI classifier based on deep neural networks and trained using a dataset that included low tumor purity MSI cases in a multiple instance learning framework. On a challenging yet representative set of cases, MiMSI showed higher sensitivity (0.895) and auROC (0.971) than MSISensor (sensitivity: 0.67; auROC: 0.907), an open-source software previously validated for clinical use at our institution using MSK-IMPACT large panel targeted NGS data. In a separate, prospective cohort, MiMSI confirmed that it outperforms MSISensor in low purity cases (P = 8.244e-07).

Suggested Citation

  • John Ziegler & Jaclyn F. Hechtman & Satshil Rana & Ryan N. Ptashkin & Gowtham Jayakumaran & Sumit Middha & Shweta S. Chavan & Chad Vanderbilt & Deborah DeLair & Jacklyn Casanova & Jinru Shia & Nicole , 2025. "A deep multiple instance learning framework improves microsatellite instability detection from tumor next generation sequencing," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-54970-z
    DOI: 10.1038/s41467-024-54970-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-54970-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-54970-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-54970-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.