
A pathology foundation model for cancer diagnosis and prognosis prediction

Author

Listed:
  • Xiyue Wang

    (Harvard Medical School
    Stanford University School of Medicine)

  • Junhan Zhao

    (Harvard Medical School
    Harvard T.H. Chan School of Public Health)

  • Eliana Marostica

    (Harvard Medical School
    Harvard-Massachusetts Institute of Technology)

  • Wei Yuan

    (Sichuan University)

  • Jietian Jin

    (Sun Yat-sen University Cancer Center)

  • Jiayu Zhang

    (Sichuan University)

  • Ruijiang Li

    (Stanford University School of Medicine)

  • Hongping Tang

    (Shenzhen Maternity & Child Healthcare Hospital)

  • Kanran Wang

    (Chongqing University Cancer Hospital)

  • Yu Li

    (Chongqing University Cancer Hospital)

  • Fang Wang

    (The Affiliated Yantai Yuhuangding Hospital of Qingdao University)

  • Yulong Peng

    (The First Affiliated Hospital of Jinan University)

  • Junyou Zhu

    (Sun Yat-sen University)

  • Jing Zhang

    (Sichuan University)

  • Christopher R. Jackson

    (Harvard Medical School
    Pennsylvania State University
    Massachusetts General Hospital)

  • Jun Zhang

    (Tencent AI Lab)

  • Deborah Dillon

    (Brigham and Women’s Hospital)

  • Nancy U. Lin

    (Dana-Farber Cancer Institute)

  • Lynette Sholl

    (Brigham and Women’s Hospital
    Dana-Farber Cancer Institute)

  • Thomas Denize

    (Brigham and Women’s Hospital
    Dana-Farber Cancer Institute)

  • David Meredith

    (Brigham and Women’s Hospital)

  • Keith L. Ligon

    (Brigham and Women’s Hospital
    Dana-Farber Cancer Institute)

  • Sabina Signoretti

    (Brigham and Women’s Hospital
    Dana-Farber Cancer Institute)

  • Shuji Ogino

    (Brigham and Women’s Hospital
    Harvard T.H. Chan School of Public Health
    Broad Institute of MIT and Harvard)

  • Jeffrey A. Golden

    (Brigham and Women’s Hospital
    Cedars-Sinai Medical Center)

  • MacLean P. Nasrallah

    (Perelman School of Medicine at the University of Pennsylvania)

  • Xiao Han

    (Tencent AI Lab)

  • Sen Yang

    (Harvard Medical School
    Stanford University School of Medicine)

  • Kun-Hsing Yu

    (Harvard Medical School
    Brigham and Women’s Hospital
    Harvard University)

Abstract

Histopathology image evaluation is indispensable for cancer diagnoses and subtype classification. Standard artificial intelligence methods for histopathology image analyses have focused on optimizing specialized models for each diagnostic task [1,2]. Although such methods have achieved some success, they often have limited generalizability to images generated by different digitization protocols or samples collected from different populations [3]. Here, to address this challenge, we devised the Clinical Histopathology Imaging Evaluation Foundation (CHIEF) model, a general-purpose weakly supervised machine learning framework to extract pathology imaging features for systematic cancer evaluation. CHIEF leverages two complementary pretraining methods to extract diverse pathology representations: unsupervised pretraining for tile-level feature identification and weakly supervised pretraining for whole-slide pattern recognition. We developed CHIEF using 60,530 whole-slide images spanning 19 anatomical sites. Through pretraining on 44 terabytes of high-resolution pathology imaging datasets, CHIEF extracted microscopic representations useful for cancer cell detection, tumour origin identification, molecular profile characterization and prognostic prediction. We successfully validated CHIEF using 19,491 whole-slide images from 32 independent slide sets collected from 24 hospitals and cohorts internationally. Overall, CHIEF outperformed the state-of-the-art deep learning methods by up to 36.1%, showing its ability to address domain shifts observed in samples from diverse populations and processed by different slide preparation methods. CHIEF provides a generalizable foundation for efficient digital pathology evaluation for patients with cancer.
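
The abstract describes combining tile-level features into a whole-slide representation under weak (slide-level) supervision, but it does not specify the architecture. As a rough illustration only, the sketch below shows attention-based pooling, one common way to aggregate tile embeddings into a slide embedding in weakly supervised settings. It is not taken from the paper: the function names (`softmax`, `attention_pool`), the linear scoring rule and the toy two-dimensional features are all assumptions made for this example.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(
    tile_features: Sequence[Sequence[float]],
    scoring_vector: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Aggregate tile embeddings into one slide embedding.

    Hypothetical helper, not the CHIEF implementation. Each tile gets a
    score (dot product with `scoring_vector`), the scores are normalized
    with softmax, and the slide embedding is the weighted sum of tiles.
    Returns (slide_embedding, attention_weights).
    """
    scores = [
        sum(w * x for w, x in zip(scoring_vector, tile))
        for tile in tile_features
    ]
    weights = softmax(scores)
    dim = len(tile_features[0])
    slide_embedding = [
        sum(weights[i] * tile_features[i][d] for i in range(len(tile_features)))
        for d in range(dim)
    ]
    return slide_embedding, weights


# Toy example: two tiles with 2-dimensional features (made-up values).
tiles = [[1.0, 0.0], [0.0, 1.0]]
uniform_embedding, uniform_weights = attention_pool(tiles, [0.0, 0.0])
focused_embedding, focused_weights = attention_pool(tiles, [10.0, 0.0])
```

With a zero scoring vector every tile receives the same weight, so the slide embedding is the plain average of the tiles; a scoring vector that favours the first feature shifts nearly all of the weight onto the first tile. In a trained model the scoring vector would be learned from slide-level labels alone, which is what makes the supervision "weak".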

Suggested Citation

  • Xiyue Wang & Junhan Zhao & Eliana Marostica & Wei Yuan & Jietian Jin & Jiayu Zhang & Ruijiang Li & Hongping Tang & Kanran Wang & Yu Li & Fang Wang & Yulong Peng & Junyou Zhu & Jing Zhang & Christopher, 2024. "A pathology foundation model for cancer diagnosis and prognosis prediction," Nature, Nature, vol. 634(8035), pages 970-978, October.
  • Handle: RePEc:nat:nature:v:634:y:2024:i:8035:d:10.1038_s41586-024-07894-z
    DOI: 10.1038/s41586-024-07894-z

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-024-07894-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


