IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v634y2024i8033d10.1038_s41586-024-07618-3.html
   My bibliography  Save this article

A multimodal generative AI copilot for human pathology

Author

Listed:
  • Ming Y. Lu

    (Harvard Medical School
    Harvard Medical School
    Broad Institute of Harvard and MIT
    Massachusetts Institute of Technology (MIT))

  • Bowen Chen

    (Harvard Medical School
    Harvard Medical School)

  • Drew F. K. Williamson

    (Harvard Medical School
    Harvard Medical School
    Broad Institute of Harvard and MIT)

  • Richard J. Chen

    (Harvard Medical School
    Harvard Medical School
    Broad Institute of Harvard and MIT)

  • Melissa Zhao

    (Harvard Medical School
    Harvard Medical School)

  • Aaron K. Chow

    (Ohio State University)

  • Kenji Ikemura

    (Harvard Medical School
    Harvard Medical School)

  • Ahrong Kim

    (Harvard Medical School
    Pusan National University)

  • Dimitra Pouli

    (Harvard Medical School
    Harvard Medical School)

  • Ankush Patel

    (Mayo Clinic)

  • Amr Soliman

    (Ohio State University)

  • Chengkuan Chen

    (Harvard Medical School)

  • Tong Ding

    (Harvard Medical School
    Harvard University)

  • Judy J. Wang

    (Harvard Medical School)

  • Georg Gerber

    (Harvard Medical School)

  • Ivy Liang

    (Harvard Medical School
    Harvard University)

  • Long Phi Le

    (Harvard Medical School)

  • Anil V. Parwani

    (Ohio State University)

  • Luca L. Weishaupt

    (Harvard Medical School
    Harvard-MIT)

  • Faisal Mahmood

    (Harvard Medical School
    Harvard Medical School
    Broad Institute of Harvard and MIT
    Harvard University)

Abstract

Computational pathology1,2 has witnessed considerable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders3,4. However, despite the explosive growth of generative artificial intelligence (AI), there have been few studies on building general-purpose multimodal AI assistants and copilots5 tailored to pathology. Here we present PathChat, a vision-language generalist AI assistant for human pathology. We built PathChat by adapting a foundational vision encoder for pathology, combining it with a pretrained large language model and fine-tuning the whole system on over 456,000 diverse visual-language instructions consisting of 999,202 question and answer turns. We compare PathChat with several multimodal vision-language AI assistants and GPT-4V, which powers the commercially available multimodal general-purpose AI assistant ChatGPT-4 (ref. 6). PathChat achieved state-of-the-art performance on multiple-choice diagnostic questions from cases with diverse tissue origins and disease models. Furthermore, using open-ended questions and human expert evaluation, we found that overall PathChat produced more accurate and pathologist-preferable responses to diverse queries related to pathology. As an interactive vision-language AI copilot that can flexibly handle both visual and natural language inputs, PathChat may potentially find impactful applications in pathology education, research and human-in-the-loop clinical decision-making.

Suggested Citation

  • Ming Y. Lu & Bowen Chen & Drew F. K. Williamson & Richard J. Chen & Melissa Zhao & Aaron K. Chow & Kenji Ikemura & Ahrong Kim & Dimitra Pouli & Ankush Patel & Amr Soliman & Chengkuan Chen & Tong Ding , 2024. "A multimodal generative AI copilot for human pathology," Nature, Nature, vol. 634(8033), pages 466-473, October.
  • Handle: RePEc:nat:nature:v:634:y:2024:i:8033:d:10.1038_s41586-024-07618-3
    DOI: 10.1038/s41586-024-07618-3
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-024-07618-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-024-07618-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:634:y:2024:i:8033:d:10.1038_s41586-024-07618-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.