A multimodal generative AI copilot for human pathology

My bibliography Save this article

A multimodal generative AI copilot for human pathology

Author

Listed:

Ming Y. Lu
(Harvard Medical School
Harvard Medical School
Broad Institute of Harvard and MIT
Massachusetts Institute of Technology (MIT))
Bowen Chen
(Harvard Medical School
Harvard Medical School)
Drew F. K. Williamson
(Harvard Medical School
Harvard Medical School
Broad Institute of Harvard and MIT)
Richard J. Chen
(Harvard Medical School
Harvard Medical School
Broad Institute of Harvard and MIT)
Melissa Zhao
(Harvard Medical School
Harvard Medical School)
Aaron K. Chow
(Ohio State University)
Kenji Ikemura
(Harvard Medical School
Harvard Medical School)
Ahrong Kim
(Harvard Medical School
Pusan National University)
Dimitra Pouli
(Harvard Medical School
Harvard Medical School)
Ankush Patel
(Mayo Clinic)
Amr Soliman
(Ohio State University)
Chengkuan Chen
(Harvard Medical School)
Tong Ding
(Harvard Medical School
Harvard University)
Judy J. Wang
(Harvard Medical School)
Georg Gerber
(Harvard Medical School)
Ivy Liang
(Harvard Medical School
Harvard University)
Long Phi Le
(Harvard Medical School)
Anil V. Parwani
(Ohio State University)
Luca L. Weishaupt
(Harvard Medical School
Harvard-MIT)
Faisal Mahmood
(Harvard Medical School
Harvard Medical School
Broad Institute of Harvard and MIT
Harvard University)

Registered:

Abstract

Computational pathology1,2 has witnessed considerable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders3,4. However, despite the explosive growth of generative artificial intelligence (AI), there have been few studies on building general-purpose multimodal AI assistants and copilots5 tailored to pathology. Here we present PathChat, a vision-language generalist AI assistant for human pathology. We built PathChat by adapting a foundational vision encoder for pathology, combining it with a pretrained large language model and fine-tuning the whole system on over 456,000 diverse visual-language instructions consisting of 999,202 question and answer turns. We compare PathChat with several multimodal vision-language AI assistants and GPT-4V, which powers the commercially available multimodal general-purpose AI assistant ChatGPT-4 (ref. 6). PathChat achieved state-of-the-art performance on multiple-choice diagnostic questions from cases with diverse tissue origins and disease models. Furthermore, using open-ended questions and human expert evaluation, we found that overall PathChat produced more accurate and pathologist-preferable responses to diverse queries related to pathology. As an interactive vision-language AI copilot that can flexibly handle both visual and natural language inputs, PathChat may potentially find impactful applications in pathology education, research and human-in-the-loop clinical decision-making.

Suggested Citation

Ming Y. Lu & Bowen Chen & Drew F. K. Williamson & Richard J. Chen & Melissa Zhao & Aaron K. Chow & Kenji Ikemura & Ahrong Kim & Dimitra Pouli & Ankush Patel & Amr Soliman & Chengkuan Chen & Tong Ding , 2024. "A multimodal generative AI copilot for human pathology," Nature, Nature, vol. 634(8033), pages 466-473, October.

Handle: RePEc:nat:nature:v:634:y:2024:i:8033:d:10.1038_s41586-024-07618-3
DOI: 10.1038/s41586-024-07618-3

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:634:y:2024:i:8033:d:10.1038_s41586-024-07618-3. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A multimodal generative AI copilot for human pathology

Author

Abstract

Suggested Citation

Download full text from publisher

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data