IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-53081-z.html
   My bibliography  Save this article

Matching patients to clinical trials with large language models

Author

Listed:
  • Qiao Jin

    (National Institutes of Health (NIH))

  • Zifeng Wang

    (University of Illinois Urbana-Champaign)

  • Charalampos S. Floudas

    (National Institutes of Health)

  • Fangyuan Chen

    (University of Pittsburgh)

  • Changlin Gong

    (Albert Einstein College of Medicine)

  • Dara Bracken-Clarke

    (National Institutes of Health)

  • Elisabetta Xue

    (National Institutes of Health)

  • Yifan Yang

    (National Institutes of Health (NIH)
    University of Maryland College Park)

  • Jimeng Sun

    (University of Illinois Urbana-Champaign)

  • Zhiyong Lu

    (National Institutes of Health (NIH))

Abstract

Patient recruitment is challenging for clinical trials. We introduce TrialGPT, an end-to-end framework for zero-shot patient-to-trial matching with large language models. TrialGPT comprises three modules: it first performs large-scale filtering to retrieve candidate trials (TrialGPT-Retrieval); then predicts criterion-level patient eligibility (TrialGPT-Matching); and finally generates trial-level scores (TrialGPT-Ranking). We evaluate TrialGPT on three cohorts of 183 synthetic patients with over 75,000 trial annotations. TrialGPT-Retrieval can recall over 90% of relevant trials using less than 6% of the initial collection. Manual evaluations on 1015 patient-criterion pairs show that TrialGPT-Matching achieves an accuracy of 87.3% with faithful explanations, close to the expert performance. The TrialGPT-Ranking scores are highly correlated with human judgments and outperform the best-competing models by 43.8% in ranking and excluding trials. Furthermore, our user study reveals that TrialGPT can reduce the screening time by 42.6% in patient recruitment. Overall, these results have demonstrated promising opportunities for patient-to-trial matching with TrialGPT.

Suggested Citation

  • Qiao Jin & Zifeng Wang & Charalampos S. Floudas & Fangyuan Chen & Changlin Gong & Dara Bracken-Clarke & Elisabetta Xue & Yifan Yang & Jimeng Sun & Zhiyong Lu, 2024. "Matching patients to clinical trials with large language models," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-53081-z
    DOI: 10.1038/s41467-024-53081-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-53081-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-53081-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Matthew Hutson, 2024. "How AI is being used to accelerate clinical trials," Nature, Nature, vol. 627(8003), pages 2-5, March.
    2. Marcus Woo, 2019. "An AI boost for clinical trials," Nature, Nature, vol. 573(7775), pages 100-102, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shepherd, Dean A. & Majchrzak, Ann, 2022. "Machines augmenting entrepreneurs: Opportunities (and threats) at the Nexus of artificial intelligence and entrepreneurship," Journal of Business Venturing, Elsevier, vol. 37(4).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-53081-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.