IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v11y2020i1d10.1038_s41467-019-14225-8.html
   My bibliography  Save this article

Machine learning can identify newly diagnosed patients with CLL at high risk of infection

Author

Listed:
  • Rudi Agius

    (Technical University of Denmark
    Rigshospitalet, Copenhagen University Hospital)

  • Christian Brieghel

    (Rigshospitalet, Copenhagen University Hospital)

  • Michael A. Andersen

    (Rigshospitalet, Copenhagen University Hospital)

  • Alexander T. Pearson

    (University of Chicago)

  • Bruno Ledergerber

    (University of Zurich
    Rigshospitalet, Copenhagen University Hospital)

  • Alessandro Cozzi-Lepri

    (University College London)

  • Yoram Louzoun

    (Bar-Ilan University)

  • Christen L. Andersen

    (Rigshospitalet, Copenhagen University Hospital
    Copenhagen University)

  • Jacob Bergstedt

    (Institut Pasteur)

  • Jakob H. Stemann

    (Rigshospitalet, Copenhagen University Hospital)

  • Mette Jørgensen

    (Rigshospitalet, Copenhagen University Hospital)

  • Man-Hung Eric Tang

    (Rigshospitalet, Copenhagen University Hospital)

  • Magnus Fontes

    (Rigshospitalet, Copenhagen University Hospital
    International Group for Data Analysis, Institut Pasteur)

  • Jasmin Bahlo

    (University Hospital)

  • Carmen D. Herling

    (University Hospital)

  • Michael Hallek

    (University Hospital
    University Hospital, Cologne, CECAD (Cluster of Excellence on Cellular Stress Responses in Aging-Associated Diseases), University of Cologne)

  • Jens Lundgren

    (Rigshospitalet, Copenhagen University Hospital)

  • Cameron Ross MacPherson

    (Rigshospitalet, Copenhagen University Hospital)

  • Jan Larsen

    (Technical University of Denmark)

  • Carsten U. Niemann

    (Rigshospitalet, Copenhagen University Hospital)

Abstract

Infections have become the major cause of morbidity and mortality among patients with chronic lymphocytic leukemia (CLL) due to immune dysfunction and cytotoxic CLL treatment. Yet, predictive models for infection are missing. In this work, we develop the CLL Treatment-Infection Model (CLL-TIM) that identifies patients at risk of infection or CLL treatment within 2 years of diagnosis as validated on both internal and external cohorts. CLL-TIM is an ensemble algorithm composed of 28 machine learning algorithms based on data from 4,149 patients with CLL. The model is capable of dealing with heterogeneous data, including the high rates of missing data to be expected in the real-world setting, with a precision of 72% and a recall of 75%. To address concerns regarding the use of complex machine learning algorithms in the clinic, for each patient with CLL, CLL-TIM provides explainable predictions through uncertainty estimates and personalized risk factors.

Suggested Citation

  • Rudi Agius & Christian Brieghel & Michael A. Andersen & Alexander T. Pearson & Bruno Ledergerber & Alessandro Cozzi-Lepri & Yoram Louzoun & Christen L. Andersen & Jacob Bergstedt & Jakob H. Stemann & , 2020. "Machine learning can identify newly diagnosed patients with CLL at high risk of infection," Nature Communications, Nature, vol. 11(1), pages 1-17, December.
  • Handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-019-14225-8
    DOI: 10.1038/s41467-019-14225-8
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-019-14225-8
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-019-14225-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-019-14225-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.