IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v11y2020i1d10.1038_s41467-020-19015-1.html
   My bibliography  Save this article

Benchmarking of cell type deconvolution pipelines for transcriptomics data

Author

Listed:
  • Francisco Avila Cobos

    (Ghent University
    Cancer Research Institute Ghent (CRIG)
    Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research)

  • José Alquicira-Hernandez

    (Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research
    Institute for Molecular Bioscience, University of Queensland)

  • Joseph E. Powell

    (Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research
    Institute for Molecular Bioscience, University of Queensland)

  • Pieter Mestdagh

    (Ghent University
    Cancer Research Institute Ghent (CRIG))

  • Katleen De Preter

    (Ghent University
    Cancer Research Institute Ghent (CRIG))

Abstract

Many computational methods have been developed to infer cell type proportions from bulk transcriptomics data. However, an evaluation of the impact of data transformation, pre-processing, marker selection, cell type composition and choice of methodology on the deconvolution results is still lacking. Using five single-cell RNA-sequencing (scRNA-seq) datasets, we generate pseudo-bulk mixtures to evaluate the combined impact of these factors. Both bulk deconvolution methodologies and those that use scRNA-seq data as reference perform best when applied to data in linear scale and the choice of normalization has a dramatic impact on some, but not all methods. Overall, methods that use scRNA-seq data have comparable performance to the best performing bulk methods whereas semi-supervised approaches show higher error values. Moreover, failure to include cell types in the reference that are present in a mixture leads to substantially worse results, regardless of the previous choices. Altogether, we evaluate the combined impact of factors affecting the deconvolution task across different datasets and propose general guidelines to maximize its performance.

Suggested Citation

  • Francisco Avila Cobos & José Alquicira-Hernandez & Joseph E. Powell & Pieter Mestdagh & Katleen De Preter, 2020. "Benchmarking of cell type deconvolution pipelines for transcriptomics data," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-19015-1
    DOI: 10.1038/s41467-020-19015-1
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-020-19015-1
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-020-19015-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kobe Ridder & Huiwen Che & Kaat Leroy & Bernard Thienpont, 2024. "Benchmarking of methods for DNA methylome deconvolution," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    2. Gavin J. Sutton & Daniel Poppe & Rebecca K. Simmons & Kieran Walsh & Urwah Nawaz & Ryan Lister & Johann A. Gagnon-Bartsch & Irina Voineagu, 2022. "Comprehensive evaluation of deconvolution methods for human brain gene expression," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    3. Kang Wang & Ioannis Zerdes & Henrik J. Johansson & Dhifaf Sarhan & Yizhe Sun & Dimitris C. Kanellis & Emmanouil G. Sifakis & Artur Mezheyeuski & Xingrong Liu & Niklas Loman & Ingrid Hedenfalk & Jonas , 2024. "Longitudinal molecular profiling elucidates immunometabolism dynamics in breast cancer," Nature Communications, Nature, vol. 15(1), pages 1-24, December.
    4. Eloise Berson & Anjali Sreenivas & Thanaphong Phongpreecha & Amalia Perna & Fiorella C. Grandi & Lei Xue & Neal G. Ravindra & Neelufar Payrovnaziri & Samson Mataraso & Yeasul Kim & Camilo Espinosa & A, 2023. "Whole genome deconvolution unveils Alzheimer’s resilient epigenetic signature," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    5. Adrian B. Levine & Liana Nobre & Anirban Das & Scott Milos & Vanessa Bianchi & Monique Johnson & Nicholas R. Fernandez & Lucie Stengs & Scott Ryall & Michelle Ku & Mansuba Rana & Benjamin Laxer & Java, 2024. "Immuno-oncologic profiling of pediatric brain tumors reveals major clinical significance of the tumor immune microenvironment," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    6. Daniel Charytonowicz & Rachel Brody & Robert Sebra, 2023. "Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    7. Yann Vanrobaeys & Zeru J. Peterson & Emily. N. Walsh & Snehajyoti Chatterjee & Li-Chun Lin & Lisa C. Lyons & Thomas Nickl-Jockschat & Ted Abel, 2023. "Spatial transcriptomics reveals unique gene expression changes in different brain regions after sleep deprivation," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Nelson Johansen & Hongru Hu & Gerald Quon, 2023. "Projecting RNA measurements onto single cell atlases to extract cell type-specific expression profiles using scProjection," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    9. Xiaoyu Song & Jiayi Ji & Joseph H. Rothstein & Stacey E. Alexeeff & Lori C. Sakoda & Adriana Sistig & Ninah Achacoso & Eric Jorgenson & Alice S. Whittemore & Robert J. Klein & Laurel A. Habel & Pei Wa, 2023. "MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    10. Khoa A. Tran & Venkateswar Addala & Rebecca L. Johnston & David Lovell & Andrew Bradley & Lambros T. Koufariotis & Scott Wood & Sunny Z. Wu & Daniel Roden & Ghamdan Al-Eryani & Alexander Swarbrick & E, 2023. "Performance of tumour microenvironment deconvolution methods in breast cancer using single-cell simulated bulk mixtures," Nature Communications, Nature, vol. 14(1), pages 1-17, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:11:y:2020:i:1:d:10.1038_s41467-020-19015-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.