IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-36961-8.html
   My bibliography  Save this article

Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve

Author

Listed:
  • Daniel Charytonowicz

    (Icahn School of Medicine at Mount Sinai)

  • Rachel Brody

    (Icahn School of Medicine at Mount Sinai)

  • Robert Sebra

    (Icahn School of Medicine at Mount Sinai
    Icahn Genomics Institute
    Black Family Stem Cell Institute)

Abstract

We introduce UniCell: Deconvolve Base (UCDBase), a pre-trained, interpretable, deep learning model to deconvolve cell type fractions and predict cell identity across Spatial, bulk-RNA-Seq, and scRNA-Seq datasets without contextualized reference data. UCD is trained on 10 million pseudo-mixtures from a fully-integrated scRNA-Seq training database comprising over 28 million annotated single cells spanning 840 unique cell types from 898 studies. We show that our UCDBase and transfer-learning models achieve comparable or superior performance on in-silico mixture deconvolution to existing, reference-based, state-of-the-art methods. Feature attribute analysis uncovers gene signatures associated with cell-type specific inflammatory-fibrotic responses in ischemic kidney injury, discerns cancer subtypes, and accurately deconvolves tumor microenvironments. UCD identifies pathologic changes in cell fractions among bulk-RNA-Seq data for several disease states. Applied to lung cancer scRNA-Seq data, UCD annotates and distinguishes normal from cancerous cells. Overall, UCD enhances transcriptomic data analysis, aiding in assessment of cellular and spatial context.

Suggested Citation

  • Daniel Charytonowicz & Rachel Brody & Robert Sebra, 2023. "Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36961-8
    DOI: 10.1038/s41467-023-36961-8
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-36961-8
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-36961-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jiarui Ding & Aviv Regev, 2021. "Deep generative model embedding of single-cell RNA-Seq profiles on hyperspheres and hyperbolic spaces," Nature Communications, Nature, vol. 12(1), pages 1-17, December.
    2. Francesco Vallania & Andrew Tam & Shane Lofgren & Steven Schaffert & Tej D. Azad & Erika Bongen & Winston Haynes & Meia Alsup & Michael Alonso & Mark Davis & Edgar Engleman & Purvesh Khatri, 2018. "Leveraging heterogeneity across multiple datasets increases cell-mixture deconvolution accuracy and reduces biological and technical biases," Nature Communications, Nature, vol. 9(1), pages 1-8, December.
    3. Francisco Avila Cobos & José Alquicira-Hernandez & Joseph E. Powell & Pieter Mestdagh & Katleen Preter, 2020. "Author Correction: Benchmarking of cell type deconvolution pipelines for transcriptomics data," Nature Communications, Nature, vol. 11(1), pages 1-1, December.
    4. Vijay Saxena & Hongyu Gao & Samuel Arregui & Amy Zollman & Malgorzata Maria Kamocka & Xiaoling Xuei & Patrick McGuire & Michael Hutchens & Takashi Hato & David S. Hains & Andrew L. Schwaderer, 2021. "Kidney intercalated cells are phagocytic and acidify internalized uropathogenic Escherichia coli," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    5. Francisco Avila Cobos & José Alquicira-Hernandez & Joseph E. Powell & Pieter Mestdagh & Katleen De Preter, 2020. "Benchmarking of cell type deconvolution pipelines for transcriptomics data," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    6. Xuran Wang & Jihwan Park & Katalin Susztak & Nancy R. Zhang & Mingyao Li, 2019. "Bulk tissue cell type deconvolution with multi-subject single-cell expression reference," Nature Communications, Nature, vol. 10(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gavin J. Sutton & Daniel Poppe & Rebecca K. Simmons & Kieran Walsh & Urwah Nawaz & Ryan Lister & Johann A. Gagnon-Bartsch & Irina Voineagu, 2022. "Comprehensive evaluation of deconvolution methods for human brain gene expression," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    2. Kobe Ridder & Huiwen Che & Kaat Leroy & Bernard Thienpont, 2024. "Benchmarking of methods for DNA methylome deconvolution," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    3. Nelson Johansen & Hongru Hu & Gerald Quon, 2023. "Projecting RNA measurements onto single cell atlases to extract cell type-specific expression profiles using scProjection," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    4. Eloise Berson & Anjali Sreenivas & Thanaphong Phongpreecha & Amalia Perna & Fiorella C. Grandi & Lei Xue & Neal G. Ravindra & Neelufar Payrovnaziri & Samson Mataraso & Yeasul Kim & Camilo Espinosa & A, 2023. "Whole genome deconvolution unveils Alzheimer’s resilient epigenetic signature," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    5. Khoa A. Tran & Venkateswar Addala & Rebecca L. Johnston & David Lovell & Andrew Bradley & Lambros T. Koufariotis & Scott Wood & Sunny Z. Wu & Daniel Roden & Ghamdan Al-Eryani & Alexander Swarbrick & E, 2023. "Performance of tumour microenvironment deconvolution methods in breast cancer using single-cell simulated bulk mixtures," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    6. Xiaoyu Song & Jiayi Ji & Joseph H. Rothstein & Stacey E. Alexeeff & Lori C. Sakoda & Adriana Sistig & Ninah Achacoso & Eric Jorgenson & Alice S. Whittemore & Robert J. Klein & Laurel A. Habel & Pei Wa, 2023. "MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    7. Adrian B. Levine & Liana Nobre & Anirban Das & Scott Milos & Vanessa Bianchi & Monique Johnson & Nicholas R. Fernandez & Lucie Stengs & Scott Ryall & Michelle Ku & Mansuba Rana & Benjamin Laxer & Java, 2024. "Immuno-oncologic profiling of pediatric brain tumors reveals major clinical significance of the tumor immune microenvironment," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    8. Kang Wang & Ioannis Zerdes & Henrik J. Johansson & Dhifaf Sarhan & Yizhe Sun & Dimitris C. Kanellis & Emmanouil G. Sifakis & Artur Mezheyeuski & Xingrong Liu & Niklas Loman & Ingrid Hedenfalk & Jonas , 2024. "Longitudinal molecular profiling elucidates immunometabolism dynamics in breast cancer," Nature Communications, Nature, vol. 15(1), pages 1-24, December.
    9. Yann Vanrobaeys & Zeru J. Peterson & Emily. N. Walsh & Snehajyoti Chatterjee & Li-Chun Lin & Lisa C. Lyons & Thomas Nickl-Jockschat & Ted Abel, 2023. "Spatial transcriptomics reveals unique gene expression changes in different brain regions after sleep deprivation," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    10. Jakob Hartmann & Thomas Bajaj & Joy Otten & Claudia Klengel & Tim Ebert & Anne-Kathrin Gellner & Ellen Junglas & Kathrin Hafner & Elmira A. Anderzhanova & Fiona Tang & Galen Missig & Lindsay Rexrode &, 2024. "SKA2 regulated hyperactive secretory autophagy drives neuroinflammation-induced neurodegeneration," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    11. Bárbara Andrade Barbosa & Saskia D. Asten & Ji Won Oh & Arantza Farina-Sarasqueta & Joanne Verheij & Frederike Dijk & Hanneke W. M. Laarhoven & Bauke Ylstra & Juan J. Garcia Vallejo & Mark A. Wiel & Y, 2021. "Bayesian log-normal deconvolution for enhanced in silico microdissection of bulk gene expression data," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    12. Keyong Sun & Runda Xu & Fuhai Ma & Naixue Yang & Yang Li & Xiaofeng Sun & Peng Jin & Wenzhe Kang & Lemei Jia & Jianping Xiong & Haitao Hu & Yantao Tian & Xun Lan, 2022. "scRNA-seq of gastric tumor shows complex intercellular interaction with an alternative T cell exhaustion trajectory," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    13. Jozsef Karman & Jing Wang & Corneliu Bodea & Sherry Cao & Marc C Levesque, 2021. "Lung gene expression and single cell analyses reveal two subsets of idiopathic pulmonary fibrosis (IPF) patients associated with different pathogenic mechanisms," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-28, March.
    14. Chang Su & Zichun Xu & Xinning Shan & Biao Cai & Hongyu Zhao & Jingfei Zhang, 2023. "Cell-type-specific co-expression inference from single cell RNA-sequencing data," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    15. Ajita Shree & Musale Krushna Pavan & Hamim Zafar, 2023. "scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    16. Seoyeon Lee & Mohammad Naimul Islam & Kaveh Boostanpour & Dvir Aran & Guangchun Jin & Stephanie Christenson & Michael A. Matthay & Walter L. Eckalbar & Daryle J. DePianto & Joseph R. Arron & Liam Mage, 2021. "Molecular programs of fibrotic change in aging human lung," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    17. Xiao Zhou & Zhen Cheng & Mingyu Dong & Qi Liu & Weiyang Yang & Min Liu & Junzhang Tian & Weibin Cheng, 2022. "Tumor fractions deciphered from circulating cell-free DNA methylation for cancer early diagnosis," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    18. Matteo D’Antonio & Jennifer P. Nguyen & Timothy D. Arthur & Hiroko Matsui & Agnieszka D’Antonio-Chronowska & Kelly A. Frazer, 2023. "Fine mapping spatiotemporal mechanisms of genetic variants underlying cardiac traits and disease," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    19. Brian D. Lehmann & Antonio Colaprico & Tiago C. Silva & Jianjiao Chen & Hanbing An & Yuguang Ban & Hanchen Huang & Lily Wang & Jamaal L. James & Justin M. Balko & Paula I. Gonzalez-Ericsson & Melinda , 2021. "Multi-omics analysis identifies therapeutic vulnerabilities in triple-negative breast cancer subtypes," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    20. Beibei Ru & Jinlin Huang & Yu Zhang & Kenneth Aldape & Peng Jiang, 2023. "Estimation of cell lineages in tumors from spatial transcriptomics data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36961-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.