
Semi-supervised integration of single-cell transcriptomics data

Authors

Listed:
  • Massimo Andreatta

    (CHUV and University of Lausanne
    AGORA Cancer Research Center
    Swiss Institute of Bioinformatics)

  • Léonard Hérault

    (CHUV and University of Lausanne
    AGORA Cancer Research Center
    Swiss Institute of Bioinformatics)

  • Paul Gueguen

    (CHUV and University of Lausanne
    AGORA Cancer Research Center
    Swiss Institute of Bioinformatics)

  • David Gfeller

    (CHUV and University of Lausanne
    AGORA Cancer Research Center
    Swiss Institute of Bioinformatics)

  • Ariel J. Berenstein

    (Instituto Multidisciplinario de Investigaciones en Patologías Pediátricas (IMIPP), CONICET-GCBA)

  • Santiago J. Carmona

    (CHUV and University of Lausanne
    AGORA Cancer Research Center
    Swiss Institute of Bioinformatics)

Abstract

Batch effects in single-cell RNA-seq data pose a significant challenge for comparative analyses across samples, individuals, and conditions. Although batch effect correction methods are routinely applied, data integration often leads to overcorrection and can result in the loss of biological variability. In this work we present STACAS, a batch correction method for scRNA-seq that leverages prior knowledge on cell types to preserve biological variability upon integration. Through an open-source benchmark, we show that semi-supervised STACAS outperforms state-of-the-art unsupervised methods, as well as supervised methods such as scANVI and scGen. STACAS scales well to large datasets and is robust to incomplete and imprecise input cell type labels, which are commonly encountered in real-life integration tasks. We argue that the incorporation of prior cell type information should be a common practice in single-cell data integration, and we provide a flexible framework for semi-supervised batch effect correction.
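The abstract does not spell out how cell type labels enter the integration, so the following is only a minimal sketch of one way such prior knowledge could be used, not the STACAS implementation. It assumes that candidate cell pairs ("anchors") between two batches are found as mutual nearest neighbors, and that a pair is discarded only when both cells carry a label and the labels disagree. Unlabeled cells (`None`) are never penalized, which is one way to stay tolerant of incomplete annotations. All function names and the toy data are hypothetical.

```python
import numpy as np

def mutual_nearest_pairs(a, b):
    """Return (i, j) pairs where a[i] and b[j] are each other's nearest neighbor."""
    # Pairwise squared Euclidean distances, shape (len(a), len(b)).
    d = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    nearest_in_b = d.argmin(axis=1)  # closest cell in b for each cell in a
    nearest_in_a = d.argmin(axis=0)  # closest cell in a for each cell in b
    return [(i, int(j)) for i, j in enumerate(nearest_in_b) if nearest_in_a[j] == i]

def filter_pairs_by_label(pairs, labels_a, labels_b):
    """Keep a pair unless both labels are known and differ (None = unlabeled)."""
    kept = []
    for i, j in pairs:
        la, lb = labels_a[i], labels_b[j]
        if la is None or lb is None or la == lb:
            kept.append((i, j))
    return kept

# Toy example (hypothetical data): three cells per batch in a 2-D embedding.
batch_a = np.array([[0.0, 0.0], [5.0, 5.0], [10.0, 0.0]])
batch_b = np.array([[0.5, 0.0], [5.0, 5.5], [10.0, 0.5]])
labels_a = ["T cell", "B cell", "NK"]
labels_b = ["T cell", None, "Monocyte"]  # one cell unlabeled, one conflicting

pairs = mutual_nearest_pairs(batch_a, batch_b)
kept = filter_pairs_by_label(pairs, labels_a, labels_b)
print(pairs)  # all geometric matches
print(kept)   # matches that survive the label check
```

In this toy case the pair with conflicting labels ("NK" vs "Monocyte") is dropped, while the pair involving an unlabeled cell is retained; a batch correction step would then be estimated from the surviving pairs only.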

Suggested Citation

  • Massimo Andreatta & Léonard Hérault & Paul Gueguen & David Gfeller & Ariel J. Berenstein & Santiago J. Carmona, 2024. "Semi-supervised integration of single-cell transcriptomics data," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-45240-z
    DOI: 10.1038/s41467-024-45240-z

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-45240-z
    File Function: Abstract
    Download Restriction: no


    References listed on IDEAS

    1. Massimo Andreatta & Jesus Corria-Osorio & Sören Müller & Rafael Cubas & George Coukos & Santiago J. Carmona, 2021. "Interpretation of T cell states from single-cell transcriptomics data using reference atlases," Nature Communications, Nature, vol. 12(1), pages 1-19, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiaofeng Liao & Wenxue Li & Hongyue Zhou & Barani Kumar Rajendran & Ao Li & Jingjing Ren & Yi Luan & David A. Calderwood & Benjamin Turk & Wenwen Tang & Yansheng Liu & Dianqing Wu, 2024. "The CUL5 E3 ligase complex negatively regulates central signaling pathways in CD8+ T cells," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    2. Nils-Petter Rudqvist & Maud Charpentier & Claire Lhuillier & Erik Wennerberg & Sheila Spada & Caroline Sheridan & Xi Kathy Zhou & Tuo Zhang & Silvia C. Formenti & Jennifer S. Sims & Alicia Alonso & Sa, 2023. "Immunotherapy targeting different immune compartments in combination with radiation therapy induces regression of resistant tumors," Nature Communications, Nature, vol. 14(1), pages 1-23, December.
    3. Alessandra Castiglioni & Yagai Yang & Katherine Williams & Alvin Gogineni & Ryan S. Lane & Amber W. Wang & Justin A. Shyer & Zhe Zhang & Stephanie Mittman & Alan Gutierrez & Jillian L. Astarita & Minh, 2023. "Combined PD-L1/TGFβ blockade allows expansion and differentiation of stem cell-like CD8 T cells in immune excluded tumors," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    4. Carmen Oi Ning Leung & Yang Yang & Rainbow Wing Hei Leung & Karl Kam Hei So & Hai Jun Guo & Martina Mang Leng Lei & Gregory Kenneth Muliawan & Yuan Gao & Qian Qian Yu & Jing Ping Yun & Stephanie Ma & , 2023. "Broad-spectrum kinome profiling identifies CDK6 upregulation as a driver of lenvatinib resistance in hepatocellular carcinoma," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    5. Matthew A. Cottam & Heather L. Caslin & Nathan C. Winn & Alyssa H. Hasty, 2022. "Multiomics reveals persistence of obesity-associated immune cell phenotypes in adipose tissue during weight loss and weight regain in mice," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    6. Siqi Li & Kun Li & Kang Wang & Haoyuan Yu & Xiangyang Wang & Mengchen Shi & Zhixing Liang & Zhou Yang & Yongwei Hu & Yang Li & Wei Liu & Hua Li & Shuqun Cheng & Linsen Ye & Yang Yang, 2023. "Low-dose radiotherapy combined with dual PD-L1 and VEGFA blockade elicits antitumor response in hepatocellular carcinoma mediated by activated intratumoral CD8+ exhausted-like T cells," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    7. Alexandra Argyriou & Marc H. Wadsworth & Adrian Lendvai & Stephen M. Christensen & Aase H. Hensvold & Christina Gerstner & Annika Vollenhoven & Kellie Kravarik & Aaron Winkler & Vivianne Malmström & K, 2022. "Single cell sequencing identifies clonally expanded synovial CD4+ TPH cells expressing GPR56 in rheumatoid arthritis," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    8. Joyce B. Kang & Aparna Nathan & Kathryn Weinand & Fan Zhang & Nghia Millard & Laurie Rumker & D. Branch Moody & Ilya Korsunsky & Soumya Raychaudhuri, 2021. "Efficient and precise single-cell reference atlas mapping with Symphony," Nature Communications, Nature, vol. 12(1), pages 1-21, December.
    9. Yi Zhang & Guanjue Xiang & Alva Yijia Jiang & Allen Lynch & Zexian Zeng & Chenfei Wang & Wubing Zhang & Jingyu Fan & Jiajinlong Kang & Shengqing Stan Gu & Changxin Wan & Boning Zhang & X. Shirley Liu , 2023. "MetaTiME integrates single-cell gene expression to characterize the meta-components of the tumor immune microenvironment," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    10. Marcel P. Trefny & Nicole Kirchhammer & Priska Auf der Maur & Marina Natoli & Dominic Schmid & Markus Germann & Laura Fernandez Rodriguez & Petra Herzig & Jonas Lötscher & Maryam Akrami & Jane C. Stin, 2023. "Deletion of SNX9 alleviates CD8 T cell exhaustion for effective cellular cancer immunotherapy," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    11. Marina T. Broz & Emily Y. Ko & Kristin Ishaya & Jinfen Xiao & Marco Simone & Xen Ping Hoi & Roberta Piras & Basia Gala & Fernando H. G. Tessaro & Anja Karlstaedt & Sandra Orsulic & Amanda W. Lund & Ke, 2024. "Metabolic targeting of cancer associated fibroblasts overcomes T-cell exclusion and chemoresistance in soft-tissue sarcomas," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
