IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-47884-3.html
   My bibliography  Save this article

scLENS: data-driven signal detection for unbiased scRNA-seq data analysis

Author

Listed:
  • Hyun Kim

    (Institute for Basic Science)

  • Won Chang

    (University of Cincinnati)

  • Seok Joo Chae

    (Institute for Basic Science
    KAIST)

  • Jong-Eun Park

    (KAIST)

  • Minseok Seo

    (Korea University)

  • Jae Kyoung Kim

    (Institute for Basic Science
    KAIST)

Abstract

High dimensionality and noise have limited the new biological insights that can be discovered in scRNA-seq data. While dimensionality reduction tools have been developed to extract biological signals from the data, they often require manual determination of signal dimension, introducing user bias. Furthermore, a common data preprocessing method, log normalization, can unintentionally distort signals in the data. Here, we develop scLENS, a dimensionality reduction tool that circumvents the long-standing issues of signal distortion and manual input. Specifically, we identify the primary cause of signal distortion during log normalization and effectively address it by uniformizing cell vector lengths with L2 normalization. Furthermore, we utilize random matrix theory-based noise filtering and a signal robustness test to enable data-driven determination of the threshold for signal dimensions. Our method outperforms 11 widely used dimensionality reduction tools and performs particularly well for challenging scRNA-seq datasets with high sparsity and variability. To facilitate the use of scLENS, we provide a user-friendly package that automates accurate signal detection of scRNA-seq data without manual time-consuming tuning.

Suggested Citation

  • Hyun Kim & Won Chang & Seok Joo Chae & Jong-Eun Park & Minseok Seo & Jae Kyoung Kim, 2024. "scLENS: data-driven signal detection for unbiased scRNA-seq data analysis," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-47884-3
    DOI: 10.1038/s41467-024-47884-3
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-47884-3
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-47884-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Peng Qiu, 2020. "Embracing the dropouts in single-cell RNA-seq analysis," Nature Communications, Nature, vol. 11(1), pages 1-9, December.
    2. John Horn, 1965. "A rationale and test for the number of factors in factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 30(2), pages 179-185, June.
    3. Jong Kyoung Kim & Aleksandra A. Kolodziejczyk & Tomislav Ilicic & Sarah A. Teichmann & John C. Marioni, 2015. "Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression," Nature Communications, Nature, vol. 6(1), pages 1-9, December.
    4. Koen Van den Berge & Hector Roux de Bézieux & Kelly Street & Wouter Saelens & Robrecht Cannoodt & Yvan Saeys & Sandrine Dudoit & Lieven Clement, 2020. "Trajectory-based differential expression analysis for single-cell sequencing data," Nature Communications, Nature, vol. 11(1), pages 1-13, December.
    5. Suoqin Jin & Christian F. Guerrero-Juarez & Lihua Zhang & Ivan Chang & Raul Ramos & Chen-Hsiang Kuan & Peggy Myung & Maksim V. Plikus & Qing Nie, 2021. "Inference and analysis of cell-cell communication using CellChat," Nature Communications, Nature, vol. 12(1), pages 1-20, December.
    6. Roshan M. Kumar & Patrick Cahan & Alex K. Shalek & Rahul Satija & A. Jay DaleyKeyser & Hu Li & Jin Zhang & Keith Pardee & David Gennert & John J. Trombetta & Thomas C. Ferrante & Aviv Regev & George Q, 2014. "Deconstructing transcriptional heterogeneity in pluripotent stem cells," Nature, Nature, vol. 516(7529), pages 56-61, December.
    7. George C. Linderman & Jun Zhao & Manolis Roulis & Piotr Bielecki & Richard A. Flavell & Boaz Nadler & Yuval Kluger, 2022. "Zero-preserving imputation of single-cell RNA-seq data," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    8. Shahin Mohammadi & Jose Davila-Velderrain & Manolis Kellis, 2020. "A multiresolution framework to characterize single-cell state landscapes," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    9. Zhijian Li & Christoph Kuppe & Susanne Ziegler & Mingbo Cheng & Nazanin Kabgani & Sylvia Menzel & Martin Zenke & Rafael Kramann & Ivan G. Costa, 2021. "Chromatin-accessibility estimation from single-cell ATAC-seq data with scOpen," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    10. Duc Tran & Hung Nguyen & Bang Tran & Carlo La Vecchia & Hung N. Luu & Tin Nguyen, 2021. "Fast and precise single-cell data analysis using a hierarchical autoencoder," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    11. Wei Vivian Li & Jingyi Jessica Li, 2018. "An accurate and robust imputation method scImpute for single-cell RNA-seq data," Nature Communications, Nature, vol. 9(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lingfei Wang, 2021. "Single-cell normalization and association testing unifying CRISPR screen and gene co-expression analyses with Normalisr," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    2. Yichuan Cao & Xiamiao Zhao & Songming Tang & Qun Jiang & Sijie Li & Siyu Li & Shengquan Chen, 2024. "scButterfly: a versatile single-cell cross-modality translation method via dual-aligned variational autoencoders," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    3. Jeff DeMartino & Michael T. Meister & Lindy L. Visser & Mariël Brok & Marian J. A. Groot Koerkamp & Amber K. L. Wezenaar & Laura S. Hiemcke-Jiwa & Terezinha Souza & Johannes H. M. Merks & Anne C. Rios, 2023. "Single-cell transcriptomics reveals immune suppression and cell states predictive of patient outcomes in rhabdomyosarcoma," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    4. Maxime Brunner & David Lopez-Rodriguez & Judith Estrada-Meza & Rafik Dali & Antoine Rohrbach & Tamara Deglise & Andrea Messina & Bernard Thorens & Federico Santoni & Fanny Langlet, 2024. "Fasting induces metabolic switches and spatial redistributions of lipid processing and neuronal interactions in tanycytes," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    5. Sarah Cappuyns & Gino Philips & Vincent Vandecaveye & Bram Boeckx & Rogier Schepers & Thomas Van Brussel & Ingrid Arijs & Aurelie Mechels & Ayse Bassez & Francesca Lodi & Joris Jaekers & Halit Topal &, 2023. "PD-1- CD45RA+ effector-memory CD8 T cells and CXCL10+ macrophages are associated with response to atezolizumab plus bevacizumab in advanced hepatocellular carcinoma," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    6. Isaac Dean & Colin Y. C. Lee & Zewen K. Tuong & Zhi Li & Christopher A. Tibbitt & Claire Willis & Fabrina Gaspal & Bethany C. Kennedy & Veronika Matei-Rascu & Rémi Fiancette & Caroline Nordenvall & Ul, 2024. "Rapid functional impairment of natural killer cells following tumor entry limits anti-tumor immunity," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    7. Zhuohan Yu & Yanchi Su & Yifu Lu & Yuning Yang & Fuzhou Wang & Shixiong Zhang & Yi Chang & Ka-Chun Wong & Xiangtao Li, 2023. "Topological identification and interpretation for single-cell gene regulation elucidation across multiple platforms using scMGCA," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    8. Irfete S. Fetahu & Wolfgang Esser-Skala & Rohit Dnyansagar & Samuel Sindelar & Fikret Rifatbegovic & Andrea Bileck & Lukas Skos & Eva Bozsaky & Daria Lazic & Lisa Shaw & Marcus Tötzl & Dora Tarlungean, 2023. "Single-cell transcriptomics and epigenomics unravel the role of monocytes in neuroblastoma bone marrow metastasis," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    9. Songming Tang & Xuejian Cui & Rongxiang Wang & Sijie Li & Siyu Li & Xin Huang & Shengquan Chen, 2024. "scCASE: accurate and interpretable enhancement for single-cell chromatin accessibility sequencing data," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    10. Benjamin L. Walker & Qing Nie, 2023. "NeST: nested hierarchical structure identification in spatial transcriptomic data," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    11. Malosree Maitra & Haruka Mitsuhashi & Reza Rahimian & Anjali Chawla & Jennie Yang & Laura M. Fiori & Maria Antonietta Davoli & Kelly Perlman & Zahia Aouabed & Deborah C. Mash & Matthew Suderman & Nagu, 2023. "Cell type specific transcriptomic differences in depression show similar patterns between males and females but implicate distinct cell types and genes," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    12. Eloise Berson & Anjali Sreenivas & Thanaphong Phongpreecha & Amalia Perna & Fiorella C. Grandi & Lei Xue & Neal G. Ravindra & Neelufar Payrovnaziri & Samson Mataraso & Yeasul Kim & Camilo Espinosa & A, 2023. "Whole genome deconvolution unveils Alzheimer’s resilient epigenetic signature," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    13. Maria Lidia Mascia & Mirian Agus & Łukasz Tomczyk & Natale Salvatore Bonfiglio & Diego Bellini & Maria Pietronilla Penna, 2023. "Smartphone Distraction: Italian Validation of the Smartphone Distraction Scale (SDS)," IJERPH, MDPI, vol. 20(15), pages 1-15, August.
    14. Patrick Hylton & Ben Kisby & Paul Goddard, 2018. "Young People’s Citizen Identities: A Q-Methodological Analysis of English Youth Perceptions of Citizenship in Britain," Societies, MDPI, vol. 8(4), pages 1-21, December.
    15. Orazio Attanasio & Sarah Cattan & Emla Fitzsimons & Costas Meghir & Marta Rubio-Codina, 2020. "Estimating the Production Function for Human Capital: Results from a Randomized Controlled Trial in Colombia," American Economic Review, American Economic Association, vol. 110(1), pages 48-85, January.
    16. Yanchuan Li & Huamei Li & Cheng Peng & Ge Meng & Yijun Lu & Honglin Liu & Li Cui & Huan Zhou & Zhu Xu & Lingyun Sun & Lihong Liu & Qing Xiong & Beicheng Sun & Shiping Jiao, 2024. "Unraveling the spatial organization and development of human thymocytes through integration of spatial transcriptomics and single-cell multi-omics profiling," Nature Communications, Nature, vol. 15(1), pages 1-25, December.
    17. Franke, Nikolaus & Shah, Sonali, 2003. "How communities support innovative activities: an exploration of assistance and sharing among end-users," Research Policy, Elsevier, vol. 32(1), pages 157-178, January.
    18. Elise Verot & Paul Bonjean & Robin Chaux & Julie Gagnaire & Amandine Gagneux-Brunon & Bruno Pozzetto & Philippe Berthelot & Elisabeth Botelho-Nevers & Franck Chauvin, 2022. "Development and Validation of the COVID-19 Knowledges and Behavior Questionnaire in a French Population (CoVQuest-CC)," IJERPH, MDPI, vol. 19(5), pages 1-22, February.
    19. Zaitun Mohd Saman & Ab Hamid Siti-Azrin & Azizah Othman & Yee Cheng Kueh, 2021. "The Validity and Reliability of the Malay Version of the Cyberbullying Scale among Secondary School Adolescents in Malaysia," IJERPH, MDPI, vol. 18(21), pages 1-12, November.
    20. Michele Villa & Colette Balice-Bourgois & Angela Tolotti & Anna Falcó-Pegueroles & Serena Barello & Elena Corina Luca & Luca Clivio & Annette Biegger & Dario Valcarenghi & Loris Bonetti, 2021. "Ethical Conflict and Its Psychological Correlates among Hospital Nurses in the Pandemic: A Cross-Sectional Study within Swiss COVID-19 and Non-COVID-19 Wards," IJERPH, MDPI, vol. 18(22), pages 1-14, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-47884-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.