IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-29212-9.html
   My bibliography  Save this article

Comprehensive generation, visualization, and reporting of quality control metrics for single-cell RNA sequencing data

Author

Listed:
  • Rui Hong

    (Bioinformatics Program, Boston University
    Section of Computational Biomedicine, Boston University School of Medicine)

  • Yusuke Koga

    (Bioinformatics Program, Boston University
    Section of Computational Biomedicine, Boston University School of Medicine)

  • Shruthi Bandyadka

    (Bioinformatics Program, Boston University
    Boston University)

  • Anastasia Leshchyk

    (Bioinformatics Program, Boston University
    Section of Computational Biomedicine, Boston University School of Medicine)

  • Yichen Wang

    (Section of Computational Biomedicine, Boston University School of Medicine)

  • Vidya Akavoor

    (Section of Computational Biomedicine, Boston University School of Medicine
    Software & Application Innovation Lab, Rafik B. Hariri Institute for Computing and Computational Science and Engineering)

  • Xinyun Cao

    (Software & Application Innovation Lab, Rafik B. Hariri Institute for Computing and Computational Science and Engineering)

  • Irzam Sarfraz

    (Section of Computational Biomedicine, Boston University School of Medicine)

  • Zhe Wang

    (Bioinformatics Program, Boston University
    Section of Computational Biomedicine, Boston University School of Medicine)

  • Salam Alabdullatif

    (Section of Computational Biomedicine, Boston University School of Medicine)

  • Frederick Jansen

    (Software & Application Innovation Lab, Rafik B. Hariri Institute for Computing and Computational Science and Engineering)

  • Masanao Yajima

    (Boston University)

  • W. Evan Johnson

    (Bioinformatics Program, Boston University
    Section of Computational Biomedicine, Boston University School of Medicine)

  • Joshua D. Campbell

    (Bioinformatics Program, Boston University
    Section of Computational Biomedicine, Boston University School of Medicine)

Abstract

Single-cell RNA sequencing (scRNA-seq) can be used to gain insights into cellular heterogeneity within complex tissues. However, various technical artifacts can be present in scRNA-seq data and should be assessed before performing downstream analyses. While several tools have been developed to perform individual quality control (QC) tasks, they are scattered in different packages across several programming environments. Here, to streamline the process of generating and visualizing QC metrics for scRNA-seq data, we built the SCTK-QC pipeline within the singleCellTK R package. The SCTK-QC workflow can import data from several single-cell platforms and preprocessing tools and includes steps for empty droplet detection, generation of standard QC metrics, prediction of doublets, and estimation of ambient RNA. It can run on the command line, within the R console, on the cloud platform or with an interactive graphical user interface. Overall, the SCTK-QC pipeline streamlines and standardizes the process of performing QC for scRNA-seq data.

Suggested Citation

  • Rui Hong & Yusuke Koga & Shruthi Bandyadka & Anastasia Leshchyk & Yichen Wang & Vidya Akavoor & Xinyun Cao & Irzam Sarfraz & Zhe Wang & Salam Alabdullatif & Frederick Jansen & Masanao Yajima & W. Evan, 2022. "Comprehensive generation, visualization, and reporting of quality control metrics for single-cell RNA sequencing data," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-29212-9
    DOI: 10.1038/s41467-022-29212-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-29212-9
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-29212-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Orit Rozenblatt-Rosen & Michael J. T. Stubbington & Aviv Regev & Sarah A. Teichmann, 2017. "The Human Cell Atlas: from vision to reality," Nature, Nature, vol. 550(7677), pages 451-453, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christoph Ziegenhain & Rickard Sandberg, 2021. "BAMboozle removes genetic variation from human sequence data for open data sharing," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    2. Aiko Sekita & Hiroshi Kawasaki & Ayano Fukushima-Nomura & Kiyoshi Yashiro & Keiji Tanese & Susumu Toshima & Koichi Ashizaki & Tomohiro Miyai & Junshi Yazaki & Atsuo Kobayashi & Shinichi Namba & Tatsuh, 2023. "Multifaceted analysis of cross-tissue transcriptomes reveals phenotype–endotype associations in atopic dermatitis," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    3. Yingxin Lin & Yue Cao & Elijah Willie & Ellis Patrick & Jean Y. H. Yang, 2023. "Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    4. Luisa Santus & Maria Sopena-Rios & Raquel García-Pérez & Aaron E. Lin & Gordon C. Adams & Kayla G. Barnes & Katherine J. Siddle & Shirlee Wohl & Ferran Reverter & John L. Rinn & Richard S. Bennett & L, 2023. "Single-cell profiling of lncRNA expression during Ebola virus infection in rhesus macaques," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    5. Rongbo Shen & Lin Liu & Zihan Wu & Ying Zhang & Zhiyuan Yuan & Junfu Guo & Fan Yang & Chao Zhang & Bichao Chen & Wanwan Feng & Chao Liu & Jing Guo & Guozhen Fan & Yong Zhang & Yuxiang Li & Xun Xu & Ji, 2022. "Spatial-ID: a cell typing method for spatially resolved transcriptomics via transfer learning and spatial embedding," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    6. Andrew Jones & Diana Cai & Didong Li & Barbara E. Engelhardt, 2024. "Optimizing the design of spatial genomic studies," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    7. Erping Long & Montserrat García-Closas & Stephen J. Chanock & M. Constanza Camargo & Nicholas E. Banovich & Jiyeon Choi, 2022. "The case for increasing diversity in tissue-based functional genomics datasets to understand human disease susceptibility," Nature Communications, Nature, vol. 13(1), pages 1-4, December.
    8. Abhishek Nagaraj, 2022. "T he P rivate I mpact of P ublic D ata: L andsat S atellite M aps I ncreased G old D iscoveries and E ncouraged E ntry," Management Science, INFORMS, vol. 68(1), pages 564-582, January.
    9. Claudia Ctortecka & Natalie M. Clark & Brian W. Boyle & Anjali Seth & D. R. Mani & Namrata D. Udeshi & Steven A. Carr, 2024. "Automated single-cell proteomics providing sufficient proteome depth to study complex biology beyond cell type classifications," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    10. Chang Hoon Cho & Ilana Vasilisa Deyneko & Dylann Cordova-Martinez & Juan Vazquez & Anne S. Maguire & Jenny R. Diaz & Abigail U. Carbonell & Jaafar O. Tindi & Min-Hui Cui & Roman Fleysher & Sophie Molh, 2023. "ANKS1B encoded AIDA-1 regulates social behaviors by controlling oligodendrocyte function," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    11. Katharina T. Schmid & Barbara Höllbacher & Cristiana Cruceanu & Anika Böttcher & Heiko Lickert & Elisabeth B. Binder & Fabian J. Theis & Matthias Heinig, 2021. "scPower accelerates and optimizes the design of multi-sample single cell transcriptomic studies," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    12. Ajita Shree & Musale Krushna Pavan & Hamim Zafar, 2023. "scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    13. Licai Huang & Paul Little & Jeroen R. Huyghe & Qian Shi & Tabitha A. Harrison & Greg Yothers & Thomas J. George & Ulrike Peters & Andrew T. Chan & Polly A. Newcomb & Wei Sun, 2021. "A Statistical Method for Association Analysis of Cell Type Compositions," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 373-385, December.
    14. Lindsay A. Rutter & Henry Cope & Matthew J. MacKay & Raúl Herranz & Saswati Das & Sergey A. Ponomarev & Sylvain V. Costes & Amber M. Paul & Richard Barker & Deanne M. Taylor & Daniela Bezdan & Nathani, 2024. "Astronaut omics and the impact of space on the human body at scale," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    15. Joyce B. Kang & Aparna Nathan & Kathryn Weinand & Fan Zhang & Nghia Millard & Laurie Rumker & D. Branch Moody & Ilya Korsunsky & Soumya Raychaudhuri, 2021. "Efficient and precise single-cell reference atlas mapping with Symphony," Nature Communications, Nature, vol. 12(1), pages 1-21, December.
    16. Lei Xiong & Kang Tian & Yuzhe Li & Weixi Ning & Xin Gao & Qiangfeng Cliff Zhang, 2022. "Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    17. Qihuang Zhang & Shunzhou Jiang & Amelia Schroeder & Jian Hu & Kejie Li & Baohong Zhang & David Dai & Edward B. Lee & Rui Xiao & Mingyao Li, 2023. "Leveraging spatial transcriptomics data to recover cell locations in single-cell RNA-seq with CeLEry," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    18. Ziye Xu & Tianyu Zhang & Hongyu Chen & Yuyi Zhu & Yuexiao Lv & Shunji Zhang & Jiaye Chen & Haide Chen & Lili Yang & Weiqin Jiang & Shengyu Ni & Fangru Lu & Zhaolun Wang & Hao Yang & Ling Dong & Feng C, 2023. "High-throughput single nucleus total RNA sequencing of formalin-fixed paraffin-embedded tissues by snRandom-seq," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    19. Amir Alavi & Ziv Bar-Joseph, 2020. "Iterative point set registration for aligning scRNA-seq data," PLOS Computational Biology, Public Library of Science, vol. 16(10), pages 1-21, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-29212-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.