IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008012.html
   My bibliography  Save this article

Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data

Author

Listed:
  • Xian F Mallory
  • Mohammadamin Edrisi
  • Nicholas Navin
  • Luay Nakhleh

Abstract

Single-cell DNA sequencing technologies are enabling the study of mutations and their evolutionary trajectories in cancer. Somatic copy number aberrations (CNAs) have been implicated in the development and progression of various types of cancer. A wide array of methods for CNA detection has been either developed specifically for or adapted to single-cell DNA sequencing data. Understanding the strengths and limitations that are unique to each of these methods is very important for obtaining accurate copy number profiles from single-cell DNA sequencing data. We benchmarked three widely used methods–Ginkgo, HMMcopy, and CopyNumber–on simulated as well as real datasets. To facilitate this, we developed a novel simulator of single-cell genome evolution in the presence of CNAs. Furthermore, to assess performance on empirical data where the ground truth is unknown, we introduce a phylogeny-based measure for identifying potentially erroneous inferences. While single-cell DNA sequencing is very promising for elucidating and understanding CNAs, our findings show that even the best existing method does not exceed 80% accuracy. New methods that significantly improve upon the accuracy of these three methods are needed. Furthermore, with the large datasets being generated, the methods must be computationally efficient.Author summary: Copy number aberrations, or CNAs, refer to evolutionary events that act on cancer genomes by deleting segments of the genomes or introducing new copies of existing segments. These events have been implicated in various types of cancer; consequently, their accurate detection could shed light on the initiation and progression of tumor, as well as on the development of potential targeted therapeutics. Single-cell DNA sequencing technologies are now producing the type of data that would allow such detection at the resolution of individual cells. However, to achieve this detection task, methods have to implement several steps of “data wrangling” and dealing with technical artifacts. In this work, we benchmarked three widely used methods for CNA detection from single-cell DNA data, namely Ginkgo, HMMcopy, and CopyNumber. To accomplish this study, we developed a novel simulator and devised a phylogeny-based measure of potentially erroneous CNA calls. We find that none of these methods has high accuracy, and all of them can be computationally very demanding. These findings call for the development of more accurate and more efficient methods for CNA detection from single-cell DNA data.

Suggested Citation

  • Xian F Mallory & Mohammadamin Edrisi & Nicholas Navin & Luay Nakhleh, 2020. "Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-24, July.
  • Handle: RePEc:plo:pcbi00:1008012
    DOI: 10.1371/journal.pcbi.1008012
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008012
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008012&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008012?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yong Wang & Jill Waters & Marco L. Leung & Anna Unruh & Whijae Roh & Xiuqing Shi & Ken Chen & Paul Scheet & Selina Vattathil & Han Liang & Asha Multani & Hong Zhang & Rui Zhao & Franziska Michor & Fun, 2014. "Clonal evolution in breast cancer revealed by single nucleus genome sequencing," Nature, Nature, vol. 512(7513), pages 155-160, August.
    2. Ashish Sen & S. Srivastava, 1975. "On tests for detecting change in mean when variance is unknown," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 27(1), pages 479-486, December.
    3. Akdes Serin Harmanci & Arif O. Harmanci & Xiaobo Zhou, 2020. "CaSpER identifies and visualizes CNV events by integrative analysis of single-cell or bulk RNA-sequencing data," Nature Communications, Nature, vol. 11(1), pages 1-16, December.
    4. Nicholas Navin & Jude Kendall & Jennifer Troge & Peter Andrews & Linda Rodgers & Jeanne McIndoo & Kerry Cook & Asya Stepansky & Dan Levy & Diane Esposito & Lakshmi Muthuswamy & Alex Krasnitz & W. Rich, 2011. "Tumour evolution inferred by single-cell sequencing," Nature, Nature, vol. 472(7341), pages 90-94, April.
    5. Christopher A Miller & Oliver Hampton & Cristian Coarfa & Aleksandar Milosavljevic, 2011. "ReadDepth: A Parallel R Package for Detecting Copy Number Alterations from Short Sequencing Reads," PLOS ONE, Public Library of Science, vol. 6(1), pages 1-7, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kahkashan Afrin & Ashif S Iquebal & Mostafa Karimi & Allyson Souris & Se Yoon Lee & Bani K Mallick, 2020. "Directionally dependent multi-view clustering using copula model," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-18, October.
    2. Akshaya Ramakrishnan & Aikaterini Symeonidi & Patrick Hanel & Katharina T. Schmid & Maria L. Richter & Michael Schubert & Maria Colomé-Tatché, 2023. "epiAneufinder identifies copy number alterations from single-cell ATAC-seq data," Nature Communications, Nature, vol. 14(1), pages 1-10, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jinhyun Kim & Sungsik Kim & Huiran Yeom & Seo Woo Song & Kyoungseob Shin & Sangwook Bae & Han Suk Ryu & Ji Young Kim & Ahyoun Choi & Sumin Lee & Taehoon Ryu & Yeongjae Choi & Hamin Kim & Okju Kim & Yu, 2023. "Barcoded multiple displacement amplification for high coverage sequencing in spatial genomics," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    2. Udit Singhal & Srinivas Nallandhighal & Jeffrey J. Tosoian & Kevin Hu & Trinh M. Pham & Judith Stangl-Kremser & Chia-Jen Liu & Razeen Karim & Komal R. Plouffe & Todd M. Morgan & Marcin Cieslik & Rober, 2024. "Integrative multi-region molecular profiling of primary prostate cancer in men with synchronous lymph node metastasis," Nature Communications, Nature, vol. 15(1), pages 1-9, December.
    3. Guillermo Serrano & Nerea Berastegui & Aintzane Díaz-Mazkiaran & Paula García-Olloqui & Carmen Rodriguez-Res & Sofia Huerga-Dominguez & Marina Ainciburu & Amaia Vilas-Zornoza & Patxi San Martin-Uriz &, 2024. "Single-cell transcriptional profile of CD34+ hematopoietic progenitor cells from del(5q) myelodysplastic syndromes and impact of lenalidomide," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    4. Narayanaswamy Balakrishnan & Laurent Bordes & Christian Paroissin & Jean-Christophe Turlot, 2016. "Single change-point detection methods for small lifetime samples," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 79(5), pages 531-551, July.
    5. Bill Russell & Dooruj Rambaccussing, 2019. "Breaks and the statistical process of inflation: the case of estimating the ‘modern’ long-run Phillips curve," Empirical Economics, Springer, vol. 56(5), pages 1455-1475, May.
    6. Humberto Contreras-Trujillo & Jiya Eerdeng & Samir Akre & Du Jiang & Jorge Contreras & Basia Gala & Mary C. Vergel-Rodriguez & Yeachan Lee & Aparna Jorapur & Areen Andreasian & Lisa Harton & Charles S, 2021. "Deciphering intratumoral heterogeneity using integrated clonal tracking and single-cell transcriptome analyses," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    7. Love Michael I. & Myšičková Alena & Sun Ruping & Kalscheuer Vera & Vingron Martin & Haas Stefan A., 2011. "Modeling Read Counts for CNV Detection in Exome Sequencing Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-30, November.
    8. Cheng-Kai Shiau & Lina Lu & Rachel Kieser & Kazutaka Fukumura & Timothy Pan & Hsiao-Yun Lin & Jie Yang & Eric L. Tong & GaHyun Lee & Yuanqing Yan & Jason T. Huse & Ruli Gao, 2023. "High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    9. Joseph Ngatchou-Wandji & Echarif Elharfaoui & Michel Harel, 2022. "On change-points tests based on two-samples U-Statistics for weakly dependent observations," Statistical Papers, Springer, vol. 63(1), pages 287-316, February.
    10. Peter Bailey & Rachel A. Ridgway & Patrizia Cammareri & Mairi Treanor-Taylor & Ulla-Maja Bailey & Christina Schoenherr & Max Bone & Daniel Schreyer & Karin Purdie & Jason Thomson & William Rickaby & R, 2023. "Driver gene combinations dictate cutaneous squamous cell carcinoma disease continuum progression," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    11. Noushin Niknafs & Violeta Beleva-Guthrie & Daniel Q Naiman & Rachel Karchin, 2015. "SubClonal Hierarchy Inference from Somatic Mutations: Automatic Reconstruction of Cancer Evolutionary Trees from Multi-region Next Generation Sequencing," PLOS Computational Biology, Public Library of Science, vol. 11(10), pages 1-26, October.
    12. Paul J. Plummer & Jie Chen, 2014. "A Bayesian approach for locating change points in a compound Poisson process with application to detecting DNA copy number variations," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(2), pages 423-438, February.
    13. Venkata Jandhyala & Stergios Fotopoulos & Ian MacNeill & Pengyu Liu, 2013. "Inference for single and multiple change-points in time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 34(4), pages 423-446, July.
    14. Brennen T. Fagan & Marina I. Knight & Niall J. MacKay & A. Jamie Wood, 2020. "Change point analysis of historical battle deaths," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 909-933, June.
    15. Li, Boyan & Diao, Xundi, 2023. "Structural break in different stock index markets in China," The North American Journal of Economics and Finance, Elsevier, vol. 65(C).
    16. Rongting Huang & Xianjie Huang & Yin Tong & Helen Y. N. Yan & Suet Yi Leung & Oliver Stegle & Yuanhua Huang, 2024. "Robust analysis of allele-specific copy number alterations from scRNA-seq data with XClone," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    17. Fuqi Chen & Rogemar Mamon & Sévérien Nkurunziza, 2018. "Inference for a change-point problem under a generalised Ornstein–Uhlenbeck setting," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(4), pages 807-853, August.
    18. Yidong Zhou & Changjun Wang & Hanjiang Zhu & Yan Lin & Bo Pan & Xiaohui Zhang & Xin Huang & Qianqian Xu & Yali Xu & Qiang Sun, 2016. "Diagnostic Accuracy of PIK3CA Mutation Detection by Circulating Free DNA in Breast Cancer: A Meta-Analysis of Diagnostic Test Accuracy," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-15, June.
    19. Akshaya Ramakrishnan & Aikaterini Symeonidi & Patrick Hanel & Katharina T. Schmid & Maria L. Richter & Michael Schubert & Maria Colomé-Tatché, 2023. "epiAneufinder identifies copy number alterations from single-cell ATAC-seq data," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    20. Sandip Sinharay, 2017. "Some Remarks on Applications of Tests for Detecting A Change Point to Psychometric Problems," Psychometrika, Springer;The Psychometric Society, vol. 82(4), pages 1149-1161, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008012. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.