IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v12y2021i1d10.1038_s41467-020-20850-5.html
   My bibliography  Save this article

PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes

Author

Listed:
  • Sebastian Niehus

    (Regensburg Center for Interventional Immunology (RCI)
    Berlin Institute of Health (BIH)
    Charité—Universitätsmedizin Berlin)

  • Hákon Jónsson

    (deCODE genetics/Amgen Inc.)

  • Janina Schönberger

    (Regensburg Center for Interventional Immunology (RCI)
    Berlin Institute of Health (BIH))

  • Eythór Björnsson

    (deCODE genetics/Amgen Inc.
    University of Iceland
    Landspítali—The National University Hospital of Iceland)

  • Doruk Beyter

    (deCODE genetics/Amgen Inc.)

  • Hannes P. Eggertsson

    (deCODE genetics/Amgen Inc.)

  • Patrick Sulem

    (Charité—Universitätsmedizin Berlin)

  • Kári Stefánsson

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Bjarni V. Halldórsson

    (deCODE genetics/Amgen Inc.
    Reykjavik University)

  • Birte Kehr

    (Regensburg Center for Interventional Immunology (RCI)
    Berlin Institute of Health (BIH)
    Charité—Universitätsmedizin Berlin
    Univeristät Regensburg)

Abstract

Thousands of genomic structural variants (SVs) segregate in the human population and can impact phenotypic traits and diseases. Their identification in whole-genome sequence data of large cohorts is a major computational challenge. Most current approaches identify SVs in single genomes and afterwards merge the identified variants into a joint call set across many genomes. We describe the approach PopDel, which directly identifies deletions of about 500 to at least 10,000 bp in length in data of many genomes jointly, eliminating the need for subsequent variant merging. PopDel scales to tens of thousands of genomes as we demonstrate in evaluations on up to 49,962 genomes. We show that PopDel reliably reports common, rare and de novo deletions. On genomes with available high-confidence reference call sets PopDel shows excellent recall and precision. Genotype inheritance patterns in up to 6794 trios indicate that genotypes predicted by PopDel are more reliable than those of previous SV callers. Furthermore, PopDel’s running time is competitive with the fastest tested previous tools. The demonstrated scalability and accuracy of PopDel enables routine scans for deletions in large-scale sequencing studies.

Suggested Citation

  • Sebastian Niehus & Hákon Jónsson & Janina Schönberger & Eythór Björnsson & Doruk Beyter & Hannes P. Eggertsson & Patrick Sulem & Kári Stefánsson & Bjarni V. Halldórsson & Birte Kehr, 2021. "PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-020-20850-5
    DOI: 10.1038/s41467-020-20850-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-020-20850-5
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-020-20850-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marsha M. Wheeler & Adrienne M. Stilp & Shuquan Rao & Bjarni V. Halldórsson & Doruk Beyter & Jia Wen & Anna V. Mihkaylova & Caitlin P. McHugh & John Lane & Min-Zhi Jiang & Laura M. Raffield & Goo Jun , 2022. "Whole genome sequencing identifies structural variants contributing to hematologic traits in the NHLBI TOPMed program," Nature Communications, Nature, vol. 13(1), pages 1-18, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-020-20850-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.