IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v583y2020i7814d10.1038_s41586-020-2371-0.html
   My bibliography  Save this article

Mapping and characterization of structural variation in 17,795 human genomes

Author

Listed:
  • Haley J. Abel

    (Washington University School of Medicine
    Washington University School of Medicine)

  • David E. Larson

    (Washington University School of Medicine
    Washington University School of Medicine)

  • Allison A. Regier

    (Washington University School of Medicine
    Washington University School of Medicine)

  • Colby Chiang

    (Washington University School of Medicine)

  • Indraniel Das

    (Washington University School of Medicine)

  • Krishna L. Kanchi

    (Washington University School of Medicine)

  • Ryan M. Layer

    (University of Colorado
    University of Colorado)

  • Benjamin M. Neale

    (Broad Institute of MIT and Harvard
    Broad Institute of MIT and Harvard
    Massachusetts General Hospital)

  • William J. Salerno

    (Baylor College of Medicine)

  • Catherine Reeves

    (New York Genome Center)

  • Steven Buyske

    (Rutgers University)

  • Tara C. Matise

    (Rutgers University)

  • Donna M. Muzny

    (Baylor College of Medicine)

  • Michael C. Zody

    (New York Genome Center)

  • Eric S. Lander

    (Broad Institute of MIT and Harvard
    Massachusetts Institute of Technology
    Harvard Medical School)

  • Susan K. Dutcher

    (Washington University School of Medicine
    Washington University School of Medicine)

  • Nathan O. Stitziel

    (Washington University School of Medicine
    Washington University School of Medicine
    Washington University School of Medicine)

  • Ira M. Hall

    (Washington University School of Medicine
    Washington University School of Medicine
    Washington University School of Medicine)

Abstract

A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0–11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.

Suggested Citation

  • Haley J. Abel & David E. Larson & Allison A. Regier & Colby Chiang & Indraniel Das & Krishna L. Kanchi & Ryan M. Layer & Benjamin M. Neale & William J. Salerno & Catherine Reeves & Steven Buyske & Tar, 2020. "Mapping and characterization of structural variation in 17,795 human genomes," Nature, Nature, vol. 583(7814), pages 83-89, July.
  • Handle: RePEc:nat:nature:v:583:y:2020:i:7814:d:10.1038_s41586-020-2371-0
    DOI: 10.1038/s41586-020-2371-0
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-020-2371-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-020-2371-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Orshay Gabay & Yoav Shoshan & Eli Kopel & Udi Ben-Zvi & Tomer D. Mann & Noam Bressler & Roni Cohen‐Fultheim & Amos A. Schaffer & Shalom Hillel Roth & Ziv Tzur & Erez Y. Levanon & Eli Eisenberg, 2022. "Landscape of adenosine-to-inosine RNA recoding across human tissues," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    2. Zhikun Wu & Zehang Jiang & Tong Li & Chuanbo Xie & Liansheng Zhao & Jiaqi Yang & Shuai Ouyang & Yizhi Liu & Tao Li & Zhi Xie, 2021. "Structural variants in the Chinese population and their impact on phenotypes, diseases and population adaptation," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    3. Ludovica Montanucci & David Lewis-Smith & Ryan L. Collins & Lisa-Marie Niestroj & Shridhar Parthasarathy & Julie Xian & Shiva Ganesan & Marie Macnee & Tobias Brünger & Rhys H. Thomas & Michael Talkows, 2023. "Genome-wide identification and phenotypic characterization of seizure-associated copy number variations in 741,075 individuals," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    4. Ramesh Rajaby & Dong-Xu Liu & Chun Hang Au & Yuen-Ting Cheung & Amy Yuet Ting Lau & Qing-Yong Yang & Wing-Kin Sung, 2023. "INSurVeyor: improving insertion calling from short read sequencing data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    5. Xue Gao & Sheng Wang & Yan-Fen Wang & Shuang Li & Shi-Xin Wu & Rong-Ge Yan & Yi-Wen Zhang & Rui-Dong Wan & Zhen He & Ren-De Song & Xin-Quan Zhao & Dong-Dong Wu & Qi-En Yang, 2022. "Long read genome assemblies complemented by single cell RNA-sequencing reveal genetic and cellular mechanisms underlying the adaptive evolution of yak," Nature Communications, Nature, vol. 13(1), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:583:y:2020:i:7814:d:10.1038_s41586-020-2371-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.