IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v607y2022i7920d10.1038_s41586-022-04965-x.html
   My bibliography  Save this article

The sequences of 150,119 genomes in the UK Biobank

Author

Listed:
  • Bjarni V. Halldorsson

    (deCODE genetics/Amgen Inc.
    Reykjavik University)

  • Hannes P. Eggertsson

    (deCODE genetics/Amgen Inc.)

  • Kristjan H. S. Moore

    (deCODE genetics/Amgen Inc.)

  • Hannes Hauswedell

    (deCODE genetics/Amgen Inc.)

  • Ogmundur Eiriksson

    (deCODE genetics/Amgen Inc.)

  • Magnus O. Ulfarsson

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Gunnar Palsson

    (deCODE genetics/Amgen Inc.)

  • Marteinn T. Hardarson

    (deCODE genetics/Amgen Inc.
    Reykjavik University)

  • Asmundur Oddsson

    (deCODE genetics/Amgen Inc.)

  • Brynjar O. Jensson

    (deCODE genetics/Amgen Inc.)

  • Snaedis Kristmundsdottir

    (deCODE genetics/Amgen Inc.
    Reykjavik University)

  • Brynja D. Sigurpalsdottir

    (deCODE genetics/Amgen Inc.
    Reykjavik University)

  • Olafur A. Stefansson

    (deCODE genetics/Amgen Inc.)

  • Doruk Beyter

    (deCODE genetics/Amgen Inc.)

  • Guillaume Holley

    (deCODE genetics/Amgen Inc.)

  • Vinicius Tragante

    (deCODE genetics/Amgen Inc.)

  • Arnaldur Gylfason

    (deCODE genetics/Amgen Inc.)

  • Pall I. Olason

    (deCODE genetics/Amgen Inc.)

  • Florian Zink

    (deCODE genetics/Amgen Inc.)

  • Margret Asgeirsdottir

    (deCODE genetics/Amgen Inc.)

  • Sverrir T. Sverrisson

    (deCODE genetics/Amgen Inc.)

  • Brynjar Sigurdsson

    (deCODE genetics/Amgen Inc.)

  • Sigurjon A. Gudjonsson

    (deCODE genetics/Amgen Inc.)

  • Gunnar T. Sigurdsson

    (deCODE genetics/Amgen Inc.)

  • Gisli H. Halldorsson

    (deCODE genetics/Amgen Inc.)

  • Gardar Sveinbjornsson

    (deCODE genetics/Amgen Inc.)

  • Kristjan Norland

    (deCODE genetics/Amgen Inc.)

  • Unnur Styrkarsdottir

    (deCODE genetics/Amgen Inc.)

  • Droplaug N. Magnusdottir

    (deCODE genetics/Amgen Inc.)

  • Steinunn Snorradottir

    (deCODE genetics/Amgen Inc.)

  • Kari Kristinsson

    (deCODE genetics/Amgen Inc.)

  • Emilia Sobech

    (deCODE genetics/Amgen Inc.)

  • Helgi Jonsson

    (Landspitali-University Hospital
    University of Iceland)

  • Arni J. Geirsson

    (Landspitali-University Hospital)

  • Isleifur Olafsson

    (Landspitali-University Hospital)

  • Palmi Jonsson

    (Landspitali-University Hospital
    University of Iceland)

  • Ole Birger Pedersen

    (Zealand University Hospital)

  • Christian Erikstrup

    (Aarhus University
    Aarhus University Hospital)

  • Søren Brunak

    (University of Copenhagen)

  • Sisse Rye Ostrowski

    (Copenhagen University Hospital (Rigshospitalet)
    Copenhagen University)

  • Gudmar Thorleifsson

    (deCODE genetics/Amgen Inc.)

  • Frosti Jonsson

    (deCODE genetics/Amgen Inc.)

  • Pall Melsted

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Ingileif Jonsdottir

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Thorunn Rafnar

    (deCODE genetics/Amgen Inc.)

  • Hilma Holm

    (deCODE genetics/Amgen Inc.)

  • Hreinn Stefansson

    (deCODE genetics/Amgen Inc.)

  • Jona Saemundsdottir

    (deCODE genetics/Amgen Inc.)

  • Daniel F. Gudbjartsson

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Olafur T. Magnusson

    (deCODE genetics/Amgen Inc.)

  • Gisli Masson

    (deCODE genetics/Amgen Inc.)

  • Unnur Thorsteinsdottir

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Agnar Helgason

    (deCODE genetics/Amgen Inc.
    University of Iceland)

  • Hakon Jonsson

    (deCODE genetics/Amgen Inc.)

  • Patrick Sulem

    (deCODE genetics/Amgen Inc.)

  • Kari Stefansson

    (deCODE genetics/Amgen Inc.)

Abstract

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data1,2. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank3. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.

Suggested Citation

  • Bjarni V. Halldorsson & Hannes P. Eggertsson & Kristjan H. S. Moore & Hannes Hauswedell & Ogmundur Eiriksson & Magnus O. Ulfarsson & Gunnar Palsson & Marteinn T. Hardarson & Asmundur Oddsson & Brynjar, 2022. "The sequences of 150,119 genomes in the UK Biobank," Nature, Nature, vol. 607(7920), pages 732-740, July.
  • Handle: RePEc:nat:nature:v:607:y:2022:i:7920:d:10.1038_s41586-022-04965-x
    DOI: 10.1038/s41586-022-04965-x
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-022-04965-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-022-04965-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Aimee M. Deaton & Aditi Dubey & Lucas D. Ward & Peter Dornbos & Jason Flannick & Elaine Yee & Simina Ticau & Leila Noetzli & Margaret M. Parker & Rachel A. Hoffing & Carissa Willis & Mollie E. Plekan , 2022. "Rare loss of function variants in the hepatokine gene INHBE protect from abdominal obesity," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    2. Alexander T. Williams & Jing Chen & Kayesha Coley & Chiara Batini & Abril Izquierdo & Richard Packer & Erik Abner & Stavroula Kanoni & David J. Shepherd & Robert C. Free & Edward J. Hollox & Nigel J. , 2023. "Genome-wide association study of thyroid-stimulating hormone highlights new genes, pathways and associations with thyroid disease," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    3. Scott D. Findlay & Lindsay Romo & Christopher B. Burge, 2024. "Quantifying negative selection in human 3ʹ UTRs uncovers constrained targets of RNA-binding proteins," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    4. Margaret Sunitha Selvaraj & Xihao Li & Zilin Li & Akhil Pampana & David Y. Zhang & Joseph Park & Stella Aslibekyan & Joshua C. Bis & Jennifer A. Brody & Brian E. Cade & Lee-Ming Chuang & Ren-Hua Chung, 2022. "Whole genome sequence analysis of blood lipid levels in >66,000 individuals," Nature Communications, Nature, vol. 13(1), pages 1-18, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:607:y:2022:i:7920:d:10.1038_s41586-022-04965-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.