IDEAS home Printed from https://ideas.repec.org/a/nat/nathum/v8y2024i8d10.1038_s41562-024-01909-5.html
   My bibliography  Save this article

Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation

Author

Listed:
  • Caitlin E. Carey

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Massachusetts General Hospital)

  • Rebecca Shafee

    (Broad Institute of MIT and Harvard
    Harvard Medical School
    National Institute of Mental Health)

  • Robbee Wedow

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Purdue University
    Indiana University School of Medicine)

  • Amanda Elliott

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Harvard Medical School)

  • Duncan S. Palmer

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard
    Medical Sciences Division University of Oxford)

  • John Compitello

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard)

  • Masahiro Kanai

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard)

  • Liam Abbott

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital)

  • Patrick Schultz

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard)

  • Konrad J. Karczewski

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital)

  • Samuel C. Bryant

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital)

  • Caroline M. Cusick

    (Broad Institute of MIT and Harvard)

  • Claire Churchhouse

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard)

  • Daniel P. Howrigan

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital)

  • Daniel King

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard)

  • George Davey Smith

    (Broad Institute of MIT and Harvard
    University of Bristol, Oakfield House
    University of Bristol)

  • Benjamin M. Neale

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Massachusetts General Hospital
    Broad Institute of MIT and Harvard)

  • Raymond K. Walters

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Harvard Medical School)

  • Elise B. Robinson

    (Broad Institute of MIT and Harvard
    Massachusetts General Hospital
    Massachusetts General Hospital)

Abstract

Data within biobanks capture broad yet detailed indices of human variation, but biobank-wide insights can be difficult to extract due to complexity and scale. Here, using large-scale factor analysis, we distill hundreds of variables (diagnoses, assessments and survey items) into 35 latent constructs, using data from unrelated individuals with predominantly estimated European genetic ancestry in UK Biobank. These factors recapitulate known disease classifications, disentangle elements of socioeconomic status, highlight the relevance of psychiatric constructs to health and improve measurement of pro-health behaviours. We go on to demonstrate the power of this approach to clarify genetic signal, enhance discovery and identify associations between underlying phenotypic structure and health outcomes. In building a deeper understanding of ways in which constructs such as socioeconomic status, trauma, or physical activity are structured in the dataset, we emphasize the importance of considering the interwoven nature of the human phenome when evaluating public health patterns.

Suggested Citation

  • Caitlin E. Carey & Rebecca Shafee & Robbee Wedow & Amanda Elliott & Duncan S. Palmer & John Compitello & Masahiro Kanai & Liam Abbott & Patrick Schultz & Konrad J. Karczewski & Samuel C. Bryant & Caro, 2024. "Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation," Nature Human Behaviour, Nature, vol. 8(8), pages 1599-1615, August.
  • Handle: RePEc:nat:nathum:v:8:y:2024:i:8:d:10.1038_s41562-024-01909-5
    DOI: 10.1038/s41562-024-01909-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41562-024-01909-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41562-024-01909-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nathum:v:8:y:2024:i:8:d:10.1038_s41562-024-01909-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.