IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-42491-0.html

Unappreciated subcontinental admixture in Europeans and European Americans and implications for genetic epidemiology studies

Authors

Listed:
  • Mateus H. Gouveia (National Institutes of Health)
  • Amy R. Bentley (National Institutes of Health)
  • Thiago P. Leal (Cleveland Clinic)
  • Eduardo Tarazona-Santos (Universidade Federal de Minas Gerais)
  • Carlos D. Bustamante (Stanford University)
  • Adebowale A. Adeyemo (National Institutes of Health)
  • Charles N. Rotimi (National Institutes of Health)
  • Daniel Shriner (National Institutes of Health)

Abstract

European-ancestry populations are recognized as stratified but not as admixed, implying that residual confounding by locus-specific ancestry can affect studies of association, polygenic adaptation, and polygenic risk scores. We integrate individual-level genome-wide data from ~19,000 European-ancestry individuals across 79 European populations and five European American cohorts. We generate a new reference panel that captures ancestral diversity missed by both the 1000 Genomes and Human Genome Diversity Projects. Both Europeans and European Americans are admixed at the subcontinental level, with admixture dates differing among subgroups of European Americans. After adjustment for both genome-wide and locus-specific ancestry, associations between a highly differentiated variant in LCT (rs4988235) and height or LDL-cholesterol were confirmed to be false positives whereas the association between LCT and body mass index was genuine. We provide formal evidence of subcontinental admixture in individuals with European ancestry, which, if not properly accounted for, can produce spurious results in genetic epidemiology studies.
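
To make concrete why adjusting for genome-wide ancestry alone can leave residual confounding, the following toy simulation may help. It is only a sketch under stated assumptions, not the authors' pipeline: the two source populations, the allele frequencies (0.8 and 0.2), the effect sizes, and all function names are hypothetical. The phenotype depends on local ancestry at the locus and on genome-wide ancestry, but not on the genotype itself, so any non-zero genotype coefficient is spurious.

```python
import numpy as np


def simulate_admixed_cohort(n=20000, seed=0):
    """Simulate a two-way admixed cohort (all parameters are illustrative)."""
    rng = np.random.default_rng(seed)
    # Genome-wide proportion of ancestry from hypothetical source population A.
    theta = rng.beta(2.0, 2.0, size=n)
    # Local ancestry: number of haplotypes (0, 1, 2) at the locus inherited from A.
    local = rng.binomial(2, theta)
    # Genotype: allele frequency 0.8 on A-derived haplotypes, 0.2 on B-derived ones.
    geno = rng.binomial(local, 0.8) + rng.binomial(2 - local, 0.2)
    # Phenotype depends on local and genome-wide ancestry, NOT on the genotype.
    pheno = 0.5 * local + 1.0 * theta + rng.normal(size=n)
    return theta, local, geno, pheno


def genotype_coefficient(pheno, geno, *covariates):
    """Ordinary least squares coefficient of the genotype, given covariates."""
    X = np.column_stack([np.ones(len(pheno)), geno, *covariates])
    beta, *_ = np.linalg.lstsq(X, pheno, rcond=None)
    return float(beta[1])


def genotype_effects(n=20000, seed=0):
    """Genotype coefficient under three levels of ancestry adjustment."""
    theta, local, geno, pheno = simulate_admixed_cohort(n, seed)
    return {
        "unadjusted": genotype_coefficient(pheno, geno),
        "global_only": genotype_coefficient(pheno, geno, theta),
        "global_plus_local": genotype_coefficient(pheno, geno, theta, local),
    }


if __name__ == "__main__":
    for model, coef in genotype_effects().items():
        print(f"{model:>18}: {coef:+.3f}")
```

In this setup the genotype coefficient stays clearly above zero when only genome-wide ancestry is included, and shrinks toward zero once local ancestry is added as a covariate. This mirrors, in a deliberately simplified form, the abstract's point about residual confounding by locus-specific ancestry.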

Suggested Citation

  • Mateus H. Gouveia & Amy R. Bentley & Thiago P. Leal & Eduardo Tarazona-Santos & Carlos D. Bustamante & Adebowale A. Adeyemo & Charles N. Rotimi & Daniel Shriner, 2023. "Unappreciated subcontinental admixture in Europeans and European Americans and implications for genetic epidemiology studies," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-42491-0
    DOI: 10.1038/s41467-023-42491-0

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-42491-0
    File Function: Abstract
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Isabel Alves & Joanna Giemza & Michael G. B. Blum & Carolina Bernhardsson & Stéphanie Chatel & Matilde Karakachoff & Aude Pierre & Anthony F. Herzig & Robert Olaso & Martial Monteil & Véronique Gallie, 2024. "Human genetic structure in Northwest France provides new insights into West European historical demography," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    2. Bárbara Sousa da Mota & Simone Rubinacci & Diana Ivette Cruz Dávalos & Carlos Eduardo G. Amorim & Martin Sikora & Niels N. Johannsen & Marzena H. Szmyt & Piotr Włodarczak & Anita Szczepanek & Marcin M, 2023. "Imputation of ancient human genomes," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    3. Jerome Kelleher & Alison M Etheridge & Gilean McVean, 2016. "Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-22, May.
    4. Rozaimi Mohamad Razali & Juan Rodriguez-Flores & Mohammadmersad Ghorbani & Haroon Naeem & Waleed Aamer & Elbay Aliyev & Ali Jubran & Andrew G. Clark & Khalid A. Fakhro & Younes Mokrab, 2021. "Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    5. Oscar Lao & Fan Liu & Andreas Wollstein & Manfred Kayser, 2014. "GAGA: A New Algorithm for Genomic Inference of Geographic Ancestry Reveals Fine Level Population Substructure in Europeans," PLOS Computational Biology, Public Library of Science, vol. 10(2), pages 1-11, February.
    6. Marco Lopez-Cruz & Fernando M. Aguate & Jacob D. Washburn & Natalia Leon & Shawn M. Kaeppler & Dayane Cristina Lima & Ruijuan Tan & Addie Thompson & Laurence Willard Bretonne & Gustavo los Campos, 2023. "Leveraging data from the Genomes-to-Fields Initiative to investigate genotype-by-environment interactions in maize in North America," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    7. Beatrix Eugster & Rafael Lalive & Andreas Steinhauer & Josef Zweimüller, 2011. "The Demand for Social Insurance: Does Culture Matter?," Economic Journal, Royal Economic Society, vol. 121(556), pages 413-448, November.
    8. Parsa Akbari & Olukayode A. Sosina & Jonas Bovijn & Karl Landheer & Jonas B. Nielsen & Minhee Kim & Senem Aykul & Tanima De & Mary E. Haas & George Hindy & Nan Lin & Ian R. Dinsmore & Jonathan Z. Luo , 2022. "Multiancestry exome sequencing reveals INHBE mutations associated with favorable fat distribution and protection from diabetes," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    9. Gad Abraham & Michael Inouye, 2014. "Fast Principal Component Analysis of Large-Scale Genome-Wide Data," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-5, April.
    10. Beatrix Brügger & Rafael Lalive & Josef Zweimüller, 2009. "Does Culture Affect Unemployment? Evidence from the Röstigraben," NRN working papers 2009-10, The Austrian Center for Labor Economics and the Analysis of the Welfare State, Johannes Kepler University Linz, Austria.
    11. Diana Chang & Alon Keinan, 2014. "Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies," PLOS Computational Biology, Public Library of Science, vol. 10(9), pages 1-14, September.
    12. Alejandro Ochoa & John D Storey, 2021. "Estimating FST and kinship for arbitrary population structures," PLOS Genetics, Public Library of Science, vol. 17(1), pages 1-36, January.
    13. Feldman, Michael J., 2023. "Spiked singular values and vectors under extreme aspect ratios," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    14. Nicola Barban & Elisabetta De Cao & Sonia Oreffice & Climent Quintana-Domeque, 2016. "Assortative Mating on Education: A Genetic Assessment," Working Papers 2016-034, Human Capital and Economic Opportunity Working Group.
    15. Buzbas, Erkan Ozge & Verdu, Paul, 2018. "Inference on admixture fractions in a mechanistic model of recurrent admixture," Theoretical Population Biology, Elsevier, vol. 122(C), pages 149-157.
    16. Bryc, Katarzyna & Bryc, Wlodek & Silverstein, Jack W., 2013. "Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete subpopulations," Theoretical Population Biology, Elsevier, vol. 89(C), pages 34-43.
    17. Guang Guo & Yilan Fu & Hedwig Lee & Tianji Cai & Kathleen Mullan Harris & Yi Li, 2014. "Genetic Bio-Ancestry and Social Construction of Racial Classification in Social Surveys in the Contemporary United States," Demography, Springer;Population Association of America (PAA), vol. 51(1), pages 141-172, February.
    18. Panczak, Radoslaw & Moser, André & Held, Leonhard & Jones, Philip A. & Rühli, Frank J. & Staub, Kaspar, 2017. "A tall order: Small area mapping and modelling of adult height among Swiss male conscripts," Economics & Human Biology, Elsevier, vol. 26(C), pages 61-69.
    19. The International Multiple Sclerosis Genetics Consortium, 2011. "The Genetic Association of Variants in CD6, TNFRSF1A and IRF8 to Multiple Sclerosis: A Multicenter Case-Control Study," PLOS ONE, Public Library of Science, vol. 6(4), pages 1-6, April.
    20. Xiaodong Liu & Ke Zhang & Neslihan A. Kaya & Zhe Jia & Dafei Wu & Tingting Chen & Zhiyuan Liu & Sinan Zhu & Axel M. Hillmer & Torsten Wuestefeld & Jin Liu & Yun Shen Chan & Zheng Hu & Liang Ma & Li Ji, 2024. "Tumor phylogeography reveals block-shaped spatial heterogeneity and the mode of evolution in Hepatocellular Carcinoma," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
