Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals

Authors

  • Robert Brown
  • Bogdan Pasaniuc

Abstract

Inferring the ancestry at each locus in the genome of recently admixed individuals (e.g., Latino Americans) plays a major role in medical and population genetic inferences, ranging from finding disease-risk loci, to inferring recombination rates, to mapping missing contigs in the human genome. Although many methods for local ancestry inference have been proposed, most are designed for use with genotyping arrays and fail to make use of the full spectrum of data available from sequencing. In addition, current haplotype-based approaches are computationally demanding, requiring long run times for moderately large sample sizes. Here we present new methods for local ancestry inference that leverage continent-specific variants (CSVs) to attain increased performance over existing approaches in sequenced admixed genomes. A key feature of our approach is that it incorporates the admixed genomes themselves jointly with public datasets, such as 1000 Genomes, to improve the accuracy of CSV calling. We use simulations to show that our approach attains accuracy similar to that of widely used, computationally intensive haplotype-based approaches, with large decreases in runtime. Most importantly, we show that our method recovers local ancestries comparable to the 1000 Genomes consensus local ancestry calls in the real admixed individuals from the 1000 Genomes Project. We extend our approach to account for low-coverage sequencing and show that accurate local ancestry inference can be attained at low sequencing coverage. Finally, we generalize CSVs to sub-continental population-specific variants (sCSVs) and show that in some cases it is possible to determine the sub-continental ancestry for short chromosomal segments on the basis of sCSVs.

Author Summary

Advances in sequencing technologies are dramatically changing the volume and type of data collected in genetic studies. Although most genetic studies so far have focused on individuals of European ancestry, recent studies are increasingly being performed in individuals of admixed ancestry (i.e., with recent ancestors from multiple continents, e.g., Latino Americans). A key component in such studies is the accurate inference of continental ancestry at each segment in the genome of these individuals. In this work we present accurate and robust methods that use continent-specific variants (i.e., genetic variants observed only in individuals of a given continent), now readily accessible through sequencing technology, to perform extremely fast and accurate inference of the ancestral origin of each genomic segment in recently admixed individuals.
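
The idea of a continent-specific variant can be illustrated with a toy sketch. The code below is not the authors' published algorithm: it is a simplified illustration that assumes a variant counts as a CSV when its alternate allele is seen in exactly one continental reference panel, and that assigns each fixed-size window of an admixed haplotype to the continent whose CSVs it carries most often. The function names (`call_csvs`, `assign_windows`), the panel labels, the window size, and all counts are hypothetical.

```python
from collections import Counter


def call_csvs(panel_counts):
    """Map each variant position to its continent if the alternate allele
    is observed in exactly one reference panel (toy CSV definition)."""
    csvs = {}
    for position, counts in panel_counts.items():
        carriers = [panel for panel, n in counts.items() if n > 0]
        if len(carriers) == 1:
            csvs[position] = carriers[0]
    return csvs


def assign_windows(carried_positions, csvs, window_size):
    """Majority vote over the CSVs carried in each fixed-size window.
    Windows without CSVs are omitted; tied windows are reported as None."""
    votes = {}
    for position in carried_positions:
        if position in csvs:
            window = position // window_size
            votes.setdefault(window, Counter())[csvs[position]] += 1
    calls = {}
    for window, counter in sorted(votes.items()):
        ranked = counter.most_common(2)
        tied = len(ranked) == 2 and ranked[0][1] == ranked[1][1]
        calls[window] = None if tied else ranked[0][0]
    return calls


# Hypothetical alternate-allele counts per reference panel (made-up data).
panel_counts = {
    100: {"AFR": 5, "EUR": 0, "NAT": 0},
    250: {"AFR": 0, "EUR": 3, "NAT": 0},
    400: {"AFR": 2, "EUR": 4, "NAT": 0},  # shared across panels, so not a CSV
    1200: {"AFR": 0, "EUR": 0, "NAT": 7},
    1500: {"AFR": 0, "EUR": 0, "NAT": 2},
    1800: {"AFR": 1, "EUR": 0, "NAT": 0},
}
csvs = call_csvs(panel_counts)

# Positions where one admixed haplotype carries the alternate allele.
carried = [100, 400, 1200, 1500, 1800]
calls = assign_windows(carried, csvs, window_size=1000)
print(calls)
```

A majority vote is used here only because it is the simplest rule that shows how CSVs carry ancestry information. The abstract indicates that the actual methods also use the admixed genomes themselves when calling CSVs and account for low sequencing coverage, neither of which this sketch attempts.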

Suggested Citation

  • Robert Brown & Bogdan Pasaniuc, 2014. "Enhanced Methods for Local Ancestry Assignment in Sequenced Admixed Individuals," PLOS Computational Biology, Public Library of Science, vol. 10(4), pages 1-1, April.
  • Handle: RePEc:plo:pcbi00:1003555
    DOI: 10.1371/journal.pcbi.1003555

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003555
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003555&type=printable
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marina Muzzio & Josefina M B Motti & Paula B Paz Sepulveda & Muh-ching Yee & Thomas Cooke & María R Santos & Virginia Ramallo & Emma L Alfaro & Jose E Dipierri & Graciela Bailliet & Claudio M Bravi & , 2018. "Population structure in Argentina," PLOS ONE, Public Library of Science, vol. 13(5), pages 1-13, May.
    2. Owen Alexander Higgins & Alessandra Modi & Costanza Cannariato & Maria Angela Diroma & Federico Lugli & Stefano Ricci & Valentina Zaro & Stefania Vai & Antonino Vazzana & Matteo Romandini & He Yu & Fr, 2024. "Life history and ancestry of the late Upper Palaeolithic infant from Grotta delle Mura, Italy," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    3. Nadine R. Caron & Wilf Adam & Kate Anderson & Brooke T. Boswell & Meck Chongo & Viktor Deineko & Alexanne Dick & Shannon E. Hall & Jessica T. Hatcher & Patricia Howard & Megan Hunt & Kevin Linn & Ashl, 2023. "Partnering with First Nations in Northern British Columbia Canada to Reduce Inequity in Access to Genomic Research," IJERPH, MDPI, vol. 20(10), pages 1-31, May.
    4. Julian R Homburger & Andrés Moreno-Estrada & Christopher R Gignoux & Dominic Nelson & Elena Sanchez & Patricia Ortiz-Tello & Bernardo A Pons-Estel & Eduardo Acevedo-Vasquez & Pedro Miranda & Carl D La, 2015. "Genomic Insights into the Ancestry and Demographic History of South America," PLOS Genetics, Public Library of Science, vol. 11(12), pages 1-26, December.
    5. Daniel Shriner & Adebowale Adeyemo & Charles N Rotimi, 2011. "Joint Ancestry and Association Testing in Admixed Individuals," PLOS Computational Biology, Public Library of Science, vol. 7(12), pages 1-8, December.
    6. Michael J. Blackowicz & Daniel O. Hryhorczuk & Kristin M. Rankin & Dan A. Lewis & Danish Haider & Bruce P. Lanphear & Anne Evens, 2016. "The Impact of Low-Level Lead Toxicity on School Performance among Hispanic Subgroups in the Chicago Public Schools," IJERPH, MDPI, vol. 13(8), pages 1-12, August.
    7. Joel Mefford & John S Witte, 2012. "The Covariate's Dilemma," PLOS Genetics, Public Library of Science, vol. 8(11), pages 1-2, November.
    8. Jonathon P. Schuldt & Adam R. Pearson & Neil A. Lewis jr. & Ashley Jardina & Peter K. Enns, 2022. "Inequality and Misperceptions of Group Concerns Threaten the Integrity and Societal Impact of Science," The ANNALS of the American Academy of Political and Social Science, vol. 700(1), pages 195-207, March.
    9. David A Turissini & Daniel R Matute, 2017. "Fine scale mapping of genomic introgressions within the Drosophila yakuba clade," PLOS Genetics, Public Library of Science, vol. 13(9), pages 1-40, September.
    10. Md. Moksedul Momin & Jisu Shin & Soohyun Lee & Buu Truong & Beben Benyamin & S. Hong Lee, 2023. "A method for an unbiased estimate of cross-ancestry genetic correlation using individual-level data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    11. Rui Wang & Peng Zhang & Xin Lv & Lingling Jiang & Chunshi Gao & Yuanyuan Song & Yaqin Yu & Bo Li, 2016. "Situation of Diabetes and Related Disease Surveillance in Rural Areas of Jilin Province, Northeast China," IJERPH, MDPI, vol. 13(6), pages 1-10, May.
    12. Ido Amit & Kristin Ardlie & Fabiana Arzuaga & Gordon Awandare & Gary Bader & Alexander Bernier & Piero Carninci & Stacey Donnelly & Roland Eils & Alistair R. R. Forrest & Henry T. Greely & Roderic Gui, 2024. "The commitment of the human cell atlas to humanity," Nature Communications, Nature, vol. 15(1), pages 1-7, December.
    13. Gengjie Jia & Xue Zhong & Hae Kyung Im & Nathan Schoettler & Milton Pividori & D. Kyle Hogarth & Anne I. Sperling & Steven R. White & Edward T. Naureckas & Christopher S. Lyttle & Chikashi Terao & Yoi, 2022. "Discerning asthma endotypes through comorbidity mapping," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    14. Bywaters, Paul & Scourfield, Jonathan & Webb, Calum & Morris, Kate & Featherstone, Brid & Brady, Geraldine & Jones, Chantel & Sparks, Tim, 2019. "Paradoxical evidence on ethnic inequities in child welfare: Towards a research agenda," Children and Youth Services Review, Elsevier, vol. 96(C), pages 145-154.
    15. Marc Via & Christopher R Gignoux & Lindsey A Roth & Laura Fejerman & Joshua Galanter & Shweta Choudhry & Gladys Toro-Labrador & Jorge Viera-Vera & Taras K Oleksyk & Kenneth Beckman & Elad Ziv & Neil R, 2011. "History Shaped the Geographic Distribution of Genomic Admixture on the Island of Puerto Rico," PLOS ONE, Public Library of Science, vol. 6(1), pages 1-8, January.
    16. Emil M. Pedersen & Esben Agerbo & Oleguer Plana-Ripoll & Jette Steinbach & Morten D. Krebs & David M. Hougaard & Thomas Werge & Merete Nordentoft & Anders D. Børglum & Katherine L. Musliner & Andrea G, 2023. "ADuLT: An efficient and robust time-to-event GWAS," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    17. Kevin L Keys & Angel C Y Mak & Marquitta J White & Walter L Eckalbar & Andrew W Dahl & Joel Mefford & Anna V Mikhaylova & María G Contreras & Jennifer R Elhawary & Celeste Eng & Donglei Hu & Scott Hun, 2020. "On the cross-population generalizability of gene expression prediction models," PLOS Genetics, Public Library of Science, vol. 16(8), pages 1-28, August.
    18. Sharon R Browning & Brian L Browning & Martha L Daviglus & Ramon A Durazo-Arvizu & Neil Schneiderman & Robert C Kaplan & Cathy C Laurie, 2018. "Ancestry-specific recent effective population size in the Americas," PLOS Genetics, Public Library of Science, vol. 14(5), pages 1-22, May.
    19. Frampton, Geoff K. & Shepherd, Jonathan & Dorne, Jean-Lou C.M., 2009. "Demographic data in asthma clinical trials: A systematic review with implications for generalizing trial findings and tackling health disparities," Social Science & Medicine, Elsevier, vol. 69(8), pages 1147-1154, October.
    20. Canino, Glorisa & Koinis-Mitchell, Daphne & Ortega, Alexander N. & McQuaid, Elizabeth L. & Fritz, Gregory K. & Alegría, Margarita, 2006. "Asthma disparities in the prevalence, morbidity, and treatment of Latino children," Social Science & Medicine, Elsevier, vol. 63(11), pages 2926-2937, December.