IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003649.html
   My bibliography  Save this article

An HMM-Based Comparative Genomic Framework for Detecting Introgression in Eukaryotes

Author

Listed:
  • Kevin J Liu
  • Jingxuan Dai
  • Kathy Truong
  • Ying Song
  • Michael H Kohn
  • Luay Nakhleh

Abstract

One outcome of interspecific hybridization and subsequent effects of evolutionary forces is introgression, which is the integration of genetic material from one species into the genome of an individual in another species. The evolution of several groups of eukaryotic species has involved hybridization, and cases of adaptation through introgression have been already established. In this work, we report on PhyloNet-HMM—a new comparative genomic framework for detecting introgression in genomes. PhyloNet-HMM combines phylogenetic networks with hidden Markov models (HMMs) to simultaneously capture the (potentially reticulate) evolutionary history of the genomes and dependencies within genomes. A novel aspect of our work is that it also accounts for incomplete lineage sorting and dependence across loci. Application of our model to variation data from chromosome 7 in the mouse (Mus musculus domesticus) genome detected a recently reported adaptive introgression event involving the rodent poison resistance gene Vkorc1, in addition to other newly detected introgressed genomic regions. Based on our analysis, it is estimated that about 9% of all sites within chromosome 7 are of introgressive origin (these cover about 13 Mbp of chromosome 7, and over 300 genes). Further, our model detected no introgression in a negative control data set. We also found that our model accurately detected introgression and other evolutionary processes from synthetic data sets simulated under the coalescent model with recombination, isolation, and migration. Our work provides a powerful framework for systematic analysis of introgression while simultaneously accounting for dependence across sites, point mutations, recombination, and ancestral polymorphism.Author Summary: Hybridization is the mating between individuals from two different species. While hybridization introduces genetic material into a host genome, this genetic material may be transient and is purged from the population within a few generations after hybridization. However, in other cases, the introduced genetic material persists in the population—a process known as introgression—and can have significant evolutionary implications. In this paper, we introduce a novel method for detecting introgression in genomes using a comparative genomic approach. The method scans multiple aligned genomes for signatures of introgression by incorporating phylogenetic networks and hidden Markov models. The method allows for teasing apart true signatures of introgression from spurious ones that arise due to population effects and resemble those of introgression. Using the new method, we analyzed two sets of variation data from chromosome 7 in mouse genomes. The method detected previously reported introgressed regions as well as new ones in one of the data sets. In the other data set, which was selected as a negative control, the method detected no introgression. Furthermore, our method accurately detected introgression in simulated evolutionary scenarios and accurately inferred related population genetic quantities. Our method enables systematic comparative analyses of genomes where introgression is suspected, and can work with genome-wide data.

Suggested Citation

  • Kevin J Liu & Jingxuan Dai & Kathy Truong & Ying Song & Michael H Kohn & Luay Nakhleh, 2014. "An HMM-Based Comparative Genomic Framework for Detecting Introgression in Eukaryotes," PLOS Computational Biology, Public Library of Science, vol. 10(6), pages 1-13, June.
  • Handle: RePEc:plo:pcbi00:1003649
    DOI: 10.1371/journal.pcbi.1003649
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003649
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003649&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003649?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Thomas Mailund & Anders E Halager & Michael Westergaard & Julien Y Dutheil & Kasper Munch & Lars N Andersen & Gerton Lunter & Kay Prüfer & Aylwyn Scally & Asger Hobolth & Mikkel H Schierup, 2012. "A New Isolation with Migration Model along Complete Genomes Infers Very Different Divergence Processes among Closely Related Great Ape Species," PLOS Genetics, Public Library of Science, vol. 8(12), pages 1-19, December.
    2. Heng Li & Richard Durbin, 2011. "Inference of human population history from individual whole-genome sequences," Nature, Nature, vol. 475(7357), pages 493-496, July.
    3. Oscar Westesson & Ian Holmes, 2009. "Accurate Detection of Recombinant Breakpoints in Whole-Genome Alignments," PLOS Computational Biology, Public Library of Science, vol. 5(3), pages 1-13, March.
    4. Thomas Mailund & Julien Y Dutheil & Asger Hobolth & Gerton Lunter & Mikkel H Schierup, 2011. "Estimating Divergence Time and Ancestral Effective Population Size of Bornean and Sumatran Orangutan Subspecies Using a Coalescent Hidden Markov Model," PLOS Genetics, Public Library of Science, vol. 7(3), pages 1-15, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mark S Hibbins & Matthew W Hahn, 2021. "The effects of introgression across thousands of quantitative traits revealed by gene expression in wild tomatoes," PLOS Genetics, Public Library of Science, vol. 17(11), pages 1-20, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steinrücken, Matthias & Paul, Joshua S. & Song, Yun S., 2013. "A sequentially Markov conditional sampling distribution for structured populations with migration and recombination," Theoretical Population Biology, Elsevier, vol. 87(C), pages 51-61.
    2. Hobolth, Asger & Jensen, Jens Ledet, 2014. "Markovian approximation to the finite loci coalescent with recombination along multiple sequences," Theoretical Population Biology, Elsevier, vol. 98(C), pages 48-58.
    3. Gideon S Bradburd & Peter L Ralph & Graham M Coop, 2016. "A Spatial Framework for Understanding Population Structure and Admixture," PLOS Genetics, Public Library of Science, vol. 12(1), pages 1-38, January.
    4. Juraj Bergman & Rasmus Ø. Pedersen & Erick J. Lundgren & Rhys T. Lemoine & Sophie Monsarrat & Elena A. Pearce & Mikkel H. Schierup & Jens-Christian Svenning, 2023. "Worldwide Late Pleistocene and Early Holocene population declines in extant megafauna are associated with Homo sapiens expansion rather than climate change," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    5. Costa, Rui J. & Wilkinson-Herbots, Hilde M., 2021. "Inference of gene flow in the process of speciation: Efficient maximum-likelihood implementation of a generalised isolation-with-migration model," Theoretical Population Biology, Elsevier, vol. 140(C), pages 1-15.
    6. Per Unneberg & Mårten Larsson & Anna Olsson & Ola Wallerman & Anna Petri & Ignas Bunikis & Olga Vinnere Pettersson & Chiara Papetti & Astthor Gislason & Henrik Glenner & Joan E. Cartes & Leocadio Blan, 2024. "Ecological genomics in the Northern krill uncovers loci for local adaptation across ocean basins," Nature Communications, Nature, vol. 15(1), pages 1-29, December.
    7. Ya-Mei Ding & Xiao-Xu Pang & Yu Cao & Wei-Ping Zhang & Susanne S. Renner & Da-Yong Zhang & Wei-Ning Bai, 2023. "Genome structure-based Juglandaceae phylogenies contradict alignment-based phylogenies and substitution rates vary with DNA repair genes," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    8. Romain Fournier & Zoi Tsangalidou & David Reich & Pier Francesco Palamara, 2023. "Haplotype-based inference of recent effective population size in modern and ancient DNA samples," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    9. Barton, N.H. & Etheridge, A.M. & Kelleher, J. & Véber, A., 2013. "Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks," Theoretical Population Biology, Elsevier, vol. 87(C), pages 105-119.
    10. Guangping Huang & Lingyun Song & Xin Du & Xin Huang & Fuwen Wei, 2023. "Evolutionary genomics of camouflage innovation in the orchid mantis," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    11. Legried, Brandon & Terhorst, Jonathan, 2022. "Rates of convergence in the two-island and isolation-with-migration models," Theoretical Population Biology, Elsevier, vol. 147(C), pages 16-27.
    12. Jörn Bethune & April Kleppe & Søren Besenbacher, 2022. "A method to build extended sequence context models of point mutations and indels," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    13. Wilton, Peter R. & Baduel, Pierre & Landon, Matthieu M. & Wakeley, John, 2017. "Population structure and coalescence in pedigrees: Comparisons to the structured coalescent and a framework for inference," Theoretical Population Biology, Elsevier, vol. 115(C), pages 1-12.
    14. Ling Zhong & Menghan Zhang & Libing Sun & Yu Yang & Bo Wang & Haibing Yang & Qiang Shen & Yu Xia & Jiarui Cui & Hui Hang & Yi Ren & Bo Pang & Xiangyu Deng & Yahui Zhan & Heng Li & Zhemin Zhou, 2023. "Distributed genotyping and clustering of Neisseria strains reveal continual emergence of epidemic meningococcus over a century," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    15. Carmi, Shai & Wilton, Peter R. & Wakeley, John & Pe’er, Itsik, 2014. "A renewal theory approach to IBD sharing," Theoretical Population Biology, Elsevier, vol. 97(C), pages 35-48.
    16. Kerdoncuff, Elise & Lambert, Amaury & Achaz, Guillaume, 2020. "Testing for population decline using maximal linkage disequilibrium blocks," Theoretical Population Biology, Elsevier, vol. 134(C), pages 171-181.
    17. Youjie Zhao & Chengyong Su & Bo He & Ruie Nie & Yunliang Wang & Junye Ma & Jingyu Song & Qun Yang & Jiasheng Hao, 2023. "Dispersal from the Qinghai-Tibet plateau by a high-altitude butterfly is associated with rapid expansion and reorganization of its genome," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    18. Xiaodong Liu & Long Lin & Mikkel-Holger S. Sinding & Laura D. Bertola & Kristian Hanghøj & Liam Quinn & Genís Garcia-Erill & Malthe Sebro Rasmussen & Mikkel Schubert & Patrícia Pečnerová & Renzo F. Ba, 2024. "Introgression and disruption of migration routes have shaped the genetic integrity of wildebeest populations," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    19. He Yu & Alexandra Jamieson & Ardern Hulme-Beaman & Chris J. Conroy & Becky Knight & Camilla Speller & Hiba Al-Jarah & Heidi Eager & Alexandra Trinks & Gamini Adikari & Henriette Baron & Beate Böhlendo, 2022. "Palaeogenomic analysis of black rat (Rattus rattus) reveals multiple European introductions associated with human economic history," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    20. Kumagai, Seiji & Uyenoyama, Marcy K., 2015. "Genealogical histories in structured populations," Theoretical Population Biology, Elsevier, vol. 102(C), pages 3-15.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003649. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.