
Site frequency spectra from genomic SNP surveys

Author

Listed:
  • Ganapathy, Ganeshkumar
  • Uyenoyama, Marcy K.

Abstract

Genomic survey data now permit an unprecedented level of sensitivity in the detection of departures from canonical evolutionary models, including expansions in population size and selective sweeps. Here, we examine the effects of seemingly subtle differences among sampling distributions on goodness of fit analyses of site frequency spectra constructed from single nucleotide polymorphisms. Conditioning on the observation of exactly two alleles in a random sample results in a site frequency spectrum that is independent of the scaled rate of neutral substitution (θ). Other sampling distributions, including conditioning on a single mutational event in the sample genealogy or randomly selecting a single mutation from a genealogy with multiple mutations, have distinct site frequency spectra that show highly significant departures from the predictions of the biallelic model. Some aspects of data filtering may contribute to significant departures of site frequency spectra from expectation, apart from any violation of the standard neutral model.
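To illustrate why a normalized spectrum can be free of θ, the sketch below uses the familiar textbook expectation for the standard neutral infinite-sites model, in which the expected number of sites carrying i mutant copies in a sample of n is proportional to θ/i. This baseline is an assumption made here for illustration; it is not taken from the article and need not match the authors' exact sampling distributions. The function names `neutral_sfs` and `pearson_chi2` are hypothetical. Dividing each class by the total cancels θ, and a Pearson statistic then measures how far observed site counts depart from that expectation.

```python
from fractions import Fraction


def neutral_sfs(n):
    """Normalized expected unfolded SFS for a sample of size n.

    Assumes the textbook result E[xi_i] = theta / i, so that
    P(i mutant copies | site is segregating) = (1/i) / sum_{j=1}^{n-1} 1/j.
    theta cancels in the ratio, so it does not appear as a parameter.
    """
    if n < 2:
        raise ValueError("sample size must be at least 2")
    weights = [Fraction(1, i) for i in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]


def pearson_chi2(observed, n):
    """Pearson chi-square statistic of observed site counts against neutral_sfs(n).

    observed[i-1] is the number of sites with i mutant copies, i = 1..n-1.
    """
    if len(observed) != n - 1:
        raise ValueError("observed must have n - 1 frequency classes")
    total_sites = sum(observed)
    stat = 0.0
    for obs, p in zip(observed, neutral_sfs(n)):
        expected = float(p) * total_sites
        stat += (obs - expected) ** 2 / expected
    return stat


if __name__ == "__main__":
    # Hypothetical counts for a sample of n = 5 chromosomes.
    print(neutral_sfs(5))
    print(pearson_chi2([50, 20, 18, 12], 5))
```

Exact fractions keep the normalization free of rounding error. A spectrum built under a different conditioning scheme would replace `neutral_sfs` while leaving the goodness-of-fit step unchanged.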

Suggested Citation

  • Ganapathy, Ganeshkumar & Uyenoyama, Marcy K., 2009. "Site frequency spectra from genomic SNP surveys," Theoretical Population Biology, Elsevier, vol. 75(4), pages 346-354.
  • Handle: RePEc:eee:thpobi:v:75:y:2009:i:4:p:346-354
    DOI: 10.1016/j.tpb.2009.04.003

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580909000355
    Download Restriction: Full text for ScienceDirect subscribers only



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Birkner, Matthias & Blath, Jochen & Steinrücken, Matthias, 2011. "Importance sampling for Lambda-coalescents in the infinitely many sites model," Theoretical Population Biology, Elsevier, vol. 79(4), pages 155-173.
    2. Sainudiin, Raazesh & Véber, Amandine, 2018. "Full likelihood inference from the site frequency spectrum based on the optimal tree resolution," Theoretical Population Biology, Elsevier, vol. 124(C), pages 1-15.
    3. Jenkins, Paul A. & Song, Yun S., 2011. "The effect of recurrent mutation on the frequency spectrum of a segregating site and the age of an allele," Theoretical Population Biology, Elsevier, vol. 80(2), pages 158-173.
    4. Ferretti, Luca & Klassmann, Alexander & Raineri, Emanuele & Ramos-Onsins, Sebastián E. & Wiehe, Thomas & Achaz, Guillaume, 2018. "The neutral frequency spectrum of linked sites," Theoretical Population Biology, Elsevier, vol. 123(C), pages 70-79.
    5. Hobolth, Asger & Wiuf, Carsten, 2009. "The genealogy, site frequency spectrum and ages of two nested mutant alleles," Theoretical Population Biology, Elsevier, vol. 75(4), pages 260-265.
    6. Uyenoyama, Marcy K. & Takebayashi, Naoki & Kumagai, Seiji, 2020. "Allele frequency spectra in structured populations: Novel-allele probabilities under the labelled coalescent," Theoretical Population Biology, Elsevier, vol. 133(C), pages 130-140.
    7. Hayman, Elizabeth & Ignatieva, Anastasia & Hein, Jotun, 2023. "Recoverability of ancestral recombination graph topologies," Theoretical Population Biology, Elsevier, vol. 154(C), pages 27-39.
    8. Merle, C. & Leblois, R. & Rousset, F. & Pudlo, P., 2017. "Resampling: An improvement of importance sampling in varying population size models," Theoretical Population Biology, Elsevier, vol. 114(C), pages 70-87.
    9. Jenkins Paul A., 2012. "Stopping-Time Resampling and Population Genetic Inference under Coalescent Models," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-20, January.
