
The Trouble with Sliding Windows and the Selective Pressure in BRCA1

Authors

  • Karl Schmid
  • Ziheng Yang

Abstract

Sliding-window analysis has been widely used to uncover synonymous (silent, dS) and nonsynonymous (replacement, dN) rate variation along the protein sequence and to detect regions of a protein under selective constraint (indicated by dN < dS). The approach compares two or more protein-coding genes and plots estimates d̂S and d̂N from each sliding window along the sequence. Here we demonstrate that the approach produces artifactual trends of synonymous and nonsynonymous rate variation, with greater variation in d̂S than in d̂N. Such trends are generated even if the true dS and dN are constant along the whole protein and different codons are evolving independently. Many published tests of negative and positive selection using sliding windows that we have examined appear to be invalid because they fail to correct for multiple testing. Instead, likelihood ratio tests provide a more rigorous framework for detecting signals of natural selection affecting protein evolution. We demonstrate that a previous finding that a particular region of the BRCA1 gene experienced a synonymous rate reduction driven by purifying selection is likely an artifact of the sliding-window analysis. We evaluate various sliding-window analyses in molecular evolution, population genetics, and comparative genomics, and argue that the approach is not generally valid if it is not known a priori that a trend exists and if no correction for multiple testing is applied.
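
The central claim of the abstract, that overlapping windows can suggest trends even when the true rate is constant and codons evolve independently, can be illustrated with a toy simulation. The sketch below is not the authors' method: it replaces proper d̂S and d̂N estimation with a simple per-codon indicator of a difference, and the sequence length (1800 codons), rate (0.1), window width (100), and all function names are assumptions chosen only for illustration.

```python
import random


def sliding_window_means(values, width, step=1):
    """Mean of `values` in each window of `width` items, advancing by `step`."""
    return [
        sum(values[start:start + width]) / width
        for start in range(0, len(values) - width + 1, step)
    ]


def lag1_autocorrelation(xs):
    """Sample lag-1 autocorrelation of a sequence (0.0 if it has no variance)."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs)
    if var == 0:
        return 0.0
    cov = sum((xs[i] - mean) * (xs[i + 1] - mean) for i in range(n - 1))
    return cov / var


def simulate(n_codons=1800, true_rate=0.1, width=100, seed=1):
    """Toy model: every codon differs independently with the same probability."""
    rng = random.Random(seed)
    # Hypothetical stand-in for real rate estimates: 1 = difference, 0 = none.
    per_codon = [1 if rng.random() < true_rate else 0 for _ in range(n_codons)]
    windows = sliding_window_means(per_codon, width)
    return per_codon, windows


per_codon, windows = simulate()
print("true rate at every codon: 0.1")
print(f"window estimates range from {min(windows):.3f} to {max(windows):.3f}")
print(f"lag-1 autocorrelation, per-codon data:   {lag1_autocorrelation(per_codon):.3f}")
print(f"lag-1 autocorrelation, window estimates: {lag1_autocorrelation(windows):.3f}")
```

Because adjacent windows share all but one codon, their estimates are strongly correlated even though the underlying codons are not. The plotted curve therefore rises and falls smoothly, and its peaks and troughs are easy to mistake for regional rate variation. Testing each window separately without a multiple-testing correction compounds the problem.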

Suggested Citation

  • Karl Schmid & Ziheng Yang, 2008. "The Trouble with Sliding Windows and the Selective Pressure in BRCA1," PLOS ONE, Public Library of Science, vol. 3(11), pages 1-7, November.
  • Handle: RePEc:plo:pone00:0003746
    DOI: 10.1371/journal.pone.0003746

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0003746
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0003746&type=printable
    Download Restriction: no

    Citations

    Cited by:

    1. Zhang Zhang & Jeffrey P. Townsend, 2009. "Maximum-Likelihood Model Averaging To Profile Clustering of Site Types across Discrete Linear Sequences," PLOS Computational Biology, Public Library of Science, vol. 5(6), pages 1-14, June.
    2. Samiran Ghosh & Jeffrey P. Townsend, 2015. "H-CLAP: hierarchical clustering within a linear array with an application in genetics," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(2), pages 125-141, April.
