IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0084696.html
   My bibliography  Save this article

Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results

Author

Listed:
  • Kimberly A Walters
  • Yungui Huang
  • Marco Azaro
  • Kathleen Tobin
  • Thomas Lehner
  • Linda M Brzustowicz
  • Veronica J Vieland

Abstract

Human geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer many advantages, including the ability to return to previously collected data to apply new analytic techniques, they also have some limitations. To illustrate, we reviewed data from seven older schizophrenia studies available from the NIMH-funded Center for Collaborative Genomic Studies on Mental Disorders, also known as the Human Genetics Initiative (HGI), and assessed the impact of data cleaning and regularization on linkage analyses. Extensive data regularization protocols were developed and applied to both genotypic and phenotypic data. Genome-wide nonparametric linkage (NPL) statistics were computed for each study, over various stages of data processing. To assess the impact of data processing on aggregate results, Genome-Scan Meta-Analysis (GSMA) was performed. Examples of increased, reduced and shifted linkage peaks were found when comparing linkage results based on original HGI data to results using post-processed data within the same set of pedigrees. Interestingly, reducing the number of affected individuals tended to increase rather than decrease linkage peaks. But most importantly, while the effects of data regularization within individual data sets were small, GSMA applied to the data in aggregate yielded a substantially different picture after data regularization. These results have implications for analyses based on other types of data (e.g., case-control GWAS or sequencing data) as well as data obtained from other repositories.

Suggested Citation

  • Kimberly A Walters & Yungui Huang & Marco Azaro & Kathleen Tobin & Thomas Lehner & Linda M Brzustowicz & Veronica J Vieland, 2014. "Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-8, January.
  • Handle: RePEc:plo:pone00:0084696
    DOI: 10.1371/journal.pone.0084696
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0084696
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0084696&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0084696?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Vieland Veronica J & Hodge Susan E, 2011. "Measurement of Evidence and Evidence of Measurement," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-11, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Veronica J Vieland & Sang-Cheol Seok, 2021. "The PPLD has advantages over conventional regression methods in application to moderately sized genome-wide association studies," PLOS ONE, Public Library of Science, vol. 16(9), pages 1-22, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0084696. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.