IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v53y2009i5p1767-1781.html
   My bibliography  Save this article

Excess false positive rate caused by population stratification and disease rate heterogeneity in case-control association studies

Author

Listed:
  • Li, Zhaohai
  • Zhang, Hong
  • Zheng, Gang
  • Gastwirth, Joseph L.
  • Gail, Mitchell H.

Abstract

Case-control association studies using unrelated cases and controls may suffer from potential confounding due to population stratification. Bias and variance distortion caused by population stratification in the commonly used allele-based tests can considerably inflate the Type I error rate. It is shown that the bias vanishes in the absence of disease rate heterogeneity. If only population stratification exists, a proper estimate of the variance of the allele-based test statistic is developed. Using this estimated variance yields a valid Type I error. However, when the frequencies of the allele under study and the disease rates differ among the subpopulations, it is difficult to correct for this bias. Explicit expressions for the excess false positive rate (EFPR) of the test due to bias and variance distortion are derived. It turns out that the bias created when both population stratification and disease rate heterogeneity are present usually has a greater effect on the EFPR than variance distortion. Comprehensive simulation studies strongly support these results.

Suggested Citation

  • Li, Zhaohai & Zhang, Hong & Zheng, Gang & Gastwirth, Joseph L. & Gail, Mitchell H., 2009. "Excess false positive rate caused by population stratification and disease rate heterogeneity in case-control association studies," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1767-1781, March.
  • Handle: RePEc:eee:csdana:v:53:y:2009:i:5:p:1767-1781
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(08)00089-3
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Nick Patterson & Alkes L Price & David Reich, 2006. "Population Structure and Eigenanalysis," PLOS Genetics, Public Library of Science, vol. 2(12), pages 1-20, December.
    2. Neil J. Risch, 2000. "Searching for genetic determinants in the new millennium," Nature, Nature, vol. 405(6788), pages 847-856, June.
    3. Gang Zheng & Boris Freidlin & Zhaohai Li & Joseph L. Gastwirth, 2005. "Genomic Control for Association Studies under Various Genetic Models," Biometrics, The International Biometric Society, vol. 61(1), pages 186-192, March.
    4. B. Devlin & Kathryn Roeder, 1999. "Genomic Control for Association Studies," Biometrics, The International Biometric Society, vol. 55(4), pages 997-1004, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Linglu & Li, Qizhai & Li, Zhaohai & Zheng, Gang, 2011. "Bayes factors in the presence of population stratification," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 836-841, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. H. Zhang & G. Zheng & Z. Li, 2006. "Statistical Analysis for Haplotype-Based Matched Case–Control Studies," Biometrics, The International Biometric Society, vol. 62(4), pages 1124-1131, December.
    2. Ning Jiang & Minghui Wang & Tianye Jia & Lin Wang & Lindsey Leach & Christine Hackett & David Marshall & Zewei Luo, 2011. "A Robust Statistical Method for Association-Based eQTL Analysis," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-11, August.
    3. Gang Zheng & Zhaohai Li & Mitchell H. Gail & Joseph L. Gastwirth, 2010. "Impact of Population Substructure on Trend Tests for Genetic Case–Control Association Studies," Biometrics, The International Biometric Society, vol. 66(1), pages 196-204, March.
    4. Ilja M Nolte & Chris Wallace & Stephen J Newhouse & Daryl Waggott & Jingyuan Fu & Nicole Soranzo & Rhian Gwilliam & Panos Deloukas & Irina Savelieva & Dongling Zheng & Chrysoula Dalageorgou & Martin F, 2009. "Common Genetic Variation Near the Phospholamban Gene Is Associated with Cardiac Repolarisation: Meta-Analysis of Three Genome-Wide Association Studies," PLOS ONE, Public Library of Science, vol. 4(7), pages 1-10, July.
    5. Zhao Huaqing & Rebbeck Timothy R. & Mitra Nandita, 2012. "Analyzing Genetic Association Studies with an Extended Propensity Score Approach," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(5), pages 1-24, October.
    6. Kai Yu & Zhaoming Wang & Qizhai Li & Sholom Wacholder & David J Hunter & Robert N Hoover & Stephen Chanock & Gilles Thomas, 2008. "Population Substructure and Control Selection in Genome-Wide Association Studies," PLOS ONE, Public Library of Science, vol. 3(7), pages 1-14, July.
    7. Marie-Claude Babron & Marie de Tayrac & Douglas N Rutledge & Eleftheria Zeggini & Emmanuelle Génin, 2012. "Rare and Low Frequency Variant Stratification in the UK Population: Description and Impact on Association Tests," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-9, October.
    8. Yakov A Tsepilov & Janina S Ried & Konstantin Strauch & Harald Grallert & Cornelia M van Duijn & Tatiana I Axenovich & Yurii S Aulchenko, 2013. "Development and Application of Genomic Control Methods for Genome-Wide Association Studies Using Non-Additive Models," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-9, December.
    9. Peristera Paschou & Petros Drineas & Jamey Lewis & Caroline M Nievergelt & Deborah A Nickerson & Joshua D Smith & Paul M Ridker & Daniel I Chasman & Ronald M Krauss & Elad Ziv, 2008. "Tracing Sub-Structure in the European American Population with PCA-Informative Markers," PLOS Genetics, Public Library of Science, vol. 4(7), pages 1-13, July.
    10. André X C N Valente & Joseph Zischkau & Joo Heon Shin & Yuan Gao & Abhijit Sarkar, 2012. "Genome-Wide Association Study Heterogeneous Cohort Homogenization via Subject Weight Knock-Down," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-10, October.
    11. Thomas Charlon & Manuel Martínez-Bueno & Lara Bossini-Castillo & F David Carmona & Alessandro Di Cara & Jérôme Wojcik & Sviatoslav Voloshynovskiy & Javier Martín & Marta E Alarcón-Riquelme, 2016. "Single Nucleotide Polymorphism Clustering in Systemic Autoimmune Diseases," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-10, August.
    12. Zhao Huaqing & Mitra Nandita & Kanetsky Peter A. & Nathanson Katherine L. & Rebbeck Timothy R., 2018. "A practical approach to adjusting for population stratification in genome-wide association studies: principal components and propensity scores (PCAPS)," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 17(6), pages 1-12, December.
    13. Lei Zhang & Yu-Fang Pei & Jian Li & Christopher J Papasian & Hong-Wen Deng, 2009. "Univariate/Multivariate Genome-Wide Association Scans Using Data from Families and Unrelated Samples," PLOS ONE, Public Library of Science, vol. 4(8), pages 1-12, August.
    14. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    15. Gyaneshwer Chaubey & Anurag Kadian & Saroj Bala & Vadlamudi Raghavendra Rao, 2015. "Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-11, June.
    16. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    17. Parsa Akbari & Dragana Vuckovic & Luca Stefanucci & Tao Jiang & Kousik Kundu & Roman Kreuzhuber & Erik L. Bao & Janine H. Collins & Kate Downes & Luigi Grassi & Jose A. Guerrero & Stephen Kaptoge & Ju, 2023. "A genome-wide association study of blood cell morphology identifies cellular proteins implicated in disease aetiology," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    18. Estavoyer, Maxime & François, Olivier, 2022. "Theoretical analysis of principal components in an umbrella model of intraspecific evolution," Theoretical Population Biology, Elsevier, vol. 148(C), pages 11-21.
    19. Jakris Eu-ahsunthornwattana & E Nancy Miller & Michaela Fakiola & Wellcome Trust Case Control Consortium 2 & Selma M B Jeronimo & Jenefer M Blackwell & Heather J Cordell, 2014. "Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data," PLOS Genetics, Public Library of Science, vol. 10(7), pages 1-20, July.
    20. Jungnam Joo & Minjung Kwak & Gang Zheng, 2010. "Improving Power for Testing Genetic Association in Case–Control Studies by Reducing the Alternative Space," Biometrics, The International Biometric Society, vol. 66(1), pages 266-276, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:53:y:2009:i:5:p:1767-1781. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.