IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v8y2009i1n46.html
   My bibliography  Save this article

Statistical Screening Method for Genetic Factors Influencing Susceptibility to Common Diseases in a Two-Stage Genome-Wide Association Study

Author

Listed:
  • Sato Yasunori

    (Harvard School of Public Health)

  • Laird Nan

    (Harvard School of Public Health)

  • Suganami Hideki

    (Tokyo University of Science)

  • Hamada Chikuma

    (Tokyo University of Science)

  • Niki Naoto

    (Tokyo University of Science)

  • Yoshimura Isao

    (Tokyo University of Science)

  • Yoshida Teruhiko

    (National Cancer Center Research Institute)

Abstract

A genome-wide association study (GWAS) is a standard strategy for detecting disease susceptibility genes, despite unsettled controversies on many aspects, including optimal study design and statistical analysis. As for study design, a two-stage design has been applied to maximize cost-effectiveness. However, there has been little consensus on appropriate statistical analysis for two-stage design. Thereby perplexing the researchers as to which statistical measures should be applied at the first stage, and how to determine the significance level of the differences at the second stage. Here, using simulation studies, we compared statistical operating characteristics of the screening in a two-stage GWAS by taking into consideration the proper balance of false-positive and false-negative error. As a result, the lower bound of confidence interval for odds ratios is recommended as the first stage measure, and then the second stage criteria should primarily depend on the purpose of the genome screen or its role in the overall gene-hunting scheme. Based on the simulation study, we suggest rules of thumb about which statistics to use in a given situation. An application of all operating characteristics of the screening method to an actual GWAS for gastric cancer illustrates the practical relevance of our discussion.

Suggested Citation

  • Sato Yasunori & Laird Nan & Suganami Hideki & Hamada Chikuma & Niki Naoto & Yoshimura Isao & Yoshida Teruhiko, 2009. "Statistical Screening Method for Genetic Factors Influencing Susceptibility to Common Diseases in a Two-Stage Genome-Wide Association Study," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-23, November.
  • Handle: RePEc:bpj:sagmbi:v:8:y:2009:i:1:n:46
    DOI: 10.2202/1544-6115.1490
    as

    Download full text from publisher

    File URL: https://doi.org/10.2202/1544-6115.1490
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.2202/1544-6115.1490?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jaya M. Satagopan & E. S. Venkatraman & Colin B. Begg, 2004. "Two-Stage Designs for Gene–Disease Association Studies with Sample Size Constraints," Biometrics, The International Biometric Society, vol. 60(3), pages 589-597, September.
    2. Jaya M. Satagopan & David A. Verbel & E. S. Venkatraman & Kenneth E. Offit & Colin B. Begg, 2002. "Two-Stage Designs for Gene–Disease Association Studies," Biometrics, The International Biometric Society, vol. 58(1), pages 163-170, March.
    3. Robert Sladek & Ghislain Rocheleau & Johan Rung & Christian Dina & Lishuang Shen & David Serre & Philippe Boutin & Daniel Vincent & Alexandre Belisle & Samy Hadjadj & Beverley Balkau & Barbara Heude &, 2007. "A genome-wide association study identifies novel risk loci for type 2 diabetes," Nature, Nature, vol. 445(7130), pages 881-885, February.
    4. Christopher Genovese & Larry Wasserman, 2002. "Operating characteristics and extensions of the false discovery rate procedure," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 499-517, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Duncan C. Thomas, 2005. "Discussion on "Statistical Issues Arising in the Women's Health Initiative"," Biometrics, The International Biometric Society, vol. 61(4), pages 930-933, December.
    2. Minjung Kwak & Jungnam Joo & Gang Zheng, 2009. "A Robust Test for Two-Stage Design in Genome-Wide Association Studies," Biometrics, The International Biometric Society, vol. 65(4), pages 1288-1295, December.
    3. T. Tony Cai & Wenguang Sun, 2017. "Optimal screening and discovery of sparse signals with applications to multistage high throughput studies," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 197-223, January.
    4. Ping Rao & Hao Wang & Honghong Fang & Qing Gao & Jie Zhang & Manshu Song & Yong Zhou & Youxin Wang & Wei Wang, 2016. "Association between IGF2BP2 Polymorphisms and Type 2 Diabetes Mellitus: A Case–Control Study and Meta-Analysis," IJERPH, MDPI, vol. 13(6), pages 1-13, June.
    5. A Bottle & P Aylin, 2011. "Predicting the false alarm rate in multi-institution mortality monitoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 62(9), pages 1711-1718, September.
    6. Ebrahimi, Nader, 2008. "Simultaneous control of false positives and false negatives in multiple hypotheses testing," Journal of Multivariate Analysis, Elsevier, vol. 99(3), pages 437-450, March.
    7. Greve, Jane, 2008. "Obesity and labor market outcomes in Denmark," Economics & Human Biology, Elsevier, vol. 6(3), pages 350-362, December.
    8. B. Moerkerke & E. Goetghebeur & J. De Riek & I. Roldán‐Ruiz, 2006. "Significance and impotence: towards a balanced view of the null and the alternative hypotheses in marker selection for plant breeding," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(1), pages 61-79, January.
    9. Ruth Heller & Saharon Rosset, 2021. "Optimal control of false discovery criteria in the two‐group model," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(1), pages 133-155, February.
    10. Gómez-Villegas Miguel A. & Salazar Isabel & Sanz Luis, 2014. "A Bayesian decision procedure for testing multiple hypotheses in DNA microarray experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(1), pages 49-65, February.
    11. Arianna Dondi & Jacopo Lenzi & Egidio Candela & Sugitha Sureshkumar & Francesca Morigi & Carlotta Biagi & Marcello Lanari, 2022. "Social Determinants of Health and Distance Learning in Italy in the Era of the SARS-CoV-2 Pandemic," IJERPH, MDPI, vol. 19(9), pages 1-12, May.
    12. John PA Ioannidis & Nikolaos A Patsopoulos & Evangelos Evangelou, 2007. "Heterogeneity in Meta-Analyses of Genome-Wide Association Investigations," PLOS ONE, Public Library of Science, vol. 2(9), pages 1-7, September.
    13. Paul F O’Reilly & Clive J Hoggart & Yotsawat Pomyen & Federico C F Calboli & Paul Elliott & Marjo-Riitta Jarvelin & Lachlan J M Coin, 2012. "MultiPhen: Joint Model of Multiple Phenotypes Can Increase Discovery in GWAS," PLOS ONE, Public Library of Science, vol. 7(5), pages 1-1, May.
    14. Joshua Habiger & David Watts & Michael Anderson, 2017. "Multiple testing with heterogeneous multinomial distributions," Biometrics, The International Biometric Society, vol. 73(2), pages 562-570, June.
    15. Qingyun Cai & Hock Peng Chan, 2017. "A Double Application of the Benjamini-Hochberg Procedure for Testing Batched Hypotheses," Methodology and Computing in Applied Probability, Springer, vol. 19(2), pages 429-443, June.
    16. Mihai C. Giurcanu, 2017. "Oracle M-Estimation for Time Series Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 38(3), pages 479-504, May.
    17. Long Qu & Dan Nettleton & Jack C. M. Dekkers, 2012. "Improved Estimation of the Noncentrality Parameter Distribution from a Large Number of t-Statistics, with Applications to False Discovery Rate Estimation in Microarray Data Analysis," Biometrics, The International Biometric Society, vol. 68(4), pages 1178-1187, December.
    18. Glueck Deborah H & Mandel Jan & Karimpour-Fard Anis & Hunter Lawrence & Muller Keith E, 2008. "Exact Calculations of Average Power for the Benjamini-Hochberg Procedure," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-22, June.
    19. E. M. Conlon & B. L. Postier & B. A. Methe & K. P. Nevin & D. R. Lovley, 2009. "Hierarchical Bayesian meta-analysis models for cross-platform microarray studies," Journal of Applied Statistics, Taylor & Francis Journals, vol. 36(10), pages 1067-1085.
    20. Anti, Sebastian, 2021. "Land grabs and labor in Cambodia," Journal of Development Economics, Elsevier, vol. 149(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:8:y:2009:i:1:n:46. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.