IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v13y2014i6p18n4.html
   My bibliography  Save this article

Robust methods to detect disease-genotype association in genetic association studies: calculate p-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations

Author

Listed:
  • Langaas Mette

    (Department of Mathematical Sciences, Norwegian University of Science and Technology, No 7491 Trondheim, Norway)

  • Bakke Øyvind

    (Department of Mathematical Sciences, Norwegian University of Science and Technology, No 7491 Trondheim, Norway)

Abstract

In genetic association studies, detecting disease-genotype association is a primary goal. We study seven robust test statistics for such association when the underlying genetic model is unknown, for data on disease status (case or control) and genotype (three genotypes of a biallelic genetic marker). In such studies, p-values have predominantly been calculated by asymptotic approximations or by simulated permutations. We consider an exact method, conditional enumeration. When the number of simulated permutations tends to infinity, the permutation p-value approaches the conditional enumeration p-value, but calculating the latter is much more efficient than performing simulated permutations. We have studied case-control sample sizes with 500–5000 cases and 500–15,000 controls, and significance levels from 5×10–8 to 0.05, thus our results are applicable to genetic association studies with only a few genetic markers under study, intermediate follow-up studies, and genome-wide association studies. Our main findings are: (i) If all monotone genetic models are of interest, the best performance in the situations under study is achieved for the robust test statistics based on the maximum over a range of Cochran-Armitage trend tests with different scores and for the constrained likelihood ratio test. (ii) For significance levels below 0.05, for the test statistics under study, asymptotic approximations may give a test size up to 20 times the nominal level, and should therefore be used with caution. (iii) Calculating p-values based on exact conditional enumeration is a powerful, valid and computationally feasible approach, and we advocate its use in genetic association studies.

Suggested Citation

  • Langaas Mette & Bakke Øyvind, 2014. "Robust methods to detect disease-genotype association in genetic association studies: calculate p-values using exact conditional enumeration instead of simulated permutations or asymptotic approximati," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(6), pages 675-692, December.
  • Handle: RePEc:bpj:sagmbi:v:13:y:2014:i:6:p:18:n:4
    DOI: 10.1515/sagmb-2013-0084
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/sagmb-2013-0084
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/sagmb-2013-0084?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Morris, Nathan & Elston, Robert, 2011. "A Note on Comparing the Power of Test Statistics at Low Significance Levels," The American Statistician, American Statistical Association, vol. 65(3), pages 164-166.
    2. Zang, Yong & Fung, Wing Kam & Zheng, Gang, 2010. "Simple Algorithms to Calculate Asymptotic Null Distributions of Robust Tests in Case-Control Genetic Association Studies in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i08).
    3. Phipson Belinda & Smyth Gordon K, 2010. "Permutation P-values Should Never Be Zero: Calculating Exact P-values When Permutations Are Randomly Drawn," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-16, October.
    4. Jungnam Joo & Minjung Kwak & Kwangmi Ahn & Gang Zheng, 2009. "A Robust Genome-Wide Scan Statistic of the Wellcome Trust Case–Control Consortium," Biometrics, The International Biometric Society, vol. 65(4), pages 1115-1122, December.
    5. Devan V. Mehrotra & Ivan S. F. Chan & Roger L. Berger, 2003. "A Cautionary Note on Exact Unconditional Inference for a Difference between Two Independent Binomial Proportions," Biometrics, The International Biometric Society, vol. 59(2), pages 441-450, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kozlitina Julia & Schucany William R., 2015. "A robust distribution-free test for genetic association studies of quantitative traits," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(5), pages 443-464, November.
    2. Qu Long, 2014. "Combining dependent F-tests for robust association of quantitative traits under genetic model uncertainty," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(2), pages 123-139, April.
    3. Masha Shunko & Julie Niederhoff & Yaroslav Rosokha, 2018. "Humans Are Not Machines: The Behavioral Impact of Queueing Design on Service Time," Management Science, INFORMS, vol. 64(1), pages 453-473, January.
    4. Romero, Julian & Rosokha, Yaroslav, 2018. "Constructing strategies in the indefinitely repeated prisoner’s dilemma game," European Economic Review, Elsevier, vol. 104(C), pages 185-219.
    5. Silke Janitza & Ender Celik & Anne-Laure Boulesteix, 2018. "A computationally fast variable importance test for random forests for high-dimensional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(4), pages 885-915, December.
    6. Chris J. Lloyd, 2010. "Bootstrap and Second-Order Tests of Risk Difference," Biometrics, The International Biometric Society, vol. 66(3), pages 975-982, September.
    7. Chin Lin & Hsiang-Cheng Chen & Wen-Hui Fang & Chih-Chien Wang & Yi-Jen Peng & Herng-Sheng Lee & Hung Chang & Chi-Ming Chu & Guo-Shu Huang & Wei-Teing Chen & Yu-Jui Tsai & Hong-Ling Lin & Fu-Huang Lin , 2016. "Angiotensin-Converting Enzyme Insertion/Deletion Polymorphism and Susceptibility to Osteoarthritis of the Knee: A Case-Control Study and Meta-Analysis," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-18, September.
    8. Angela L. Riffo-Campos & Guillermo Ayala & Juan Domingo, 2021. "Ordering of Omics Features Using Beta Distributions on Montecarlo p -Values," Mathematics, MDPI, vol. 9(11), pages 1-18, June.
    9. Baddeley, Adrian & Hardegen, Andrew & Lawrence, Thomas & Milne, Robin K. & Nair, Gopalan & Rakshit, Suman, 2017. "On two-stage Monte Carlo tests of composite hypotheses," Computational Statistics & Data Analysis, Elsevier, vol. 114(C), pages 75-87.
    10. Chen, Zhongxue, 2013. "Association tests through combining p-values for case control genome-wide association studies," Statistics & Probability Letters, Elsevier, vol. 83(8), pages 1854-1862.
    11. Chris J. Lloyd, 2008. "A New Exact and More Powerful Unconditional Test of No Treatment Effect from Binary Matched Pairs," Biometrics, The International Biometric Society, vol. 64(3), pages 716-723, September.
    12. Jesse Hemerik & Jelle J. Goeman, 2021. "Another Look at the Lady Tasting Tea and Differences Between Permutation Tests and Randomisation Tests," International Statistical Review, International Statistical Institute, vol. 89(2), pages 367-381, August.
    13. Phipps, Mary C. & Byron, Peter M., 2007. "A filter for "confidence interval P-values"," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6435-6446, August.
    14. Guogen Shan & Gregory Wilding, 2015. "Unconditional tests for association in 2 × 2 contingency tables in the total sum fixed design," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 69(1), pages 67-83, February.
    15. Fabian J.E. Telschow & Michael R. Pierrynowski & Stephan F. Huckemann, 2021. "Functional inference on rotational curves under sample‐specific group actions and identification of human gait," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(4), pages 1256-1276, December.
    16. Hivert, Benjamin & Agniel, Denis & Thiébaut, Rodolphe & Hejblum, Boris P., 2024. "Post-clustering difference testing: Valid inference and practical considerations with applications to ecological and biological data," Computational Statistics & Data Analysis, Elsevier, vol. 193(C).
    17. Stefan Wellek, 2015. "Nearly exact sample size calculation for powerful non-randomized tests for differences between binomial proportions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 69(4), pages 358-373, November.
    18. Hiromitsu Kobayashi & Chorong Song & Harumi Ikei & Bum-Jin Park & Takahide Kagawa & Yoshifumi Miyazaki, 2017. "Diurnal Changes in Distribution Characteristics of Salivary Cortisol and Immunoglobulin A Concentrations," IJERPH, MDPI, vol. 14(9), pages 1-9, August.
    19. Joseph Obaje Ataguba & Celestine Udoka Ugonabo, 2023. "Framework for measuring the efficiency and efficacy of sale of distressed mortgaged properties using imports of statistical tests deployed in clinical studies," SN Business & Economics, Springer, vol. 3(8), pages 1-32, August.
    20. Lucy L. Gao & Daniela Witten & Jacob Bien, 2022. "Testing for association in multiview network data," Biometrics, The International Biometric Society, vol. 78(3), pages 1018-1030, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:13:y:2014:i:6:p:18:n:4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.