IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i6p1285-d1090387.html
   My bibliography  Save this article

Association Testing of a Group of Genetic Markers Based on Next-Generation Sequencing Data and Continuous Response Using a Linear Model Framework

Author

Listed:
  • Zheng Xu

    (Department of Mathematics and Statistics, Wright State University, Dayton, OH 45324, USA)

Abstract

Association testing has been widely used to study the relationship between phenotypes and genetic variants. Most testing methods are based on genotypes. To avoid genotype calling and directly test on next-generation sequencing (NGS) data, sequencing data-based methods have been proposed and shown advantages over genotype-based testing methods in scenarios where genotype calling is inaccurate. Most sequencing data-based testing methods are based on a single genetic marker. The objective of this paper is to extend the methods to allow testing for the association of a continuous response variable with a group of common variants or a group of rare variants without genotype calling. Our proposed methods are derived based on a standard linear model framework. We derive the joint significant test (JS) for a group of common genetic variables and the variable collapse test (VC) for a group of rare genetic variables. We have conducted extensive simulation studies to evaluate the performance of different estimators. According to our results, we found (1) all methods, including our proposed NGS data-based methods and genotype-based methods, can control the Type I error rate probability well; (2) our proposed NGS data-based methods can achieve better performance in terms of statistical power compared with their corresponding genotype-based methods in the literature; (3) when sequencing depth increases, the performance of all methods increases, and the difference between the performance of NGS data-based methods and corresponding genotype-based methods decreases. In conclusion, we have proposed NGS data-based methods that allow testing for the significance of a group of variants using a linear model framework and have shown the advantage of our NGS data-based methods over genotype-based methods in the literature.

Suggested Citation

  • Zheng Xu, 2023. "Association Testing of a Group of Genetic Markers Based on Next-Generation Sequencing Data and Continuous Response Using a Linear Model Framework," Mathematics, MDPI, vol. 11(6), pages 1-32, March.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:6:p:1285-:d:1090387
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/6/1285/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/6/1285/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Iuliana Ionita-Laza & Joseph D Buxbaum & Nan M Laird & Christoph Lange, 2011. "A New Testing Strategy to Identify Rare Variants with Either Risk or Protective Effect on Disease," PLOS Genetics, Public Library of Science, vol. 7(2), pages 1-6, February.
    2. Vincent Plagnol & Jason D Cooper & John A Todd & David G Clayton, 2007. "A Method to Address Differential Bias in Genotyping in Large-Scale Association Studies," PLOS Genetics, Public Library of Science, vol. 3(5), pages 1-9, May.
    3. Rasmus Nielsen & Thorfinn Korneliussen & Anders Albrechtsen & Yingrui Li & Jun Wang, 2012. "SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-10, July.
    4. Hailiang Huang & Pritam Chanda & Alvaro Alonso & Joel S Bader & Dan E Arking, 2011. "Gene-Based Tests of Association," PLOS Genetics, Public Library of Science, vol. 7(7), pages 1-15, July.
    5. Dajiang J Liu & Suzanne M Leal, 2010. "A Novel Adaptive Method for the Analysis of Next-Generation Sequencing Data to Detect Complex Trait Associations with Rare Variants Due to Gene Main Effects and Interactions," PLOS Genetics, Public Library of Science, vol. 6(10), pages 1-14, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zheng Xu & Song Yan & Cong Wu & Qing Duan & Sixia Chen & Yun Li, 2023. "Next-Generation Sequencing Data-Based Association Testing of a Group of Genetic Markers for Complex Responses Using a Generalized Linear Model Framework," Mathematics, MDPI, vol. 11(11), pages 1-28, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zheng Xu & Song Yan & Cong Wu & Qing Duan & Sixia Chen & Yun Li, 2023. "Next-Generation Sequencing Data-Based Association Testing of a Group of Genetic Markers for Complex Responses Using a Generalized Linear Model Framework," Mathematics, MDPI, vol. 11(11), pages 1-28, June.
    2. Chung-Feng Kao & Jia-Rou Liu & Hung Hung & Po-Hsiu Kuo, 2015. "A Robust GWSS Method to Simultaneously Detect Rare and Common Variants for Complex Disease," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-14, April.
    3. Martin Ladouceur & Zari Dastani & Yurii S Aulchenko & Celia M T Greenwood & J Brent Richards, 2012. "The Empirical Power of Rare Variant Association Methods: Results from Sanger Sequencing in 1,998 Individuals," PLOS Genetics, Public Library of Science, vol. 8(2), pages 1-11, February.
    4. Elodie Persyn & Richard Redon & Lise Bellanger & Christian Dina, 2018. "The impact of a fine-scale population stratification on rare variant association test results," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-17, December.
    5. Ruth Greenblatt & Peter Bacchetti & Ross Boylan & Kord Kober & Gayle Springer & Kathryn Anastos & Michael Busch & Mardge Cohen & Seble Kassaye & Deborah Gustafson & Bradley Aouizerat & on behalf of th, 2019. "Genetic and clinical predictors of CD4 lymphocyte recovery during suppressive antiretroviral therapy: Whole exome sequencing and antiretroviral therapy response phenotypes," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-25, August.
    6. Nanye Long & Samuel P Dickson & Jessica M Maia & Hee Shin Kim & Qianqian Zhu & Andrew S Allen, 2013. "Leveraging Prior Information to Detect Causal Variants via Multi-Variant Regression," PLOS Computational Biology, Public Library of Science, vol. 9(6), pages 1-11, June.
    7. Charlotte Wang & Wen-Hsin Kao & Chuhsing Kate Hsiao, 2015. "Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-24, August.
    8. Pallav Bhatnagar & Emily Barron-Casella & Christopher J Bean & Jacqueline N Milton & Clinton T Baldwin & Martin H Steinberg & Michael DeBaun & James F Casella & Dan E Arking, 2013. "Genome-Wide Meta-Analysis of Systolic Blood Pressure in Children with Sickle Cell Disease," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-1, September.
    9. Emily Mathieu, 2016. "AGGrEGATOr: A Gene-based GEne-Gene interActTiOn test for case-control association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(2), pages 151-171, April.
    10. Le Zhang & Chunqiu Zheng & Tian Li & Lei Xing & Han Zeng & Tingting Li & Huan Yang & Jia Cao & Badong Chen & Ziyuan Zhou, 2017. "Building Up a Robust Risk Mathematical Platform to Predict Colorectal Cancer," Complexity, Hindawi, vol. 2017, pages 1-14, October.
    11. Diana Chang & Feng Gao & Andrea Slavney & Li Ma & Yedael Y Waldman & Aaron J Sams & Paul Billing-Ross & Aviv Madar & Richard Spritz & Alon Keinan, 2014. "Accounting for eXentricities: Analysis of the X Chromosome in GWAS Reveals X-Linked Genes Implicated in Autoimmune Diseases," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-31, December.
    12. Ren-Hua Chung & Wei-Yun Tsai & Eden R Martin, 2014. "Family-Based Association Test Using Both Common and Rare Variants and Accounting for Directions of Effects for Sequencing Data," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-7, September.
    13. Yuanjia Wang & Yin-Hsiu Chen & Qiong Yang, 2012. "Joint Rare Variant Association Test of the Average and Individual Effects for Sequencing Studies," PLOS ONE, Public Library of Science, vol. 7(3), pages 1-13, March.
    14. Ahn Kwangmi & Gordon Derek & Finch Stephen J, 2009. "Increase of Rejection Rate in Case-Control Studies with the Differential Genotyping Error Rates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-11, May.
    15. Brandon Coombes & Saonli Basu & Sharmistha Guha & Nicholas Schork, 2015. "Weighted Score Tests Implementing Model-Averaging Schemes in Detection of Rare Variants in Case-Control Studies," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-21, October.
    16. Weiming Zhang & Michael P. Epstein & Tasha E. Fingerlin & Debashis Ghosh, 2017. "Links Between the Sequence Kernel Association and the Kernel-Based Adaptive Cluster Tests," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 246-258, June.
    17. Hirzi Luqman & Daniel Wegmann & Simone Fior & Alex Widmer, 2023. "Climate-induced range shifts drive adaptive response via spatio-temporal sieving of alleles," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    18. Daniel D Kinnamon & Ray E Hershberger & Eden R Martin, 2012. "Reconsidering Association Testing Methods Using Single-Variant Test Statistics as Alternatives to Pooling Tests for Sequence Data with Rare Variants," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-15, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:6:p:1285-:d:1090387. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.