IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v64y2008i3p673-684.html
   My bibliography  Save this article

Haplotype‐Based Regression Analysis and Inference of Case–Control Studies with Unphased Genotypes and Measurement Errors in Environmental Exposures

Author

Listed:
  • Iryna Lobach
  • Raymond J. Carroll
  • Christine Spinka
  • Mitchell H. Gail
  • Nilanjan Chatterjee

Abstract

Summary It is widely believed that risks of many complex diseases are determined by genetic susceptibilities, environmental exposures, and their interaction. Chatterjee and Carroll (2005, Biometrika92, 399–418) developed an efficient retrospective maximum‐likelihood method for analysis of case–control studies that exploits an assumption of gene–environment independence and leaves the distribution of the environmental covariates to be completely nonparametric. Spinka, Carroll, and Chatterjee (2005, Genetic Epidemiology29, 108–127) extended this approach to studies where certain types of genetic information, such as haplotype phases, may be missing on some subjects. We further extend this approach to situations when some of the environmental exposures are measured with error. Using a polychotomous logistic regression model, we allow disease status to have K+ 1 levels. We propose use of a pseudolikelihood and a related EM algorithm for parameter estimation. We prove consistency and derive the resulting asymptotic covariance matrix of parameter estimates when the variance of the measurement error is known and when it is estimated using replications. Inferences with measurement error corrections are complicated by the fact that the Wald test often behaves poorly in the presence of large amounts of measurement error. The likelihood‐ratio (LR) techniques are known to be a good alternative. However, the LR tests are not technically correct in this setting because the likelihood function is based on an incorrect model, i.e., a prospective model in a retrospective sampling scheme. We corrected standard asymptotic results to account for the fact that the LR test is based on a likelihood‐type function. The performance of the proposed method is illustrated using simulation studies emphasizing the case when genetic information is in the form of haplotypes and missing data arises from haplotype‐phase ambiguity. An application of our method is illustrated using a population‐based case–control study of the association between calcium intake and the risk of colorectal adenoma.

Suggested Citation

  • Iryna Lobach & Raymond J. Carroll & Christine Spinka & Mitchell H. Gail & Nilanjan Chatterjee, 2008. "Haplotype‐Based Regression Analysis and Inference of Case–Control Studies with Unphased Genotypes and Measurement Errors in Environmental Exposures," Biometrics, The International Biometric Society, vol. 64(3), pages 673-684, September.
  • Handle: RePEc:bla:biomet:v:64:y:2008:i:3:p:673-684
    DOI: 10.1111/j.1541-0420.2007.00930.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1541-0420.2007.00930.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1541-0420.2007.00930.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Daowen Zhang & Marie Davidian, 2001. "Linear Mixed Models with Flexible Distributions of Random Effects for Longitudinal Data," Biometrics, The International Biometric Society, vol. 57(3), pages 795-802, September.
    2. Lin, D.Y. & Zeng, D., 2006. "Likelihood-Based Inference on Haplotype Effects in Genetic Association Studies," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 89-104, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yanyuan Ma & Raymond J. Carroll, 2016. "Semiparametric estimation in the secondary analysis of case–control studies," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 127-151, January.
    2. Tianying Wang & Alex Asher, 2021. "Improved Semiparametric Analysis of Polygenic Gene–Environment Interactions in Case–Control Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 386-401, December.
    3. Jun Zhang & Zhenghui Feng & Peirong Xu & Hua Liang, 2017. "Generalized varying coefficient partially linear measurement errors models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 97-120, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Daowen & Davidian, Marie, 2004. "Likelihood and conditional likelihood inference for generalized additive mixed models for clustered data," Journal of Multivariate Analysis, Elsevier, vol. 91(1), pages 90-106, October.
    2. Huang, Pei & McCarl, Bruce A., 2014. "Estimating Decadal Climate Variability Effects on Crop Yields: A Bayesian Hierarchical Approach," 2014 Annual Meeting, July 27-29, 2014, Minneapolis, Minnesota 169828, Agricultural and Applied Economics Association.
    3. Fabienne Comte & Adeline Samson, 2012. "Nonparametric estimation of random-effects densities in linear mixed-effects model," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(4), pages 951-975, December.
    4. Lamarche, Carlos & Shi, Xuan & Young, Derek S., 2024. "Conditional Quantile Functions for Zero-Inflated Longitudinal Count Data," Econometrics and Statistics, Elsevier, vol. 31(C), pages 49-65.
    5. Jinbo Chen & Dongyu Lin & Hagit Hochner, 2012. "Semiparametric Maximum Likelihood Methods for Analyzing Genetic and Environmental Effects with Case-Control Mother–Child Pair Data," Biometrics, The International Biometric Society, vol. 68(3), pages 869-877, September.
    6. Peng Zhang & Peter X.-K. Song & Annie Qu & Tom Greene, 2008. "Efficient Estimation for Patient-Specific Rates of Disease Progression Using Nonnormal Linear Mixed Models," Biometrics, The International Biometric Society, vol. 64(1), pages 29-38, March.
    7. Manuel Arellano & Stéphane Bonhomme, 2012. "Identifying Distributional Characteristics in Random Coefficients Panel Data Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 987-1020.
    8. M. Teimourian & T. Baghfalaki & M. Ganjali & D. Berridge, 2015. "Joint modeling of mixed skewed continuous and ordinal longitudinal responses: a Bayesian approach," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(10), pages 2233-2256, October.
    9. Jinbo Chen & Carmen Rodriguez, 2007. "Conditional Likelihood Methods for Haplotype-Based Association Analysis Using Matched Case–Control Data," Biometrics, The International Biometric Society, vol. 63(4), pages 1099-1107, December.
    10. Pereira, Luz Adriana & Gutiérrez, Luis & Taylor-Rodríguez, Daniel & Mena, Ramsés H., 2023. "Bayesian nonparametric hypothesis testing for longitudinal data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    11. Xianzheng Huang & Leonard A. Stefanski & Marie Davidian, 2009. "Latent-Model Robustness in Joint Models for a Primary Endpoint and a Longitudinal Process," Biometrics, The International Biometric Society, vol. 65(3), pages 719-727, September.
    12. Sinha Samiran & Gruber Stephen B & Mukherjee Bhramar & Rennert Gad, 2008. "Inference of the Haplotype Effect in a Matched Case-Control Study Using Unphased Genotype Data," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-28, May.
    13. Ye, Rendao & Wang, Tonghui & Gupta, Arjun K., 2014. "Distribution of matrix quadratic forms under skew-normal settings," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 229-239.
    14. Rubio, F.J., 2015. "On the propriety of the posterior of hierarchical linear mixed models with flexible random effects distributions," Statistics & Probability Letters, Elsevier, vol. 96(C), pages 154-161.
    15. Lourdes Montenegro & Víctor Lachos & Heleno Bolfarine, 2010. "Inference for a skew extension of the Grubbs model," Statistical Papers, Springer, vol. 51(3), pages 701-715, September.
    16. Jodi M. Casabianca & Charles Lewis, 2015. "IRT Item Parameter Recovery With Marginal Maximum Likelihood Estimation Using Loglinear Smoothing Models," Journal of Educational and Behavioral Statistics, , vol. 40(6), pages 547-578, December.
    17. Francis K. C. Hui & Samuel Müller & Alan H. Welsh, 2021. "Random Effects Misspecification Can Have Severe Consequences for Random Effects Inference in Linear Mixed Models," International Statistical Review, International Statistical Institute, vol. 89(1), pages 186-206, April.
    18. Seo, Byungtae & Ha, Il Do, 2024. "Semiparametric accelerated failure time models under unspecified random effect distributions," Computational Statistics & Data Analysis, Elsevier, vol. 195(C).
    19. Warrington Nicole M. & Tilling Kate & Howe Laura D. & Paternoster Lavinia & Pennell Craig E. & Wu Yan Yan & Briollais Laurent, 2014. "Robustness of the linear mixed effects model to error distribution assumptions and the consequences for genome-wide association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(5), pages 567-587, October.
    20. Özgür Asar & David Bolin & Peter J. Diggle & Jonas Wallin, 2020. "Linear mixed effects models for non‐Gaussian continuous repeated measurement data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1015-1065, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:64:y:2008:i:3:p:673-684. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.