IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i1p332-343.html
   My bibliography  Save this article

Generalized case‐control sampling under generalized linear models

Author

Listed:
  • Jacob M. Maronge
  • Ran Tao
  • Jonathan S. Schildcrout
  • Paul J. Rathouz

Abstract

A generalized case‐control (GCC) study, like the standard case‐control study, leverages outcome‐dependent sampling (ODS) to extend to nonbinary responses. We develop a novel, unifying approach for analyzing GCC study data using the recently developed semiparametric extension of the generalized linear model (GLM), which is substantially more robust to model misspecification than existing approaches based on parametric GLMs. For valid estimation and inference, we use a conditional likelihood to account for the biased sampling design. We describe analysis procedures for estimation and inference for the semiparametric GLM under a conditional likelihood, and we discuss problems with estimation and inference under a conditional likelihood when the response distribution is misspecified. We demonstrate the flexibility of our approach over existing ones through extensive simulation studies, and we apply the methodology to an analysis of the Asset and Health Dynamics Among the Oldest Old study, which motives our research. The proposed approach yields a simple yet versatile solution for handling ODS in a wide variety of possible response distributions and sampling schemes encountered in practice.

Suggested Citation

  • Jacob M. Maronge & Ran Tao & Jonathan S. Schildcrout & Paul J. Rathouz, 2023. "Generalized case‐control sampling under generalized linear models," Biometrics, The International Biometric Society, vol. 79(1), pages 332-343, March.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:1:p:332-343
    DOI: 10.1111/biom.13571
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13571
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13571?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Weaver, Mark A. & Zhou, Haibo, 2005. "An Estimated Likelihood Method for Continuous Outcome Regression Models With Outcome-Dependent Sampling," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 459-469, June.
    2. Ran Tao & Donglin Zeng & Dan-Yu Lin, 2017. "Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1468-1476, October.
    3. N. E. Breslow & N. Chatterjee, 1999. "Design and analysis of two‐phase studies with binary outcome applied to Wilms tumour prognosis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(4), pages 457-468.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chiara Di Gravio & Ran Tao & Jonathan S. Schildcrout, 2023. "Design and analysis of two‐phase studies with multivariate longitudinal data," Biometrics, The International Biometric Society, vol. 79(2), pages 1420-1432, June.
    2. Brady Ryan & Ananthika Nirmalkanna & Candemir Cigsar & Yildiz E. Yilmaz, 2023. "Evaluation of Designs and Estimation Methods Under Response-Dependent Two-Phase Sampling for Genetic Association Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(2), pages 510-539, July.
    3. Xiaofei Wang & Haibo Zhou, 2006. "A Semiparametric Empirical Likelihood Method for Biased Sampling Schemes with Auxiliary Covariates," Biometrics, The International Biometric Society, vol. 62(4), pages 1149-1160, December.
    4. Xiaofei Wang & Haibo Zhou, 2010. "Design and Inference for Cancer Biomarker Study with an Outcome and Auxiliary-Dependent Subsampling," Biometrics, The International Biometric Society, vol. 66(2), pages 502-511, June.
    5. Lu Chen & Li Hsu & Kathleen Malone, 2009. "A Frailty-Model-Based Approach to Estimating the Age-Dependent Penetrance Function of Candidate Genes Using Population-Based Case-Control Study Designs: An Application to Data on the BRCA1 Gene," Biometrics, The International Biometric Society, vol. 65(4), pages 1105-1114, December.
    6. Benchimol, Jonathan & El-Shagi, Makram & Saadon, Yossi, 2022. "Do expert experience and characteristics affect inflation forecasts?," Journal of Economic Behavior & Organization, Elsevier, vol. 201(C), pages 205-226.
    7. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    8. Deng, Lifeng & Ding, Jieli & Liu, Yanyan & Wei, Chengdong, 2018. "Regression analysis for the proportional hazards model with parameter constraints under case-cohort design," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 194-206.
    9. Jason Dietrich, 2005. "The effects of sampling strategies on the small sample properties of the logit estimator," Journal of Applied Statistics, Taylor & Francis Journals, vol. 32(6), pages 543-554.
    10. Ruth M. Pfeiffer & Nilanjan Chatterjee, 2005. "On a Supplemented Case–Control Design," Biometrics, The International Biometric Society, vol. 61(2), pages 584-590, June.
    11. Gustavo Amorim & Ran Tao & Sarah Lotspeich & Pamela A. Shaw & Thomas Lumley & Bryan E. Shepherd, 2021. "Two‐phase sampling designs for data validation in settings with covariate measurement error and continuous outcome," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1368-1389, October.
    12. Judith Clarke & Marsha Courchane, 2004. "Implications of Stratified Sampling for Fair Lending Binary Logit Models," The Journal of Real Estate Finance and Economics, Springer, vol. 30(1), pages 5-31, October.
    13. Haibo Zhou & Guoyou Qin & Matthew P. Longnecker, 2011. "A Partial Linear Model in the Outcome-Dependent Sampling Setting to Evaluate the Effect of Prenatal PCB Exposure on Cognitive Function in Children," Biometrics, The International Biometric Society, vol. 67(3), pages 876-885, September.
    14. Takumi Saegusa, 2020. "Confidence bands for a distribution function with merged data from multiple sources," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 144-158, August.
    15. Jieli Ding & Tsui-Shan Lu & Jianwen Cai & Haibo Zhou, 2017. "Recent progresses in outcome-dependent sampling with failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 57-82, January.
    16. Jonathan S. Schildcrout & Shawn P. Garbett & Patrick J. Heagerty, 2013. "Outcome Vector Dependent Sampling with Longitudinal Continuous Response Data: Stratified Sampling Based on Summary Statistics," Biometrics, The International Biometric Society, vol. 69(2), pages 405-416, June.
    17. Sarah C. Lotspeich & Bryan E. Shepherd & Gustavo G. C. Amorim & Pamela A. Shaw & Ran Tao, 2022. "Efficient odds ratio estimation under two‐phase sampling using error‐prone data from a multi‐national HIV research cohort," Biometrics, The International Biometric Society, vol. 78(4), pages 1674-1685, December.
    18. Yuichi Hirose, 2011. "Efficiency of profile likelihood in semi-parametric models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 63(6), pages 1247-1275, December.
    19. Jonathan S. Schildcrout & Patrick J. Heagerty, 2011. "Outcome-Dependent Sampling from Existing Cohorts with Longitudinal Binary Response Data: Study Planning and Analysis," Biometrics, The International Biometric Society, vol. 67(4), pages 1583-1593, December.
    20. Schill, Walter & Enders, Dirk & Drescher, Karsten, 2014. "A SAS Package for Logistic Two-Phase Studies," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 57(i09).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:1:p:332-343. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.