IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v183y2020i1p379-402.html
   My bibliography  Save this article

Outcome‐dependent sampling in cluster‐correlated data settings with application to hospital profiling

Author

Listed:
  • Glen McGee
  • Jonathan Schildcrout
  • Sharon‐Lise Normand
  • Sebastien Haneuse

Abstract

Hospital readmission is a key marker of quality of healthcare and an important policy measure, used by the Centers for Medicare and Medicaid Services to determine, in part, reimbursement rates. Currently, analyses of readmissions are based on a logistic–normal generalized linear mixed model that permits estimation of hospital‐specific measures while adjusting for case mix differences. Recent moves to identify and address healthcare disparities call for expanding case mix adjustment to include measures of socio‐economic status while minimizing additional burden to hospitals associated with collecting data on such measures. Towards resolving this dilemma, we propose that detailed socio‐economic data be collected on a subsample of patients via an outcome‐dependent sampling scheme, specifically the cluster‐stratified case–control design. Estimation and inference, for both the fixed and the random‐effects components, are performed via pseudo‐maximum‐likelihood wherein inverse probability weights are incorporated in the usual integrated likelihood to account for the design. In comprehensive simulations, cluster‐stratified case–control sampling proves to be an efficient design whenever interest lies in fixed or random effects of a generalized linear mixed model and covariates are unobserved or expensive to collect. The methods are motivated by and illustrated with an analysis of N = 889661 Medicare beneficiaries hospitalized between 2011 and 2013 with congestive heart failure at one of K = 3116 hospitals. Results highlight that the framework proposed provides a means of mitigating disparities in terms of which hospitals are indicated as being poor performers, relative to a naive analysis that fails to adjust for missing case mix variables.

Suggested Citation

  • Glen McGee & Jonathan Schildcrout & Sharon‐Lise Normand & Sebastien Haneuse, 2020. "Outcome‐dependent sampling in cluster‐correlated data settings with application to hospital profiling," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 379-402, January.
  • Handle: RePEc:bla:jorssa:v:183:y:2020:i:1:p:379-402
    DOI: 10.1111/rssa.12503
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12503
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12503?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sophia Rabe‐Hesketh & Anders Skrondal, 2006. "Multilevel modelling of complex survey data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(4), pages 805-827, October.
    2. Klaus Larsen & Jørgen Holm Petersen & Esben Budtz-Jørgensen & Lars Endahl, 2000. "Interpreting Parameters in the Logistic Regression Model with Random Effects," Biometrics, The International Biometric Society, vol. 56(3), pages 909-914, September.
    3. J. M. Neuhaus & A. J. Scott & C. J. Wild, 2006. "Family-Specific Approaches to the Analysis of Case–Control Family Data," Biometrics, The International Biometric Society, vol. 62(2), pages 488-494, June.
    4. D. Pfeffermann & C. J. Skinner & D. J. Holmes & H. Goldstein & J. Rasbash, 1998. "Weighting for unequal selection probabilities in multilevel models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(1), pages 23-40.
    5. John M. Neuhaus & Alastair J. Scott & Christopher J. Wild & Yannan Jiang & Charles E. McCulloch & Ross Boylan, 2014. "Likelihood-based analysis of longitudinal data from outcome-related sampling designs," Biometrics, The International Biometric Society, vol. 70(1), pages 44-52, March.
    6. Jonathan S. Schildcrout & Shawn P. Garbett & Patrick J. Heagerty, 2013. "Outcome Vector Dependent Sampling with Longitudinal Continuous Response Data: Stratified Sampling Based on Summary Statistics," Biometrics, The International Biometric Society, vol. 69(2), pages 405-416, June.
    7. Jonathan S. Schildcrout & Paul J. Rathouz, 2010. "Longitudinal Studies of Binary Response Data Following Case–Control and Stratified Case–Control Sampling: Design and Analysis," Biometrics, The International Biometric Society, vol. 66(2), pages 365-373, June.
    8. Harvey Goldstein & David J. Spiegelhalter, 1996. "League Tables and Their Limitations: Statistical Issues in Comparisons of Institutional Performance," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 159(3), pages 385-409, May.
    9. J. Neuhaus, 2002. "The analysis of retrospective family studies," Biometrika, Biometrika Trust, vol. 89(1), pages 23-37, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Glen McGee & Marianthi‐Anna Kioumourtzoglou & Marc G. Weisskopf & Sebastien Haneuse & Brent A. Coull, 2020. "On the interplay between exposure misclassification and informative cluster size," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1209-1226, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Glen McGee & Marianthi‐Anna Kioumourtzoglou & Marc G. Weisskopf & Sebastien Haneuse & Brent A. Coull, 2020. "On the interplay between exposure misclassification and informative cluster size," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1209-1226, November.
    2. John M. Neuhaus & Alastair J. Scott & Christopher J. Wild & Yannan Jiang & Charles E. McCulloch & Ross Boylan, 2014. "Likelihood-based analysis of longitudinal data from outcome-related sampling designs," Biometrics, The International Biometric Society, vol. 70(1), pages 44-52, March.
    3. Jonathan S. Schildcrout & Shawn P. Garbett & Patrick J. Heagerty, 2013. "Outcome Vector Dependent Sampling with Longitudinal Continuous Response Data: Stratified Sampling Based on Summary Statistics," Biometrics, The International Biometric Society, vol. 69(2), pages 405-416, June.
    4. Sara Sauer & Bethany Hedt‐Gauthier & Claudia Rivera‐Rodriguez & Sebastien Haneuse, 2022. "Small‐sample inference for cluster‐based outcome‐dependent sampling schemes in resource‐limited settings: Investigating low birthweight in Rwanda," Biometrics, The International Biometric Society, vol. 78(2), pages 701-715, June.
    5. Yingye Zheng & Patrick J. Heagerty & Li Hsu & Polly A. Newcomb, 2010. "On Combining Family-Based and Population-Based Case–Control Data in Association Studies," Biometrics, The International Biometric Society, vol. 66(4), pages 1024-1033, December.
    6. Woojin Chung & Roeul Kim, 2020. "A Reversal of the Association between Education Level and Obesity Risk during Ageing: A Gender-Specific Longitudinal Study in South Korea," IJERPH, MDPI, vol. 17(18), pages 1-19, September.
    7. Patricia Dörr & Jan Pablo Burgard, 2019. "Data-driven transformations and survey-weighting for linear mixed models," Research Papers in Economics 2019-16, University of Trier, Department of Economics.
    8. Joseph L Dieleman & Tara Templin, 2014. "Random-Effects, Fixed-Effects and the within-between Specification for Clustered Data in Observational Health Studies: A Simulation Study," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-17, October.
    9. Woojin Chung & Roeul Kim, 2020. "Differential Risk of Cognitive Impairment across Paid and Unpaid Occupations in the Middle-Age Population: Evidence from the Korean Longitudinal Study of Aging, 2006–2016," IJERPH, MDPI, vol. 17(9), pages 1-14, April.
    10. Laura M. Stapleton & Yoonjeong Kang, 2018. "Design Effects of Multilevel Estimates From National Probability Samples," Sociological Methods & Research, , vol. 47(3), pages 430-457, August.
    11. Woojin Chung & Roeul Kim, 2020. "Which Occupation is Highly Associated with Cognitive Impairment? A Gender-Specific Longitudinal Study of Paid and Unpaid Occupations in South Korea," IJERPH, MDPI, vol. 17(21), pages 1-17, October.
    12. Nora Würz & Timo Schmid & Nikos Tzavidis, 2022. "Estimating regional income indicators under transformations and access to limited population auxiliary information," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1679-1706, October.
    13. Anders Skrondal & Sophia Rabe‐Hesketh, 2009. "Prediction in multilevel generalized linear models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(3), pages 659-687, June.
    14. Robert G. Clark & David G. Steel, 2022. "Sample design for analysis using high‐influence probability sampling," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1733-1756, October.
    15. Francesco Schirripa Spagnolo & Nicola Salvati & Antonella D’Agostino & Ides Nicaise, 2020. "The use of sampling weights in M‐quantile random‐effects regression: an application to Programme for International Student Assessment mathematics scores," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 991-1012, August.
    16. Bowen, Mary Elizabeth, 2009. "Childhood socioeconomic status and racial differences in disability: Evidence from the Health and Retirement Study (1998-2006)," Social Science & Medicine, Elsevier, vol. 69(3), pages 433-441, August.
    17. Ana Maria Osorio & Catalina Bolancé & Nyovane Madise & Katharina Rathmann, 2013. "Social Determinants of Child Health in Colombia: Can Community Education Moderate the Effect of Family Characteristics?," Working Papers XREAP2013-02, Xarxa de Referència en Economia Aplicada (XREAP), revised Mar 2013.
    18. Amini, Chiara & Nivorozhkin, Eugene, 2015. "The urban–rural divide in educational outcomes: Evidence from Russia," International Journal of Educational Development, Elsevier, vol. 44(C), pages 118-133.
    19. Yergeau, Marie-Eve, 2020. "Tourism and local welfare: A multilevel analysis in Nepal’s protected areas," World Development, Elsevier, vol. 127(C).
    20. Jonathan S. Schildcrout & Patrick J. Heagerty, 2011. "Outcome-Dependent Sampling from Existing Cohorts with Longitudinal Binary Response Data: Study Planning and Analysis," Biometrics, The International Biometric Society, vol. 67(4), pages 1583-1593, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:183:y:2020:i:1:p:379-402. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.