IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v15y2023i1d10.1007_s12561-022-09351-9.html
   My bibliography  Save this article

Bayesian Framework for Causal Inference with Principal Stratification and Clusters

Author

Listed:
  • Li He

    (Clemson University)

  • Yu-Bo Wang

    (Clemson University)

  • William C. Bridges

    (Clemson University)

  • Zhulin He

    (Foundation Medicine, Inc.)

  • S. Megan Che

    (Clemson University)

Abstract

In observational studies, principal stratification is a well-established method in causal analysis to adjust the treatment effect estimation for post-treatment variables. However, this inference could be challenging when the data have a clustering structure, which is pervasive in observational studies. Adding to the issues is the fact that often the variables associated with the clusters are only recorded as the cluster label due to a budget constraint or measuring difficulties. Furthermore, the true nature of the relationship between these cluster level variables and the outcome may be unclear. Although accommodating this clustering structure via random effects based on the cluster label can address the bias issues, estimating the model is inevitably tedious and overfitting can occur with principal stratification and clustering. In this article, we propose a comprehensive framework for estimating a treatment effect when both post-treatment variable and clustering exist in a data set. Specifically, following the idea of principal stratification, we define the clustering structure as random effects with a spike and slab prior in a Bayesian hierarchical model. As a result, a parsimonious model which only contains clusters with significant effects on the outcome can be obtained without much computational cost. We demonstrate the desirable features of the proposed method with two real data sets, one about academic performance and the other about infant birth weight. To further examine the empirical performance of the proposed method, simulations with data generating mechanisms similar to our data applications, and other four hypothetical data sets are conducted.

Suggested Citation

  • Li He & Yu-Bo Wang & William C. Bridges & Zhulin He & S. Megan Che, 2023. "Bayesian Framework for Causal Inference with Principal Stratification and Clusters," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(1), pages 114-140, April.
  • Handle: RePEc:spr:stabio:v:15:y:2023:i:1:d:10.1007_s12561-022-09351-9
    DOI: 10.1007/s12561-022-09351-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-022-09351-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-022-09351-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Soohyung Lee & Lesley J. Turner & Seokjin Woo & Kyunghee Kim, 2014. "All or Nothing? The Impact of School and Classroom Gender Composition on Effort and Academic Achievement," NBER Working Papers 20722, National Bureau of Economic Research, Inc.
    2. Skinner, Chris J. & D'Arrigo, Julia, 2011. "Inverse probability weighting for clustered nonresponse," LSE Research Online Documents on Economics 40308, London School of Economics and Political Science, LSE Library.
    3. C. J. Skinner & D'arrigo, 2011. "Inverse probability weighting for clustered nonresponse," Biometrika, Biometrika Trust, vol. 98(4), pages 953-966.
    4. Scheipl, Fabian, 2011. "spikeSlabGAM: Bayesian Variable Selection, Model Choice and Regularization for Generalized Additive Mixed Models in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 43(i14).
    5. Chase N. Joyner & Christopher S. McMahan & Joshua M. Tebbs & Christopher R. Bilder, 2020. "From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data," Biometrics, The International Biometric Society, vol. 76(3), pages 913-923, September.
    6. Yun Li & Jeremy M.G. Taylor & Michael R. Elliott, 2010. "A Bayesian Approach to Surrogacy Assessment Using Principal Stratification in Clinical Trials," Biometrics, The International Biometric Society, vol. 66(2), pages 523-531, June.
    7. Park, Trevor & Casella, George, 2008. "The Bayesian Lasso," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 681-686, June.
    8. Yu-Bo Wang & Cuilin Zhang & Zhen Chen, 2021. "Intergenerational Associations Between Maternal Diet and Childhood Adiposity: A Bayesian Regularized Mediation Analysis," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 524-542, December.
    9. Zhang, Junni L. & Rubin, Donald B. & Mealli, Fabrizia, 2009. "Likelihood-Based Analysis of Causal Effects of Job-Training Programs Using Principal Stratification," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 166-176.
    10. Jin, Hui & Rubin, Donald B., 2008. "Principal Stratification for Causal Inference With Extended Partial Compliance," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 101-111, March.
    11. Constantine E. Frangakis & Donald B. Rubin, 2002. "Principal Stratification in Causal Inference," Biometrics, The International Biometric Society, vol. 58(1), pages 21-29, March.
    12. Nicholas G. Polson & James G. Scott & Jesse Windle, 2013. "Bayesian Inference for Logistic Models Using Pólya--Gamma Latent Variables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1339-1349, December.
    13. Wagner, Helga & Duller, Christine, 2012. "Bayesian model selection for logistic regression models with random intercept," Computational Statistics & Data Analysis, Elsevier, vol. 56(5), pages 1256-1274.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrea Mercatanti & Fan Li, 2017. "Do debit cards decrease cash demand?: causal inference and sensitivity analysis using principal stratification," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(4), pages 759-776, August.
    2. VanderWeele Tyler J, 2011. "Principal Stratification -- Uses and Limitations," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-14, July.
    3. Corwin M. Zigler & Thomas R. Belin, 2012. "A Bayesian Approach to Improved Estimation of Causal Effect Predictiveness for a Principal Surrogate Endpoint," Biometrics, The International Biometric Society, vol. 68(3), pages 922-932, September.
    4. Chase N. Joyner & Christopher S. McMahan & Joshua M. Tebbs & Christopher R. Bilder, 2020. "From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data," Biometrics, The International Biometric Society, vol. 76(3), pages 913-923, September.
    5. Michael R. Elliott & Anna Conlon & Yun Li, 2013. "Discussion on “Surrogate Measures and Consistent Surrogates”," Biometrics, The International Biometric Society, vol. 69(3), pages 565-569, September.
    6. Ying Huang & Shibasish Dasgupta, 2019. "Likelihood-Based Methods for Assessing Principal Surrogate Endpoints in Vaccine Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(3), pages 504-523, December.
    7. Mealli Fabrizia & Mattei Alessandra, 2012. "A Refreshing Account of Principal Stratification," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-19, April.
    8. German Blanco & Carlos A. Flores & Alfonso Flores-Lagunes, 2013. "Bounds on Average and Quantile Treatment Effects of Job Corps Training on Wages," Journal of Human Resources, University of Wisconsin Press, vol. 48(3), pages 659-701.
    9. Giovanni Mellace & Roberto Rocci, 2011. "Principal Stratification in sample selection problems with non normal error terms," CEIS Research Paper 194, Tor Vergata University, CEIS, revised 02 May 2011.
    10. Hui Jin & John Barnard & Donald B. Rubin, 2010. "A Modified General Location Model for Noncompliance With Missing Data," Journal of Educational and Behavioral Statistics, , vol. 35(2), pages 154-173, April.
    11. Michael E. Sobel & Bengt Muthén, 2012. "Compliance Mixture Modelling with a Zero-Effect Complier Class and Missing Data," Biometrics, The International Biometric Society, vol. 68(4), pages 1037-1045, December.
    12. Shuxi Zeng & Fan Li & Peng Ding, 2020. "Is being an only child harmful to psychological health?: evidence from an instrumental variable analysis of China's one‐child policy," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(4), pages 1615-1635, October.
    13. Plewis Ian & Shlomo Natalie, 2017. "Using Response Propensity Models to Improve the Quality of Response Data in Longitudinal Studies," Journal of Official Statistics, Sciendo, vol. 33(3), pages 753-779, September.
    14. Jouni Kuha & Myrsini Katsikatsou & Irini Moustaki, 2018. "Latent variable modelling with non‐ignorable item non‐response: multigroup response propensity models for cross‐national analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1169-1192, October.
    15. Fabrizia Mealli & Barbara Pacini & Elena Stanghellini, 2016. "Identification of Principal Causal Effects Using Additional Outcomes in Concentration Graphs," Journal of Educational and Behavioral Statistics, , vol. 41(5), pages 463-480, October.
    16. Gilbert Peter B. & Blette Bryan S. & Hudgens Michael G. & Shepherd Bryan E., 2020. "Post-randomization Biomarker Effect Modification Analysis in an HIV Vaccine Clinical Trial," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 54-69, January.
    17. Anindya Bhadra & Jyotishka Datta & Nicholas G. Polson & Brandon T. Willard, 2020. "Global-Local Mixtures: A Unifying Framework," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(2), pages 426-447, August.
    18. Park Soojin & Kürüm Esra, 2020. "A Two-Stage Joint Modeling Method for Causal Mediation Analysis in the Presence of Treatment Noncompliance," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 131-149, January.
    19. von Hinke, Stephanie & Davey Smith, George & Lawlor, Debbie A. & Propper, Carol & Windmeijer, Frank, 2016. "Genetic markers as instrumental variables," Journal of Health Economics, Elsevier, vol. 45(C), pages 131-148.
    20. Ovielt Baltodano L'opez & Roberto Casarin, 2022. "A Dynamic Stochastic Block Model for Multi-Layer Networks," Papers 2209.09354, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:15:y:2023:i:1:d:10.1007_s12561-022-09351-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.