IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v15y2023i1d10.1007_s12561-022-09351-9.html
   My bibliography  Save this article

Bayesian Framework for Causal Inference with Principal Stratification and Clusters

Author

Listed:
  • Li He

    (Clemson University)

  • Yu-Bo Wang

    (Clemson University)

  • William C. Bridges

    (Clemson University)

  • Zhulin He

    (Foundation Medicine, Inc.)

  • S. Megan Che

    (Clemson University)

Abstract

In observational studies, principal stratification is a well-established method in causal analysis to adjust the treatment effect estimation for post-treatment variables. However, this inference could be challenging when the data have a clustering structure, which is pervasive in observational studies. Adding to the issues is the fact that often the variables associated with the clusters are only recorded as the cluster label due to a budget constraint or measuring difficulties. Furthermore, the true nature of the relationship between these cluster level variables and the outcome may be unclear. Although accommodating this clustering structure via random effects based on the cluster label can address the bias issues, estimating the model is inevitably tedious and overfitting can occur with principal stratification and clustering. In this article, we propose a comprehensive framework for estimating a treatment effect when both post-treatment variable and clustering exist in a data set. Specifically, following the idea of principal stratification, we define the clustering structure as random effects with a spike and slab prior in a Bayesian hierarchical model. As a result, a parsimonious model which only contains clusters with significant effects on the outcome can be obtained without much computational cost. We demonstrate the desirable features of the proposed method with two real data sets, one about academic performance and the other about infant birth weight. To further examine the empirical performance of the proposed method, simulations with data generating mechanisms similar to our data applications, and other four hypothetical data sets are conducted.

Suggested Citation

  • Li He & Yu-Bo Wang & William C. Bridges & Zhulin He & S. Megan Che, 2023. "Bayesian Framework for Causal Inference with Principal Stratification and Clusters," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(1), pages 114-140, April.
  • Handle: RePEc:spr:stabio:v:15:y:2023:i:1:d:10.1007_s12561-022-09351-9
    DOI: 10.1007/s12561-022-09351-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-022-09351-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-022-09351-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Soohyung Lee & Lesley J. Turner & Seokjin Woo & Kyunghee Kim, 2014. "All or Nothing? The Impact of School and Classroom Gender Composition on Effort and Academic Achievement," NBER Working Papers 20722, National Bureau of Economic Research, Inc.
    2. Skinner, Chris J. & D'Arrigo, Julia, 2011. "Inverse probability weighting for clustered nonresponse," LSE Research Online Documents on Economics 40308, London School of Economics and Political Science, LSE Library.
    3. C. J. Skinner & D'arrigo, 2011. "Inverse probability weighting for clustered nonresponse," Biometrika, Biometrika Trust, vol. 98(4), pages 953-966.
    4. Scheipl, Fabian, 2011. "spikeSlabGAM: Bayesian Variable Selection, Model Choice and Regularization for Generalized Additive Mixed Models in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 43(i14).
    5. Chase N. Joyner & Christopher S. McMahan & Joshua M. Tebbs & Christopher R. Bilder, 2020. "From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data," Biometrics, The International Biometric Society, vol. 76(3), pages 913-923, September.
    6. Yun Li & Jeremy M.G. Taylor & Michael R. Elliott, 2010. "A Bayesian Approach to Surrogacy Assessment Using Principal Stratification in Clinical Trials," Biometrics, The International Biometric Society, vol. 66(2), pages 523-531, June.
    7. Park, Trevor & Casella, George, 2008. "The Bayesian Lasso," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 681-686, June.
    8. Yu-Bo Wang & Cuilin Zhang & Zhen Chen, 2021. "Intergenerational Associations Between Maternal Diet and Childhood Adiposity: A Bayesian Regularized Mediation Analysis," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 524-542, December.
    9. Zhang, Junni L. & Rubin, Donald B. & Mealli, Fabrizia, 2009. "Likelihood-Based Analysis of Causal Effects of Job-Training Programs Using Principal Stratification," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 166-176.
    10. Jin, Hui & Rubin, Donald B., 2008. "Principal Stratification for Causal Inference With Extended Partial Compliance," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 101-111, March.
    11. Constantine E. Frangakis & Donald B. Rubin, 2002. "Principal Stratification in Causal Inference," Biometrics, The International Biometric Society, vol. 58(1), pages 21-29, March.
    12. Nicholas G. Polson & James G. Scott & Jesse Windle, 2013. "Bayesian Inference for Logistic Models Using Pólya--Gamma Latent Variables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1339-1349, December.
    13. Wagner, Helga & Duller, Christine, 2012. "Bayesian model selection for logistic regression models with random intercept," Computational Statistics & Data Analysis, Elsevier, vol. 56(5), pages 1256-1274.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrea Mercatanti & Fan Li, 2017. "Do debit cards decrease cash demand?: causal inference and sensitivity analysis using principal stratification," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(4), pages 759-776, August.
    2. VanderWeele Tyler J, 2011. "Principal Stratification -- Uses and Limitations," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-14, July.
    3. Chase N. Joyner & Christopher S. McMahan & Joshua M. Tebbs & Christopher R. Bilder, 2020. "From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data," Biometrics, The International Biometric Society, vol. 76(3), pages 913-923, September.
    4. Corwin M. Zigler & Thomas R. Belin, 2012. "A Bayesian Approach to Improved Estimation of Causal Effect Predictiveness for a Principal Surrogate Endpoint," Biometrics, The International Biometric Society, vol. 68(3), pages 922-932, September.
    5. Michael R. Elliott & Anna Conlon & Yun Li, 2013. "Discussion on “Surrogate Measures and Consistent Surrogates”," Biometrics, The International Biometric Society, vol. 69(3), pages 565-569, September.
    6. German Blanco & Carlos A. Flores & Alfonso Flores-Lagunes, 2013. "Bounds on Average and Quantile Treatment Effects of Job Corps Training on Wages," Journal of Human Resources, University of Wisconsin Press, vol. 48(3), pages 659-701.
    7. Giovanni Mellace & Roberto Rocci, 2011. "Principal Stratification in sample selection problems with non normal error terms," CEIS Research Paper 194, Tor Vergata University, CEIS, revised 02 May 2011.
    8. Virginia X. He & Matt P. Wand, 2024. "Bayesian generalized additive model selection including a fast variational option," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 108(3), pages 639-668, September.
    9. Michael E. Sobel & Bengt Muthén, 2012. "Compliance Mixture Modelling with a Zero-Effect Complier Class and Missing Data," Biometrics, The International Biometric Society, vol. 68(4), pages 1037-1045, December.
    10. Jouni Kuha & Myrsini Katsikatsou & Irini Moustaki, 2018. "Latent variable modelling with non‐ignorable item non‐response: multigroup response propensity models for cross‐national analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1169-1192, October.
    11. Anindya Bhadra & Jyotishka Datta & Nicholas G. Polson & Brandon T. Willard, 2020. "Global-Local Mixtures: A Unifying Framework," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 82(2), pages 426-447, August.
    12. Bia, Michela & Flores-Lagunes, Alfonso & Mercatanti, Andrea, 2018. "Evaluation of Language Training Programs in Luxembourg using Principal Stratification," GLO Discussion Paper Series 289, Global Labor Organization (GLO).
    13. Qi Long & Roderick J. A. Little & Xihong Lin, 2010. "Estimating causal effects in trials involving multitreatment arms subject to non‐compliance: a Bayesian framework," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 59(3), pages 513-531, May.
    14. Uddin, Md Nazir & Gaskins, Jeremy T., 2023. "Shared Bayesian variable shrinkage in multinomial logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 177(C).
    15. Bartolucci, Francesco & Grilli, Leonardo, 2011. "Modeling Partial Compliance Through Copulas in a Principal Stratification Framework," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 469-479.
    16. Peter Z. Schochet, 2013. "Student Mobility, Dosage, and Principal Stratification in School-Based RCTs," Journal of Educational and Behavioral Statistics, , vol. 38(4), pages 323-354, August.
    17. John Engberg & Dennis Epple & Jason Imbrogno & Holger Sieg & Ron Zimmer, 2014. "Evaluating Education Programs That Have Lotteried Admission and Selective Attrition," Journal of Labor Economics, University of Chicago Press, vol. 32(1), pages 27-63.
    18. Soojin Park & Gregory J. Palardy, 2020. "Sensitivity Evaluation of Methods for Estimating Complier Average Causal Mediation Effects to Assumptions," Journal of Educational and Behavioral Statistics, , vol. 45(4), pages 475-506, August.
    19. Debashis Ghosh & Jeremy M. G. Taylor & Daniel J. Sargent, 2012. "Rejoinder for “Meta-analysis for Surrogacy: Accelerated Failure Time Models and Semicompeting Risks Modeling”," Biometrics, The International Biometric Society, vol. 68(1), pages 245-247, March.
    20. Sylvia Fruhwirth-Schnatter & Peter Knaus, 2022. "Sparse Bayesian State-Space and Time-Varying Parameter Models," Papers 2207.12147, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:15:y:2023:i:1:d:10.1007_s12561-022-09351-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.