IDEAS home Printed from https://ideas.repec.org/a/spr/aistmt/v77y2025i1d10.1007_s10463-024-00913-w.html
   My bibliography  Save this article

Information projection approach to smoothed propensity score weighting for handling selection bias under missing at random

Author

Listed:
  • Hengfang Wang

    (Fujian Normal University)

  • Jae Kwang Kim

    (Iowa State University)

Abstract

Propensity score weighting is widely used to correct the selection bias in the sample with missing data. The propensity score function is often developed using a model for the response probability, which completely ignores the outcome regression model. In this paper, we explore an alternative approach by developing smoothed propensity score weights that provide a more efficient estimation by removing unnecessary auxiliary variables in the propensity score model. The smoothed propensity score function is obtained by applying the information projection of the original propensity score function to the space that satisfies the moment conditions on the balancing scores obtained from the outcome regression model. By including the covariates for the outcome regression models only in the density ratio model, we can achieve an efficiency gain. Penalized regression is used to identify important covariates. Some limited simulation studies are presented to compare with the existing methods.

Suggested Citation

  • Hengfang Wang & Jae Kwang Kim, 2025. "Information projection approach to smoothed propensity score weighting for handling selection bias under missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 77(1), pages 127-153, February.
  • Handle: RePEc:spr:aistmt:v:77:y:2025:i:1:d:10.1007_s10463-024-00913-w
    DOI: 10.1007/s10463-024-00913-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-024-00913-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10463-024-00913-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kott, Phillip S. & Chang, Ted, 2010. "Using Calibration Weighting to Adjust for Nonignorable Unit Nonresponse," Journal of the American Statistical Association, American Statistical Association, vol. 105(491), pages 1265-1275.
    2. Hainmueller, Jens, 2012. "Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies," Political Analysis, Cambridge University Press, vol. 20(1), pages 25-46, January.
    3. Yilin Chen & Pengfei Li & Changbao Wu, 2020. "Doubly Robust Inference With Nonprobability Survey Samples," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 2011-2021, December.
    4. Antoine, Bertille & Dovonon, Prosper, 2021. "Robust estimation with exponentially tilted Hellinger distance," Journal of Econometrics, Elsevier, vol. 224(2), pages 330-344.
    5. Peisong Han & Lu Wang, 2013. "Estimation with missing data: beyond double robustness," Biometrika, Biometrika Trust, vol. 100(2), pages 417-430.
    6. Jun Shao & Lei Wang, 2016. "Semiparametric inverse propensity weighting for nonignorable missing data," Biometrika, Biometrika Trust, vol. 103(1), pages 175-187.
    7. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 1053-1079.
    8. Kosuke Imai & Marc Ratkovic, 2014. "Covariate balancing propensity score," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 243-263, January.
    9. Montanari, Giorgio E. & Ranalli, M. Giovanna, 2005. "Nonparametric Model Calibration Estimation in Survey Sampling," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1429-1442, December.
    10. Jae Kwang Kim & C. J. Skinner, 2013. "Weighting in survey analysis under informative sampling," Biometrika, Biometrika Trust, vol. 100(2), pages 385-398.
    11. Kwun Chuen Gary Chan & Sheung Chi Phillip Yam & Zheng Zhang, 2016. "Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(3), pages 673-700, June.
    12. repec:mpr:mprres:4937 is not listed on IDEAS
    13. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    14. Sixia Chen & David Haziza, 2017. "Multiply robust imputation procedures for zero-inflated distributions in surveys," METRON, Springer;Sapienza Università di Roma, vol. 75(3), pages 333-343, December.
    15. Zhiqiang Tan, 2010. "Bounded, efficient and doubly robust estimation with inverse weighting," Biometrika, Biometrika Trust, vol. 97(3), pages 661-682.
    16. Johnson, Brent A. & Lin, D.Y. & Zeng, Donglin, 2008. "Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 672-680, June.
    17. F. J. Breidt & G. Claeskens & J. D. Opsomer, 2005. "Model-assisted estimation for complex surveys using penalised splines," Biometrika, Biometrika Trust, vol. 92(4), pages 831-846, December.
    18. repec:mpr:mprres:4780 is not listed on IDEAS
    19. Z Tan, 2020. "Regularized calibrated estimation of propensity scores with model misspecification and high-dimensional data," Biometrika, Biometrika Trust, vol. 107(1), pages 137-158.
    20. Shu Yang & Jae Kwang Kim & Rui Song, 2020. "Doubly robust inference when combining probability and non‐probability samples with high dimensional data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(2), pages 445-465, April.
    21. Ben B. Hansen, 2008. "The prognostic analogue of the propensity score," Biometrika, Biometrika Trust, vol. 95(2), pages 481-488.
    22. Jean-François Beaumont, 2008. "A new approach to weighting and inference in sample surveys," Biometrika, Biometrika Trust, vol. 95(3), pages 539-553.
    23. Shu Yang & Peng Ding, 2020. "Combining Multiple Observational Data Sources to Estimate Causal Effects," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1540-1554, July.
    24. Xinwei Ma & Jingshen Wang, 2020. "Robust Inference Using Inverse Probability Weighting," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1851-1860, December.
    25. Sixia Chen & David Haziza, 2017. "Multiply robust imputation procedures for the treatment of item nonresponse in surveys," Biometrika, Biometrika Trust, vol. 104(2), pages 439-453.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shixiao Zhang & Peisong Han & Changbao Wu, 2023. "Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference," International Statistical Review, International Statistical Institute, vol. 91(2), pages 165-192, August.
    2. Dasom Lee & Shu Yang & Lin Dong & Xiaofei Wang & Donglin Zeng & Jianwen Cai, 2023. "Improving trial generalizability using observational studies," Biometrics, The International Biometric Society, vol. 79(2), pages 1213-1225, June.
    3. Yimin Dai & Ying Yan, 2024. "Mahalanobis balancing: A multivariate perspective on approximate covariate balancing," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 51(4), pages 1450-1471, December.
    4. Li, Wei & Luo, Shanshan & Xu, Wangli, 2024. "Calibrated regression estimation using empirical likelihood under data fusion," Computational Statistics & Data Analysis, Elsevier, vol. 190(C).
    5. He, Xin & Mao, Xiaojun & Wang, Zhonglei, 2024. "Nonparametric augmented probability weighting with sparsity," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    6. Xu, Wenfu & Tan, Zhiqiang, 2024. "High-dimensional model-assisted inference for treatment effects with multi-valued treatments," Journal of Econometrics, Elsevier, vol. 244(1).
    7. Sean Yiu & Li Su, 2022. "Joint calibrated estimation of inverse probability of treatment and censoring weights for marginal structural models," Biometrics, The International Biometric Society, vol. 78(1), pages 115-127, March.
    8. Kwun Chuen Gary Chan & Sheung Chi Phillip Yam & Zheng Zhang, 2016. "Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(3), pages 673-700, June.
    9. Susan Athey & Guido W. Imbens & Stefan Wager, 2018. "Approximate residual balancing: debiased inference of average treatment effects in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 597-623, September.
    10. Pedro H. C. Sant'Anna & Xiaojun Song & Qi Xu, 2022. "Covariate distribution balance via propensity scores," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(6), pages 1093-1120, September.
    11. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    12. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    13. Phillip Heiler, 2022. "Efficient Covariate Balancing for the Local Average Treatment Effect," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1569-1582, October.
    14. Hamori, Shigeyuki & Motegi, Kaiji & Zhang, Zheng, 2019. "Calibration estimation of semiparametric copula models with data missing at random," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 85-109.
    15. Sean Yiu & Li Su, 2018. "Covariate association eliminating weights: a unified weighting framework for causal effect estimation," Biometrika, Biometrika Trust, vol. 105(3), pages 709-722.
    16. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    17. Michael Zimmert, 2018. "The Finite Sample Performance of Treatment Effects Estimators based on the Lasso," Papers 1805.05067, arXiv.org.
    18. Y Cui & E J Tchetgen Tchetgen, 2024. "Selective machine learning of doubly robust functionals," Biometrika, Biometrika Trust, vol. 111(2), pages 517-535.
    19. Ray Chambers & Setareh Ranjbar & Nicola Salvati & Barbara Pacini, 2022. "Weighting, informativeness and causal inference, with an application to rainfall enhancement," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1584-1612, October.
    20. Chen, Sixia & Haziza, David, 2023. "A unified framework of multiply robust estimation approaches for handling incomplete data," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aistmt:v:77:y:2025:i:1:d:10.1007_s10463-024-00913-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.