IDEAS home Printed from https://ideas.repec.org/a/sae/evarev/v45y2021i1-2p34-69.html
   My bibliography  Save this article

Imputation of Missing Covariate Data Prior to Propensity Score Analysis: A Tutorial and Evaluation of the Robustness of Practical Approaches

Author

Listed:
  • Walter L. Leite
  • Burak Aydin
  • Dee D. Cetin-Berber

Abstract

Background: Propensity score analysis (PSA) is a popular method to remove selection bias due to covariates in quasi-experimental designs, but it requires handling of missing data on covariates before propensity scores are estimated. Multiple imputation (MI) and single imputation (SI) are approaches to handle missing data in PSA. Objectives: The objectives of this study are to review MI-within, MI-across, and SI approaches to handle missing data on covariates prior to PSA, investigate the robustness of MI-across and SI with a Monte Carlo simulation study, and demonstrate the analysis of missing data and PSA with a step-by-step illustrative example. Research design: The Monte Carlo simulation study compared strategies to impute missing data in continuous and categorical covariates for estimation of propensity scores. Manipulated conditions included sample size, the number of covariates, the size of the treatment effect, missing data mechanism, and percentage of missing data. Imputation strategies included MI-across and SI by joint modeling or multivariate imputation by chained equations (MICE). Results: The results indicated that the MI-across method performed well, and SI also performed adequately with smaller percentages of missing data. The illustrative example demonstrated MI and SI, propensity score estimation, calculation of propensity score weights, covariate balance evaluation, estimation of the average treatment effect on the treated, and sensitivity analysis using data from the National Longitudinal Survey of Youth.

Suggested Citation

  • Walter L. Leite & Burak Aydin & Dee D. Cetin-Berber, 2021. "Imputation of Missing Covariate Data Prior to Propensity Score Analysis: A Tutorial and Evaluation of the Robustness of Practical Approaches," Evaluation Review, , vol. 45(1-2), pages 34-69, February.
  • Handle: RePEc:sae:evarev:v:45:y:2021:i:1-2:p:34-69
    DOI: 10.1177/0193841X211020245
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0193841X211020245
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0193841X211020245?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Arpino, Bruno & Mealli, Fabrizia, 2011. "The specification of the propensity score in multilevel observational studies," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1770-1780, April.
    2. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    3. Lumley, Thomas, 2004. "Analysis of Complex Survey Samples," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 9(i08).
    4. Abadie, Alberto & Imbens, Guido W., 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 1-11.
    5. Shaun Seaman & Ian White, 2014. "Inverse Probability Weighting with Missing Predictors of Treatment Assignment or Missingness," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 43(16), pages 3499-3515, August.
    6. Alberto Abadie & Guido W. Imbens, 2002. "Simple and Bias-Corrected Matching Estimators for Average Treatment Effects," NBER Technical Working Papers 0283, National Bureau of Economic Research, Inc.
    7. Ho, Daniel & Imai, Kosuke & King, Gary & Stuart, Elizabeth A., 2011. "MatchIt: Nonparametric Preprocessing for Parametric Causal Inference," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 42(i08).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zachary K. Collier & Minji Kong & Olushola Soyoye & Kamal Chawla & Ann M. Aviles & Yasser Payne, 2024. "Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items," Journal of Educational and Behavioral Statistics, , vol. 49(2), pages 241-267, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lucija Muehlenbachs & Elisheba Spiller & Christopher Timmins, 2015. "The Housing Market Impacts of Shale Gas Development," American Economic Review, American Economic Association, vol. 105(12), pages 3633-3659, December.
    2. Akoh Fabien Yao & Maxime Sèbe & Laura Recuero Virto & Abdelhak Nassiri & Hervé Dumez, 2024. "The effect of LNG bunkering on port competitiveness using multilevel data analysis [L'effet du soutage par GNL sur la compétitivité des ports à l'aide de l'analyse de données à plusieurs niveaux]," Post-Print hal-04611804, HAL.
    3. Ostapchuk, Igor & Gagalyuk, Taras & Curtiss, Jarmila, 2021. "Post-acquisition integration and growth of farms: the case of Ukrainian agroholdings," International Food and Agribusiness Management Review, International Food and Agribusiness Management Association, vol. 24(4), April.
    4. Benjamin Crost, 2011. "The Effect of Subsidized Employment on Happiness," SOEPpapers on Multidisciplinary Panel Data Research 384, DIW Berlin, The German Socio-Economic Panel (SOEP).
    5. Graf, Nikolaus & Hofer, Helmut & Winter-Ebmer, Rudolf, 2011. "Labor supply effects of a subsidized old-age part-time scheme in Austria," Zeitschrift für ArbeitsmarktForschung - Journal for Labour Market Research, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 44(3), pages 217-229.
    6. Raphael Nishimura & James Wagner & Michael Elliott, 2016. "Alternative Indicators for the Risk of Non-response Bias: A Simulation Study," International Statistical Review, International Statistical Institute, vol. 84(1), pages 43-62, April.
    7. Takeshima, Hiroyuki, 2015. "Identifying the effects of market imperfections for a scale biased agricultural technology: Tractors in Nigeria," 2015 Conference, August 9-14, 2015, Milan, Italy 211937, International Association of Agricultural Economists.
    8. Reint Gropp & Thomas Mosk & Steven Ongena & Carlo Wix, 2019. "Banks Response to Higher Capital Requirements: Evidence from a Quasi-Natural Experiment," The Review of Financial Studies, Society for Financial Studies, vol. 32(1), pages 266-299.
    9. Jiaming Zeng & Michael F. Gensheimer & Daniel L. Rubin & Susan Athey & Ross D. Shachter, 2022. "Uncovering interpretable potential confounders in electronic medical records," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    10. Lenis, David & Ackerman, Benjamin & Stuart, Elizabeth A., 2018. "Measuring model misspecification: Application to propensity score methods with complex survey data," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 48-57.
    11. Songliang Chen & Fang Han, 2024. "On the limiting variance of matching estimators," Papers 2411.05758, arXiv.org.
    12. Lina Cardona-Sosa & Leonardo Morales, 2015. "Efectos laborales de los servicios de cuidado infantil: evidencia del programa Buen Comienzo," Borradores de Economia 882, Banco de la Republica de Colombia.
    13. Leite, Walter & Zhang, Huibin & collier, zachary & Chawla, Kamal & , l.kong@ufl.edu & Lee, Yongseok & Quan, Jia & Soyoye, Olushola, 2024. "Machine Learning for Propensity Score Estimation: A Systematic Review and Reporting Guidelines," OSF Preprints gmrk7, Center for Open Science.
    14. Sauer, J. & Davidova, S. & Gorton, M., 2013. "Migration and Agricultural Efficiency in Kosovo," 87th Annual Conference, April 8-10, 2013, Warwick University, Coventry, UK 158864, Agricultural Economics Society.
    15. Zoltan Hermann, 2013. "Are you on the right track? The effect of educational tracks on student achievement in upper-secondary education in Hungary," Budapest Working Papers on the Labour Market 1316, Institute of Economics, Centre for Economic and Regional Studies.
    16. Miquel-Àngel Garcia-López & Albert Solé-Ollé & Elisabet Viladecans Marsal, 2014. "Do Land Use Policies Follow Road Construction," CESifo Working Paper Series 4672, CESifo Group Munich.
    17. Veronika Bertram-Huemmer & Kati Kraehnert, 2018. "Does Index Insurance Help Households Recover from Disaster? Evidence from IBLI Mongolia," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 100(1), pages 145-171.
    18. Steven Lehrer & Gregory Kordas, 2013. "Matching using semiparametric propensity scores," Empirical Economics, Springer, vol. 44(1), pages 13-45, February.
    19. Galina Besstremyannaya, 2014. "Heterogeneous effect of coinsurance rate on healthcare costs: generalized finite mixtures and matching estimators," Discussion Papers 14-014, Stanford Institute for Economic Policy Research.
    20. Chen, Wei & Klaiber, H. Allen, 2020. "Does road expansion induce traffic? An evaluation of Vehicle-Kilometers Traveled in China," Journal of Environmental Economics and Management, Elsevier, vol. 104(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:evarev:v:45:y:2021:i:1-2:p:34-69. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.