IDEAS home Printed from https://ideas.repec.org/a/sae/evarev/v45y2021i1-2p34-69.html
   My bibliography  Save this article

Imputation of Missing Covariate Data Prior to Propensity Score Analysis: A Tutorial and Evaluation of the Robustness of Practical Approaches

Author

Listed:
  • Walter L. Leite
  • Burak Aydin
  • Dee D. Cetin-Berber

Abstract

Background: Propensity score analysis (PSA) is a popular method to remove selection bias due to covariates in quasi-experimental designs, but it requires handling of missing data on covariates before propensity scores are estimated. Multiple imputation (MI) and single imputation (SI) are approaches to handle missing data in PSA. Objectives: The objectives of this study are to review MI-within, MI-across, and SI approaches to handle missing data on covariates prior to PSA, investigate the robustness of MI-across and SI with a Monte Carlo simulation study, and demonstrate the analysis of missing data and PSA with a step-by-step illustrative example. Research design: The Monte Carlo simulation study compared strategies to impute missing data in continuous and categorical covariates for estimation of propensity scores. Manipulated conditions included sample size, the number of covariates, the size of the treatment effect, missing data mechanism, and percentage of missing data. Imputation strategies included MI-across and SI by joint modeling or multivariate imputation by chained equations (MICE). Results: The results indicated that the MI-across method performed well, and SI also performed adequately with smaller percentages of missing data. The illustrative example demonstrated MI and SI, propensity score estimation, calculation of propensity score weights, covariate balance evaluation, estimation of the average treatment effect on the treated, and sensitivity analysis using data from the National Longitudinal Survey of Youth.

Suggested Citation

  • Walter L. Leite & Burak Aydin & Dee D. Cetin-Berber, 2021. "Imputation of Missing Covariate Data Prior to Propensity Score Analysis: A Tutorial and Evaluation of the Robustness of Practical Approaches," Evaluation Review, , vol. 45(1-2), pages 34-69, February.
  • Handle: RePEc:sae:evarev:v:45:y:2021:i:1-2:p:34-69
    DOI: 10.1177/0193841X211020245
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0193841X211020245
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0193841X211020245?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    2. Abadie, Alberto & Imbens, Guido W., 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 1-11.
    3. Shaun Seaman & Ian White, 2014. "Inverse Probability Weighting with Missing Predictors of Treatment Assignment or Missingness," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 43(16), pages 3499-3515, August.
    4. Arpino, Bruno & Mealli, Fabrizia, 2011. "The specification of the propensity score in multilevel observational studies," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1770-1780, April.
    5. Alberto Abadie & Guido W. Imbens, 2002. "Simple and Bias-Corrected Matching Estimators for Average Treatment Effects," NBER Technical Working Papers 0283, National Bureau of Economic Research, Inc.
    6. Ho, Daniel & Imai, Kosuke & King, Gary & Stuart, Elizabeth A., 2011. "MatchIt: Nonparametric Preprocessing for Parametric Causal Inference," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 42(i08).
    7. Lumley, Thomas, 2004. "Analysis of Complex Survey Samples," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 9(i08).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zachary K. Collier & Minji Kong & Olushola Soyoye & Kamal Chawla & Ann M. Aviles & Yasser Payne, 2024. "Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items," Journal of Educational and Behavioral Statistics, , vol. 49(2), pages 241-267, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Takeshima, Hiroyuki, 2015. "Identifying the effects of market imperfections for a scale biased agricultural technology: Tractors in Nigeria," 2015 Conference, August 9-14, 2015, Milan, Italy 211937, International Association of Agricultural Economists.
    2. Reint Gropp & Thomas Mosk & Steven Ongena & Carlo Wix, 2019. "Banks Response to Higher Capital Requirements: Evidence from a Quasi-Natural Experiment," The Review of Financial Studies, Society for Financial Studies, vol. 32(1), pages 266-299.
    3. Jiaming Zeng & Michael F. Gensheimer & Daniel L. Rubin & Susan Athey & Ross D. Shachter, 2022. "Uncovering interpretable potential confounders in electronic medical records," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    4. Lina Cardona-Sosa & Leonardo Morales, 2015. "Efectos laborales de los servicios de cuidado infantil: evidencia del programa Buen Comienzo," Borradores de Economia 882, Banco de la Republica de Colombia.
    5. Miquel-Àngel Garcia-López & Albert Solé-Ollé & Elisabet Viladecans Marsal, 2014. "Do Land Use Policies Follow Road Construction," CESifo Working Paper Series 4672, CESifo Group Munich.
    6. Chen, Wei & Klaiber, H. Allen, 2020. "Does road expansion induce traffic? An evaluation of Vehicle-Kilometers Traveled in China," Journal of Environmental Economics and Management, Elsevier, vol. 104(C).
    7. Steven Lehrer & Gregory Kordas, 2013. "Matching using semiparametric propensity scores," Empirical Economics, Springer, vol. 44(1), pages 13-45, February.
    8. Häggström, Jenny & Persson, Emma & Waernbaum, Ingeborg & de Luna, Xavier, 2015. "CovSel: An R Package for Covariate Selection When Estimating Average Causal Effects," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 68(i01).
    9. Harsh Parikh & Cynthia Rudin & Alexander Volfovsky, 2018. "MALTS: Matching After Learning to Stretch," Papers 1811.07415, arXiv.org, revised Jun 2023.
    10. Michela Bia & Roberto Leombruni & Pierre-Jean Messe, 2009. "Young in-Old out: a new evaluation based on Generalized Propensity Score," LABORatorio R. Revelli Working Papers Series 93, LABORatorio R. Revelli, Centre for Employment Studies.
    11. Nicolas Moreau, 2018. "A SAS macro to estimate Average Treatment Effects with Matching Estimators," Working Papers hal-01691489, HAL.
    12. Graf, Nikolaus & Hofer, Helmut & Winter-Ebmer, Rudolf, 2011. "Labor supply effects of a subsidized old-age part-time scheme in Austria," Zeitschrift für ArbeitsmarktForschung - Journal for Labour Market Research, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 44(3), pages 217-229.
    13. Lucija Muehlenbachs & Elisheba Spiller & Christopher Timmins, 2015. "The Housing Market Impacts of Shale Gas Development," American Economic Review, American Economic Association, vol. 105(12), pages 3633-3659, December.
    14. Mark Van Duijn & Jan Rouwendal & Richard Boersema, 2014. "Transformations of industrial heritage: Insights into external effects on house prices," ERES eres2014_59, European Real Estate Society (ERES).
    15. Miquel-Àngel Garcia-López & Albert Solé-Ollé & Elisabet Viladecans Marsal, 2014. "Do Land Use Policies Follow Road Construction," CESifo Working Paper Series 4672, CESifo.
    16. Bao-We-Wal BAMBE, 2022. "Inflation Targeting and Private Domestic Investment in Developing Countries," Working Papers REM 2022/0237, ISEG - Lisbon School of Economics and Management, REM, Universidade de Lisboa.
    17. Sauer, J. & Gorton, M. & Davidova, S., 2014. "Migration and Agricultural Efficiency – Empirical Evidence for Kosovo," Proceedings “Schriften der Gesellschaft für Wirtschafts- und Sozialwissenschaften des Landbaues e.V.”, German Association of Agricultural Economists (GEWISOLA), vol. 49, March.
    18. Huaiyu Zang & Hang J. Kim & Bin Huang & Rhonda Szczesniak, 2023. "Bayesian causal inference for observational studies with missingness in covariates and outcomes," Biometrics, The International Biometric Society, vol. 79(4), pages 3624-3636, December.
    19. Veronika Bertram-Huemmer & Kati Kraehnert, 2018. "Does Index Insurance Help Households Recover from Disaster? Evidence from IBLI Mongolia," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 100(1), pages 145-171.
    20. Moeltner, Klaus & Puri, Roshan & Johnston, Robert J., 2023. "Regression and matching in hedonic analysis: Empirical guidance for estimator choice," 2023 Annual Meeting, July 23-25, Washington D.C. 335807, Agricultural and Applied Economics Association.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:evarev:v:45:y:2021:i:1-2:p:34-69. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.