IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2002.12710.html
   My bibliography  Save this paper

Causal mediation analysis with double machine learning

Author

Listed:
  • Helmut Farbmacher
  • Martin Huber
  • Luk'av{s} Laff'ers
  • Henrika Langen
  • Martin Spindler

Abstract

This paper combines causal mediation analysis with double machine learning to control for observed confounders in a data-driven way under a selection-on-observables assumption in a high-dimensional setting. We consider the average indirect effect of a binary treatment operating through an intermediate variable (or mediator) on the causal path between the treatment and the outcome, as well as the unmediated direct effect. Estimation is based on efficient score functions, which possess a multiple robustness property w.r.t. misspecifications of the outcome, mediator, and treatment models. This property is key for selecting these models by double machine learning, which is combined with data splitting to prevent overfitting in the estimation of the effects of interest. We demonstrate that the direct and indirect effect estimators are asymptotically normal and root-n consistent under specific regularity conditions and investigate the finite sample properties of the suggested methods in a simulation study when considering lasso as machine learner. We also provide an empirical application to the U.S. National Longitudinal Survey of Youth, assessing the indirect effect of health insurance coverage on general health operating via routine checkups as mediator, as well as the direct effect. We find a moderate short term effect of health insurance coverage on general health which is, however, not mediated by routine checkups.

Suggested Citation

  • Helmut Farbmacher & Martin Huber & Luk'av{s} Laff'ers & Henrika Langen & Martin Spindler, 2020. "Causal mediation analysis with double machine learning," Papers 2002.12710, arXiv.org, revised Feb 2021.
  • Handle: RePEc:arx:papers:2002.12710
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2002.12710
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. James Heckman & Rodrigo Pinto & Peter Savelyev, 2013. "Understanding the Mechanisms through Which an Influential Early Childhood Program Boosted Adult Outcomes," American Economic Review, American Economic Association, vol. 103(6), pages 2052-2086, October.
    2. Martin Huber & Michael Lechner & Anthony Strittmatter, 2018. "Direct and indirect effects of training vouchers for the unemployed," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(2), pages 441-463, February.
    3. Gabriella Conti & James J. Heckman & Rodrigo Pinto, 2016. "The Effects of Two Influential Early Childhood Interventions on Health and Healthy Behaviour," Economic Journal, Royal Economic Society, vol. 126(596), pages 28-65, October.
    4. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    5. José A. Pagán & Andrea Puig & Beth J. Soldo, 2007. "Health insurance coverage and the use of preventive services by Mexican adults," Health Economics, John Wiley & Sons, Ltd., vol. 16(12), pages 1359-1369.
    6. Keele, Luke & Tingley, Dustin & Teppei Yamamoto, "undated". "Identifying Mechanisms behind Policy Interventions via Causal Mediation Analysis," Working Paper 135661, Harvard University OpenScholar.
    7. Zheng Wenjing & van der Laan Mark J., 2012. "Targeted Maximum Likelihood Estimation of Natural Direct Effects," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-40, January.
    8. Martin Huber, 2014. "Identifying Causal Mechanisms (Primarily) Based On Inverse Probability Weighting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(6), pages 920-943, September.
    9. Martin Huber, 2015. "Causal Pitfalls in the Decomposition of Wage Gaps," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(2), pages 179-191, April.
    10. Flores, Carlos A. & Flores-Lagunes, Alfonso, 2009. "Identification and Estimation of Causal Mechanisms and Net Effects of a Treatment under Unconfoundedness," IZA Discussion Papers 4237, Institute of Labor Economics (IZA).
    11. Matias Busso & John DiNardo & Justin McCrary, 2014. "New Evidence on the Finite Sample Properties of Propensity Score Reweighting and Matching Estimators," The Review of Economics and Statistics, MIT Press, vol. 96(5), pages 885-897, December.
    12. Jeffrey M. Albert & Suchitra Nelson, 2011. "Generalized Causal Mediation Analysis," Biometrics, The International Biometric Society, vol. 67(3), pages 1028-1038, September.
    13. Imai, Kosuke & Yamamoto, Teppei, 2013. "Identification and Sensitivity Analysis for Multiple Causal Mechanisms: Revisiting Evidence from Framing Experiments," Political Analysis, Cambridge University Press, vol. 21(2), pages 141-171, April.
    14. Iván Díaz & Nima S. Hejazi, 2020. "Causal mediation analysis for stochastic interventions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(3), pages 661-683, July.
    15. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    16. van der Laan Mark J. & Rubin Daniel, 2006. "Targeted Maximum Likelihood Learning," The International Journal of Biostatistics, De Gruyter, vol. 2(1), pages 1-40, December.
    17. Cardella, Eric & Depew, Briggs, 2014. "The effect of health insurance coverage on the reported health of young adults," Economics Letters, Elsevier, vol. 124(3), pages 406-410.
    18. Nakanishi, Noriyuki & Tatara, Kozo & Fujiwara, Hitoshi, 1996. "Do preventive health services reduce eventual demand for medical care?," Social Science & Medicine, Elsevier, vol. 43(6), pages 999-1005, September.
    19. Thomas R. Ten Have & Marshall M. Joffe & Kevin G. Lynch & Gregory K. Brown & Stephen A. Maisto & Aaron T. Beck, 2007. "Causal Mediation Analyses with Rank Preserving Models," Biometrics, The International Biometric Society, vol. 63(3), pages 926-934, September.
    20. Luke Keele & Dustin Tingley & Teppei Yamamoto, 2015. "Identifying Mechanisms Behind Policy Interventions Via Causal Mediation Analysis," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 34(4), pages 937-963, September.
    21. Luna Bellani & Michela Bia, 2019. "The long‐run effect of childhood poverty and the mediating role of education," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(1), pages 37-68, January.
    22. Ye Luo & Martin Spindler & Jannis Kuck, 2016. "High-Dimensional $L_2$Boosting: Rate of Convergence," Papers 1602.08927, arXiv.org, revised Jul 2022.
    23. Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
    24. José A. Pagán & Andrea Puig & Beth J. Soldo, 2007. "Health insurance coverage and the use of preventive services by Mexican adults," Health Economics, John Wiley & Sons, Ltd., vol. 16(12), pages 1359-1369, December.
    25. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    26. C H Miles & I Shpitser & P Kanki & S Meloni & E J Tchetgen Tchetgen, 2020. "On semiparametric estimation of a path-specific effect in the presence of mediator-outcome confounding," Biometrika, Biometrika Trust, vol. 107(1), pages 159-172.
    27. Kosali Simon & Aparna Soni & John Cawley, 2017. "The Impact of Health Insurance on Preventive Care and Health Behaviors: Evidence from the First Two Years of the ACA Medicaid Expansions," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 36(2), pages 390-417, March.
    28. Martin Huber & Michael Lechner & Giovanni Mellace, 2017. "Why Do Tougher Caseworkers Increase Employment? The Role of Program Assignment as a Causal Mechanism," The Review of Economics and Statistics, MIT Press, vol. 99(1), pages 180-183, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nora Bearth & Michael Lechner, 2024. "Causal Machine Learning for Moderation Effects," Papers 2401.08290, arXiv.org, revised Apr 2024.
    2. Dongli Chen & Qianxuan Huang, 2024. "The New Policy for Innovative Transformation in Regional Industrial Chains, the Conversion of New and Old Kinetic Energy, and Energy Poverty Alleviation," Energies, MDPI, vol. 17(11), pages 1-37, May.
    3. Ruiyu Hu & Zemenghong Bao & Zhisen Lin & Kun Lv, 2024. "The Innovative Construction of Provinces, Regional Artificial Intelligence Development, and the Resilience of Regional Innovation Ecosystems: Quasi-Natural Experiments Based on Spatial Difference-in-D," Sustainability, MDPI, vol. 16(18), pages 1-37, September.
    4. Rahul Singh & Liyuan Xu & Arthur Gretton, 2020. "Kernel Methods for Causal Functions: Dose, Heterogeneous, and Incremental Response Curves," Papers 2010.04855, arXiv.org, revised Oct 2022.
    5. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    6. Zemenghong Bao & Zhisen Lin & Tiantian Jin & Kun Lv, 2024. "Regional Breakthrough Innovation Change Strategies, Ecological Location Suitability of High-Tech Industry Innovation Ecosystems, and Green Energy," Energies, MDPI, vol. 17(16), pages 1-34, August.
    7. Xinyu Wei & Mingwang Cheng & Kaifeng Duan & Xiangxing Kong, 2024. "Effects of Big Data on PM 2.5 : A Study Based on Double Machine Learning," Land, MDPI, vol. 13(3), pages 1-21, March.
    8. AmirEmad Ghassami & Andrew Ying & Ilya Shpitser & Eric Tchetgen Tchetgen, 2021. "Minimax Kernel Machine Learning for a Class of Doubly Robust Functionals with Application to Proximal Causal Inference," Papers 2104.02929, arXiv.org, revised Mar 2022.
    9. Hugo Bodory & Martin Huber & Lukáš Lafférs, 2022. "Evaluating (weighted) dynamic treatment effects by double machine learning [Identification of causal effects using instrumental variables]," The Econometrics Journal, Royal Economic Society, vol. 25(3), pages 628-648.
    10. Lu Kang & Jie Lv & Haoyang Zhang, 2024. "Can the Water Resource Fee-to-Tax Reform Promote the “Three-Wheel Drive” of Corporate Green Energy-Saving Innovations? Quasi-Natural Experimental Evidence from China," Energies, MDPI, vol. 17(12), pages 1-38, June.
    11. Helena Chuliá & Sabuhi Khalili & Jorge M. Uribe, 2024. "Monitoring time-varying systemic risk in sovereign debt and currency markets with generative AI," IREA Working Papers 202402, University of Barcelona, Research Institute of Applied Economics, revised Feb 2024.
    12. Jonathan Fuhr & Philipp Berens & Dominik Papies, 2024. "Estimating Causal Effects with Double Machine Learning -- A Method Evaluation," Papers 2403.14385, arXiv.org, revised Apr 2024.
    13. Rahul Singh & Liyuan Xu & Arthur Gretton, 2021. "Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves," Papers 2111.03950, arXiv.org, revised Jul 2023.
    14. Victor Chernozhukov & Whitney K. Newey & Rahul Singh, 2022. "Automatic Debiased Machine Learning of Causal and Structural Effects," Econometrica, Econometric Society, vol. 90(3), pages 967-1027, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Huber & Mark Schelker & Anthony Strittmatter, 2022. "Direct and Indirect Effects based on Changes-in-Changes," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 432-443, January.
    2. Viviana Celli, 2022. "Causal mediation analysis in economics: Objectives, assumptions, models," Journal of Economic Surveys, Wiley Blackwell, vol. 36(1), pages 214-234, February.
    3. Martin Huber & Michael Lechner & Giovanni Mellace, 2016. "The Finite Sample Performance of Estimators for Mediation Analysis Under Sequential Conditional Independence," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(1), pages 139-160, January.
    4. Hsu Yu-Chin & Huber Martin & Lai Tsung-Chih, 2019. "Nonparametric estimation of natural direct and indirect effects based on inverse probability weighting," Journal of Econometric Methods, De Gruyter, vol. 8(1), pages 1-20, January.
    5. Martin Huber & Lukáš Lafférs, 2022. "Bounds on direct and indirect effects under treatment/mediator endogeneity and outcome attrition," Econometric Reviews, Taylor & Francis Journals, vol. 41(10), pages 1141-1163, November.
    6. Martin Huber & Anna Solovyeva, 2020. "Direct and Indirect Effects under Sample Selection and Outcome Attrition," Econometrics, MDPI, vol. 8(4), pages 1-25, December.
    7. Viviana Celli, 2019. "Causal Mediation Analysis in Economics: objectives, assumptions, models," Working Papers 12/19, Sapienza University of Rome, DISS.
    8. Martin Huber & Yu‐Chin Hsu & Ying‐Ying Lee & Layal Lettry, 2020. "Direct and indirect effects of continuous treatments based on generalized propensity score weighting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 35(7), pages 814-840, November.
    9. Martin Huber & Michael Lechner & Giovanni Mellace, 2017. "Why Do Tougher Caseworkers Increase Employment? The Role of Program Assignment as a Causal Mechanism," The Review of Economics and Statistics, MIT Press, vol. 99(1), pages 180-183, March.
    10. Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
    11. Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    12. Martin Huber & Anna Solovyeva, 2020. "On the Sensitivity of Wage Gap Decompositions," Journal of Labor Research, Springer, vol. 41(1), pages 1-33, June.
    13. Markus Frölich & Martin Huber, 2017. "Direct and indirect treatment effects–causal chains and mediation analysis with instrumental variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1645-1666, November.
    14. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    15. Wunsch, Conny & Strobl, Renate, 2018. "Identification of causal mechanisms based on between-subject double randomization designs," CEPR Discussion Papers 13028, C.E.P.R. Discussion Papers.
    16. Patrick Ring & Christoph A. Schütt & Dennis J. Snower, 2023. "Care and anger motives in social dilemmas," Theory and Decision, Springer, vol. 95(2), pages 273-308, August.
    17. Martin Huber, 2015. "Causal Pitfalls in the Decomposition of Wage Gaps," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(2), pages 179-191, April.
    18. Schütt, Christoph A., 2023. "The effect of perceived similarity and social proximity on the formation of prosocial preferences," Journal of Economic Psychology, Elsevier, vol. 99(C).
    19. Michael Pollmann, 2020. "Causal Inference for Spatial Treatments," Papers 2011.00373, arXiv.org, revised Jan 2023.
    20. Hugo Bodory & Martin Huber & Michael Lechner, 2024. "The Finite Sample Performance of Instrumental Variable-Based Estimators of the Local Average Treatment Effect When Controlling for Covariates," Computational Economics, Springer;Society for Computational Economics, vol. 64(4), pages 2053-2078, October.

    More about this item

    JEL classification:

    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2002.12710. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.