IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2407.04448.html
   My bibliography  Save this paper

Learning control variables and instruments for causal analysis in observational data

Author

Listed:
  • Nicolas Apfel
  • Julia Hatamyar
  • Martin Huber
  • Jannis Kueck

Abstract

This study introduces a data-driven, machine learning-based method to detect suitable control variables and instruments for assessing the causal effect of a treatment on an outcome in observational data, if they exist. Our approach tests the joint existence of instruments, which are associated with the treatment but not directly with the outcome (at least conditional on observables), and suitable control variables, conditional on which the treatment is exogenous, and learns the partition of instruments and control variables from the observed data. The detection of sets of instruments and control variables relies on the condition that proper instruments are conditionally independent of the outcome given the treatment and suitable control variables. We establish the consistency of our method for detecting control variables and instruments under certain regularity conditions, investigate the finite sample performance through a simulation study, and provide an empirical application to labor market data from the Job Corps study.

Suggested Citation

  • Nicolas Apfel & Julia Hatamyar & Martin Huber & Jannis Kueck, 2024. "Learning control variables and instruments for causal analysis in observational data," Papers 2407.04448, arXiv.org.
  • Handle: RePEc:arx:papers:2407.04448
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2407.04448
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Black, Dan A. & Joo, Joonhwi & LaLonde, Robert J. & Smith, Jeffrey A. & Taylor, Evan J., 2015. "Simple Tests for Selection Bias: Learning More from Instrumental Variables," IZA Discussion Papers 9346, Institute of Labor Economics (IZA).
    2. Frank Windmeijer & Xiaoran Liang & Fernando P. Hartwig & Jack Bowden, 2021. "The confidence interval method for selecting valid instrumental variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(4), pages 752-776, September.
    3. Peter Z. Schochet & John Burghardt & Steven Glazerman, 2001. "National Job Corps Study: The Impacts of Job Corps on Participants' Employment and Related Outcomes," Mathematica Policy Research Reports db6c4204b8e1408bb0c6289ec, Mathematica Policy Research.
    4. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    5. Wooldridge, Jeffrey M., 1992. "A Test for Functional Form Against Nonparametric Alternatives," Econometric Theory, Cambridge University Press, vol. 8(4), pages 452-475, December.
    6. Huber, Martin, 2013. "A simple test for the ignorability of non-compliance in experiments," Economics Letters, Elsevier, vol. 120(3), pages 389-391.
    7. Christian N. Brinch & Magne Mogstad & Matthew Wiswall, 2017. "Beyond LATE with a Discrete Instrument," Journal of Political Economy, University of Chicago Press, vol. 125(4), pages 985-1039.
    8. Zijian Guo & Hyunseung Kang & T. Tony Cai & Dylan S. Small, 2018. "Confidence intervals for causal effects with invalid instruments by using two‐stage hard thresholding with voting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 793-815, September.
    9. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    10. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    11. Frank Windmeijer & Helmut Farbmacher & Neil Davies & George Davey Smith, 2019. "On the Use of the Lasso for Instrumental Variables Estimation with Some Invalid Instruments," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(527), pages 1339-1350, July.
    12. de Luna Xavier & Johansson Per, 2014. "Testing for the Unconfoundedness Assumption Using an Instrumental Assumption," Journal of Causal Inference, De Gruyter, vol. 2(2), pages 187-199, September.
    13. Racine, Jeff, 1997. "Consistent Significance Testing for Nonparametric Regression," Journal of Business & Economic Statistics, American Statistical Association, vol. 15(3), pages 369-378, July.
    14. Joshua D. Angrist & Miikka Rokkanen, 2015. "Wanna Get Away? Regression Discontinuity Estimation of Exam School Effects Away From the Cutoff," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1331-1344, December.
    15. Joshua D. Angrist, 2004. "Treatment effect heterogeneity in theory and practice," Economic Journal, Royal Economic Society, vol. 114(494), pages 52-83, March.
    16. Matthew A. Masten & Alexandre Poirier, 2021. "Salvaging Falsified Instrumental Variable Models," Econometrica, Econometric Society, vol. 89(3), pages 1449-1469, May.
    17. Peter Z. Schochet & John Burghardt & Sheena McConnell, 2008. "Does Job Corps Work? Impact Findings from the National Job Corps Study," American Economic Review, American Economic Association, vol. 98(5), pages 1864-1886, December.
    18. repec:mpr:mprres:6097 is not listed on IDEAS
    19. Imbens, Guido W & Angrist, Joshua D, 1994. "Identification and Estimation of Local Average Treatment Effects," Econometrica, Econometric Society, vol. 62(2), pages 467-475, March.
    20. Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
    21. Jaime Sevilla & Alexandra Mayn, 2021. "A conditional independence test for causality in econometrics," Papers 2107.09765, arXiv.org.
    22. Hong, Yongmiao & White, Halbert, 1995. "Consistent Specification Testing via Nonparametric Series Regression," Econometrica, Econometric Society, vol. 63(5), pages 1133-1159, September.
    23. Nicolas Apfel & Helmut Farbmacher & Rebecca Groh & Martin Huber & Henrika Langen, 2022. "Detecting Grouped Local Average Treatment Effects and Selecting True Instruments," Papers 2207.04481, arXiv.org, revised Oct 2023.
    24. Jeffery Racine & Jeffrey Hart & Qi Li, 2006. "Testing the Significance of Categorical Predictor Variables in Nonparametric Regression Models," Econometric Reviews, Taylor & Francis Journals, vol. 25(4), pages 523-544.
    25. repec:mpr:mprres:2951 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Huber & Jannis Kueck, 2022. "Testing the identification of causal effects in observational data," Papers 2203.15890, arXiv.org, revised Jun 2023.
    2. Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    3. Huber Martin & Wüthrich Kaspar, 2019. "Local Average and Quantile Treatment Effects Under Endogeneity: A Review," Journal of Econometric Methods, De Gruyter, vol. 8(1), pages 1-27, January.
    4. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    5. Black, Dan A. & Joo, Joonhwi & LaLonde, Robert & Smith, Jeffrey A. & Taylor, Evan J., 2022. "Simple Tests for Selection: Learning More from Instrumental Variables," Labour Economics, Elsevier, vol. 79(C).
    6. Huber, Martin & Wüthrich, Kaspar, 2017. "Evaluating local average and quantile treatment effects under endogeneity based on instruments: a review," FSES Working Papers 479, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    7. Martin Huber, 2024. "An Introduction to Causal Discovery," Papers 2407.08602, arXiv.org.
    8. Amanda E Kowalski, 2023. "Behaviour within a Clinical Trial and Implications for Mammography Guidelines," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 90(1), pages 432-462.
    9. Denis Fougère & Nicolas Jacquemet, 2020. "Policy Evaluation Using Causal Inference Methods," SciencePo Working papers Main hal-03455978, HAL.
    10. Nicolas Apfel & Helmut Farbmacher & Rebecca Groh & Martin Huber & Henrika Langen, 2022. "Detecting Grouped Local Average Treatment Effects and Selecting True Instruments," Papers 2207.04481, arXiv.org, revised Oct 2023.
    11. Flores, Carlos A. & Flores-Lagunes, Alfonso, 2009. "Identification and Estimation of Causal Mechanisms and Net Effects of a Treatment under Unconfoundedness," IZA Discussion Papers 4237, Institute of Labor Economics (IZA).
    12. Channing Arndt & Sam Jones & Finn Tarp, 2009. "Aid and Growth: Have We Come Full Circle?," Discussion Papers 09-22, University of Copenhagen. Department of Economics.
    13. Vira Semenova, 2020. "Generalized Lee Bounds," Papers 2008.12720, arXiv.org, revised Feb 2023.
    14. Chen, Xuan & Flores, Carlos A. & Flores-Lagunes, Alfonso, 2015. "Going Beyond LATE: Bounding Average Treatment Effects of Job Corps Training," IZA Discussion Papers 9511, Institute of Labor Economics (IZA).
    15. Brett R. Gordon & Robert Moakler & Florian Zettelmeyer, 2022. "Close Enough? A Large-Scale Exploration of Non-Experimental Approaches to Advertising Measurement," Papers 2201.07055, arXiv.org, revised Oct 2022.
    16. Ozkan Eren & Serkan Ozbeklik, 2014. "Who Benefits From Job Corps? A Distributional Analysis Of An Active Labor Market Program," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(4), pages 586-611, June.
    17. Brett R. Gordon & Robert Moakler & Florian Zettelmeyer, 2023. "Close Enough? A Large-Scale Exploration of Non-Experimental Approaches to Advertising Measurement," Marketing Science, INFORMS, vol. 42(4), pages 768-793, July.
    18. Jonathan Fuhr & Philipp Berens & Dominik Papies, 2024. "Estimating Causal Effects with Double Machine Learning -- A Method Evaluation," Papers 2403.14385, arXiv.org, revised Apr 2024.
    19. Flores-Lagunes, Alfonso & Gonzalez, Arturo & Neumann, Todd C., 2007. "Estimating the Effects of Length of Exposure to a Training Program: The Case of Job Corps," IZA Discussion Papers 2846, Institute of Labor Economics (IZA).
    20. German Blanco & Xuan Chen & Carlos A. Flores & Alfonso Flores-Lagunes, 2020. "Bounds on Average and Quantile Treatment Effects on Duration Outcomes Under Censoring, Selection, and Noncompliance," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(4), pages 901-920, October.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2407.04448. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.