IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v68y2022i5p3399-3418.html
   My bibliography  Save this article

An Instrumental Variable Forest Approach for Detecting Heterogeneous Treatment Effects in Observational Studies

Author

Listed:
  • Guihua Wang

    (Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080)

  • Jun Li

    (Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109)

  • Wallace J. Hopp

    (Ross School of Business, University of Michigan, Ann Arbor, Michigan 48109)

Abstract

This study addresses the ubiquitous challenge of using big observational data to identify heterogeneous treatment effects. This problem arises in precision medicine, targeted marketing, personalized education, and many other environments. Identifying heterogeneous treatment effects presents several analytical challenges including high dimensionality and endogeneity issues. We develop a new instrumental variable tree (IVT) approach that incorporates the instrumental variable method into a causal tree (CT) to correct for potential endogeneity biases that may exist in observational data. Our IVT approach partitions subjects into subgroups with similar treatment effects within subgroups and different treatment effects across subgroups. The estimated treatment effects are asymptotically consistent under a set of mild assumptions. Using simulated data, we show our approach has a better coverage rate and smaller mean-squared error than the conventional CT approach. We also demonstrate that an instrumental variable forest (IVF) constructed using IVTs has better accuracy and stratification than a generalized random forest. Finally, by applying the IVF approach to an empirical assessment of laparoscopic colectomy, we demonstrate the importance of accounting for endogeneity to make accurate comparisons of the heterogeneous effects of the treatment (teaching hospitals) and control (nonteaching hospitals) on different types of patients.

Suggested Citation

  • Guihua Wang & Jun Li & Wallace J. Hopp, 2022. "An Instrumental Variable Forest Approach for Detecting Heterogeneous Treatment Effects in Observational Studies," Management Science, INFORMS, vol. 68(5), pages 3399-3418, May.
  • Handle: RePEc:inm:ormnsc:v:68:y:2022:i:5:p:3399-3418
    DOI: 10.1287/mnsc.2021.4084
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2021.4084
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2021.4084?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Maria R. Ibanez & Jonathan R. Clark & Robert S. Huckman & Bradley R. Staats, 2018. "Discretionary Task Ordering: Queue Management in Radiological Services," Management Science, INFORMS, vol. 64(9), pages 4389-4407, September.
    2. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    3. Tom Fangyun Tan & Serguei Netessine, 2014. "When Does the Devil Make Work? An Empirical Study of the Impact of Workload on Worker Productivity," Management Science, INFORMS, vol. 60(6), pages 1574-1593, June.
    4. Hessam Bavafa & Lorin M. Hitt & Christian Terwiesch, 2018. "The Impact of E-Visits on Visit Frequencies and Patient Health: Evidence from Primary Care," Management Science, INFORMS, vol. 64(12), pages 5461-5480, December.
    5. Heller, Ruth & Rosenbaum, Paul R. & Small, Dylan S., 2009. "Split Samples and Design Sensitivity in Observational Studies," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1090-1101.
    6. Hamsa Bastani & Joel Goh & Mohsen Bayati, 2019. "Evidence of Upcoding in Pay-for-Performance Programs," Management Science, INFORMS, vol. 65(3), pages 1042-1060, March.
    7. King, Gary & Nielsen, Richard, 2019. "Why Propensity Scores Should Not Be Used for Matching," Political Analysis, Cambridge University Press, vol. 27(4), pages 435-454, October.
    8. Rina Friedberg & Julie Tibshirani & Susan Athey & Stefan Wager, 2018. "Local Linear Forests," Papers 1807.11408, arXiv.org, revised Sep 2020.
    9. Sexton, Joseph & Laake, Petter, 2009. "Standard errors for bagged and random forest estimators," Computational Statistics & Data Analysis, Elsevier, vol. 53(3), pages 801-811, January.
    10. Ann P. Bartel & Carri W. Chan & Song-Hee Kim, 2020. "Should Hospitals Keep Their Patients Longer? The Role of Inpatient Care in Reducing Postdischarge Mortality," Management Science, INFORMS, vol. 66(6), pages 2326-2346, June.
    11. Gowrisankaran, Gautam & Town, Robert J., 1999. "Estimating the quality of care in hospitals using instrumental variables," Journal of Health Economics, Elsevier, vol. 18(6), pages 747-767, December.
    12. Matt Taddy & Matt Gardner & Liyun Chen & David Draper, 2016. "A Nonparametric Bayesian Analysis of Heterogenous Treatment Effects in Digital Experimentation," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 661-672, October.
    13. Diwas Singh KC & Christian Terwiesch, 2012. "An Econometric Analysis of Patient Flows in the Cardiac Intensive Care Unit," Manufacturing & Service Operations Management, INFORMS, vol. 14(1), pages 50-65, January.
    14. X Nie & S Wager, 2021. "Quasi-oracle estimation of heterogeneous treatment effects [TensorFlow: A system for large-scale machine learning]," Biometrika, Biometrika Trust, vol. 108(2), pages 299-319.
    15. Susan F. Lu & Huaxia Rui & Abraham Seidmann, 2018. "Does Technology Substitute for Nurses? Staffing Decisions in Nursing Homes," Management Science, INFORMS, vol. 64(4), pages 1842-1859, April.
    16. Lu Tian & Ash A. Alizadeh & Andrew J. Gentles & Robert Tibshirani, 2014. "A Simple Method for Estimating Interactions Between a Treatment and a Large Number of Covariates," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(508), pages 1517-1532, December.
    17. Diwas Singh KC & Christian Terwiesch, 2011. "The Effects of Focus on Performance: Evidence from California Hospitals," Management Science, INFORMS, vol. 57(11), pages 1897-1912, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hiroyuki Egami & Md. Shafiur Rahman & Tsuyoshi Yamamoto & Chihiro Egami & Takahisa Wakabayashi, 2024. "Causal effect of video gaming on mental well-being in Japan 2020–2022," Nature Human Behaviour, Nature, vol. 8(10), pages 1943-1956, October.
    2. Qi Feng & J. George Shanthikumar, 2022. "Developing operations management data analytics," Production and Operations Management, Production and Operations Management Society, vol. 31(12), pages 4544-4557, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Diwas Singh KC & Stefan Scholtes & Christian Terwiesch, 2020. "Empirical Research in Healthcare Operations: Past Research, Present Understanding, and Future Opportunities," Manufacturing & Service Operations Management, INFORMS, vol. 22(1), pages 73-83, January.
    2. Michael Lechner & Jana Mareckova, 2022. "Modified Causal Forest," Papers 2209.03744, arXiv.org.
    3. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    4. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    5. Gabriel Okasa, 2022. "Meta-Learners for Estimation of Causal Effects: Finite Sample Cross-Fit Performance," Papers 2201.12692, arXiv.org.
    6. Lechner, Michael, 2018. "Modified Causal Forests for Estimating Heterogeneous Causal Effects," IZA Discussion Papers 12040, Institute of Labor Economics (IZA).
    7. Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021. "Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence," The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.
    8. Mirko Kremer & Francis de Véricourt, 2022. "Mismanaging diagnostic accuracy under congestion," ESMT Research Working Papers ESMT-22-01, ESMT European School of Management and Technology.
    9. Daniel Goller, 2023. "Analysing a built-in advantage in asymmetric darts contests using causal machine learning," Annals of Operations Research, Springer, vol. 325(1), pages 649-679, June.
    10. Delasay, Mohammad & Ingolfsson, Armann & Kolfal, Bora & Schultz, Kenneth, 2019. "Load effect on service times," European Journal of Operational Research, Elsevier, vol. 279(3), pages 673-686.
    11. David M. Ritzwoller & Vasilis Syrgkanis, 2024. "Simultaneous Inference for Local Structural Parameters with Random Forests," Papers 2405.07860, arXiv.org, revised Sep 2024.
    12. Phillip Heiler & Michael C. Knaus, 2021. "Effect or Treatment Heterogeneity? Policy Evaluation with Aggregated and Disaggregated Treatments," Papers 2110.01427, arXiv.org, revised Aug 2023.
    13. Valente, Marica, 2023. "Policy evaluation of waste pricing programs using heterogeneous causal effect estimation," Journal of Environmental Economics and Management, Elsevier, vol. 117(C).
    14. Miller, Steve, 2020. "Causal forest estimation of heterogeneous and time-varying environmental policy effects," Journal of Environmental Economics and Management, Elsevier, vol. 103(C).
    15. Seokjun Youn & H. Neil Geismar & Michael Pinedo, 2022. "Planning and scheduling in healthcare for better care coordination: Current understanding, trending topics, and future opportunities," Production and Operations Management, Production and Operations Management Society, vol. 31(12), pages 4407-4423, December.
    16. Noa Zychlinski, 2023. "Applications of fluid models in service operations management," Queueing Systems: Theory and Applications, Springer, vol. 103(1), pages 161-185, February.
    17. Costanza Naguib, 2023. "Is the Impact of Opening the Borders Heterogeneous?," Diskussionsschriften dp2312, Universitaet Bern, Departement Volkswirtschaft.
    18. Jillian A. Berry Jaeker & Anita L. Tucker, 2017. "Past the Point of Speeding Up: The Negative Effects of Workload Saturation on Efficiency and Patient Severity," Management Science, INFORMS, vol. 63(4), pages 1042-1062, April.
    19. Newham, Melissa & Valente, Marica, 2024. "The cost of influence: How gifts to physicians shape prescriptions and drug costs," Journal of Health Economics, Elsevier, vol. 95(C).
    20. Michael Lechner & Jana Mareckova, 2024. "Comprehensive Causal Machine Learning," Papers 2405.10198, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:68:y:2022:i:5:p:3399-3418. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.