IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v34y2022i3p1626-1643.html
   My bibliography  Save this article

Causal Rule Sets for Identifying Subgroups with Enhanced Treatment Effects

Author

Listed:
  • Tong Wang

    (Tippie College of Business, University of Iowa, Iowa City, Iowa 52242)

  • Cynthia Rudin

    (Department of Computer Science, Duke University, Durham, North Carolina 27708)

Abstract

A key question in causal inference analyses is how to find subgroups with elevated treatment effects. This paper takes a machine learning approach and introduces a generative model, causal rule sets (CRS), for interpretable subgroup discovery. A CRS model uses a small set of short decision rules to capture a subgroup in which the average treatment effect is elevated. We present a Bayesian framework for learning a causal rule set. The Bayesian model consists of a prior that favors simple models for better interpretability as well as avoiding overfitting and a Bayesian logistic regression that captures the likelihood of data, characterizing the relation between outcomes, attributes, and subgroup membership. The Bayesian model has tunable parameters that can characterize subgroups with various sizes, providing users with more flexible choices of models from the treatment-efficient frontier . We find maximum a posteriori models using iterative discrete Monte Carlo steps in the joint solution space of rules sets and parameters. To improve search efficiency, we provide theoretically grounded heuristics and bounding strategies to prune and confine the search space. Experiments show that the search algorithm can efficiently recover true underlying subgroups. We apply CRS on public and real-world data sets from domains in which interpretability is indispensable. We compare CRS with state-of-the-art rule-based subgroup discovery models. Results show that CRS achieves consistently competitive performance on data sets from various domains, represented by high treatment-efficient frontiers. Summary of Contribution: This paper is motivated by the large heterogeneity of treatment effect in many applications and the need to accurately locate subgroups for enhanced treatment effect. Existing methods either rely on prior hypotheses to discover subgroups or greedy methods, such as tree-based recursive partitioning. Our method adopts a machine learning approach to find an optimal subgroup learned with a carefully global objective. Our model is more flexible in capturing subgroups by using a set of short decision rules compared with tree-based baselines. We evaluate our model using a novel metric, treatment-efficient frontier, that characterizes the trade-off between the subgroup size and achievable treatment effect, and our model demonstrates better performance than baseline models.

Suggested Citation

  • Tong Wang & Cynthia Rudin, 2022. "Causal Rule Sets for Identifying Subgroups with Enhanced Treatment Effects," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1626-1643, May.
  • Handle: RePEc:inm:orijoc:v:34:y:2022:i:3:p:1626-1643
    DOI: 10.1287/ijoc.2021.1143
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijoc.2021.1143
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2021.1143?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Daniel B. Neill, 2012. "Fast subset scan for spatial pattern detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(2), pages 337-360, March.
    2. David Figlio & Jonathan Guryan & Krzysztof Karbownik & Jeffrey Roth, 2014. "The Effects of Poor Neonatal Health on Children's Cognitive Development," American Economic Review, American Economic Association, vol. 104(12), pages 3921-3955, December.
    3. Sekhon, Jasjeet S., 2011. "Multivariate and Propensity Score Matching Software with Automated Balance Optimization: The Matching package for R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 42(i07).
    4. John B. Holbein & D. Sunshine Hillygus, 2016. "Making Young Voters: The Impact of Preregistration on Youth Turnout," American Journal of Political Science, John Wiley & Sons, vol. 60(2), pages 364-382, April.
    5. Raj Chetty & Nathaniel Hendren & Lawrence F. Katz, 2016. "The Effects of Exposure to Better Neighborhoods on Children: New Evidence from the Moving to Opportunity Experiment," American Economic Review, American Economic Association, vol. 106(4), pages 855-902, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Heejun Shin & Joseph Antonelli, 2023. "Improved inference for doubly robust estimators of heterogeneous treatment effects," Biometrics, The International Biometric Society, vol. 79(4), pages 3140-3152, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Douglas Almond & Janet Currie & Valentina Duque, 2018. "Childhood Circumstances and Adult Outcomes: Act II," Journal of Economic Literature, American Economic Association, vol. 56(4), pages 1360-1446, December.
    2. Guirong Li & Jiajia Xu & Liying Li & Zhaolei Shi & Hongmei Yi & James Chu & Elena Kardanova & Yanyan Li & Prashant Loyalka & Scott Rozelle, 2020. "The Impacts of Highly Resourced Vocational Schools on Student Outcomes in China," China & World Economy, Institute of World Economics and Politics, Chinese Academy of Social Sciences, vol. 28(6), pages 125-150, November.
    3. Paul Bingley & Lorenzo Cappellari & Konstantinos Tatsiramos, 2014. "Family, Community and Long-Term Earnings Inequality," DISCE - Working Papers del Dipartimento di Economia e Finanza def017, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
    4. Paul R. Flora, 2021. "Regional Spotlight: Poverty in Philadelphia, and Beyond," Economic Insights, Federal Reserve Bank of Philadelphia, vol. 6(4), pages 16-22, December.
    5. Deepak Saraswat, 2022. "Labor Market Impacts of Exposure to Affordable Housing Supply: Evidence from the Low-Income Housing Tax Credit Program," Working papers 2022-09, University of Connecticut, Department of Economics.
    6. Andrés Rodríguez-Pose & Michael Storper, 2020. "Housing, urban growth and inequalities: The limits to deregulation and upzoning in reducing economic and spatial inequality," Urban Studies, Urban Studies Journal Limited, vol. 57(2), pages 223-248, February.
    7. Gordon B. Dahl & Anne C. Gielen, 2021. "Intergenerational Spillovers in Disability Insurance," American Economic Journal: Applied Economics, American Economic Association, vol. 13(2), pages 116-150, April.
    8. Patricio S Dalton & Victor H Gonzalez Jimenez & Charles N Noussair, 2017. "Exposure to Poverty and Productivity," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-19, January.
    9. Francesco Andreoli & Eugenio Peluso, 2016. "So close yet so unequal: Reconsidering spatial inequality in U.S. cities," Working Papers 21/2016, University of Verona, Department of Economics.
    10. Stephen B. Billings & Mark Hoekstra, 2019. "Schools, Neighborhoods, and the Long-Run Effect of Crime-Prone Peers," NBER Working Papers 25730, National Bureau of Economic Research, Inc.
    11. Martti Kaila & Emily Nix & Krista Riukula, 2021. "Disparate Impacts of Job Loss by Parental Income and Implications for Intergenerational Mobility," Opportunity and Inclusive Growth Institute Working Papers 53, Federal Reserve Bank of Minneapolis.
    12. Xi Chen & Chih Ming Tan & Xiaobo Zhang & Xin Zhang, 2020. "The effects of prenatal exposure to temperature extremes on birth outcomes: the case of China," Journal of Population Economics, Springer;European Society for Population Economics, vol. 33(4), pages 1263-1302, October.
    13. Ralf Becker & Maggy Fostier, 2015. "Evaluating non-compulsory educational interventions - the case of peer assisted study groups," Economics Discussion Paper Series 1509, Economics, The University of Manchester.
    14. Bratu, Cristina & Bolotnyy, Valentin, 2023. "Immigrant intergenerational mobility: A focus on childhood environment," European Economic Review, Elsevier, vol. 151(C).
    15. Fabian Kosse & Thomas Deckers & Pia Pinger & Hannah Schildberg-Hörisch & Armin Falk, 2020. "The Formation of Prosociality: Causal Evidence on the Role of Social Environment," Journal of Political Economy, University of Chicago Press, vol. 128(2), pages 434-467.
    16. Lin, Dajun & Lutter, Randall & Ruhm, Christopher J., 2018. "Cognitive performance and labour market outcomes," Labour Economics, Elsevier, vol. 51(C), pages 121-135.
    17. Michael Geruso & Timothy J. Layton & Jacob Wallace, 2023. "What Difference Does a Health Plan Make? Evidence from Random Plan Assignment in Medicaid," American Economic Journal: Applied Economics, American Economic Association, vol. 15(3), pages 341-379, July.
    18. Erich Battistin & Lorenzo Neri, 2017. "School Performance, Score Inflation and Economic Geography," Working Papers 837, Queen Mary University of London, School of Economics and Finance.
    19. Alex Bell & Raj Chetty & Xavier Jaravel & Neviana Petkova & John Van Reenen, 2019. "Who Becomes an Inventor in America? The Importance of Exposure to Innovation," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(2), pages 647-713.
    20. Barbara Broadway & Anna Zhu, 2023. "Spatial heterogeneity in welfare reform success," Melbourne Institute Working Paper Series wp2023n13, Melbourne Institute of Applied Economic and Social Research, The University of Melbourne.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:34:y:2022:i:3:p:1626-1643. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.