IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0223360.html
   My bibliography  Save this article

Robust policy evaluation from large-scale observational studies

Author

Listed:
  • Md Saiful Islam
  • Md Sarowar Morshed
  • Gary J Young
  • Md Noor-E-Alam

Abstract

Under the current policy decision making paradigm we make or evaluate a policy decision by intervening different socio-economic parameters and analyzing the impact of those interventions. This process involves identifying the causal relation between interventions and outcomes. Matching method is one of the popular techniques to identify such causal relations. However, in one-to-one matching, when a treatment or control unit has multiple pair assignment options with similar match quality, different matching algorithms often assign different pairs. Since all the matching algorithms assign pairs without considering the outcomes, it is possible that with the same data and same hypothesis, different experimenters can reach different conclusions creating an uncertainty in policy decision making. This problem becomes more prominent in the case of large-scale observational studies as there are more pair assignment options. Recently, a robust approach has been proposed to tackle the uncertainty that uses an integer programming model to explore all possible assignments. Though the proposed integer programming model is very efficient in making robust causal inference, it is not scalable to big data observational studies. With the current approach, an observational study with 50,000 samples will generate hundreds of thousands binary variables. Solving such integer programming problem is computationally expensive and becomes even worse with the increase of sample size. In this work, we consider causal inference testing with binary outcomes and propose computationally efficient algorithms that are adaptable for large-scale observational studies. By leveraging the structure of the optimization model, we propose a robustness condition that further reduces the computational burden. We validate the efficiency of the proposed algorithms by testing the causal relation between the Medicare Hospital Readmission Reduction Program (HRRP) and non-index readmissions (i.e., readmission to a hospital that is different from the hospital that discharged the patient) from the State of California Patient Discharge Database from 2010 to 2014. Our result shows that HRRP has a causal relation with the increase in non-index readmissions. The proposed algorithms proved to be highly scalable in testing causal relations from large-scale observational studies.

Suggested Citation

  • Md Saiful Islam & Md Sarowar Morshed & Gary J Young & Md Noor-E-Alam, 2019. "Robust policy evaluation from large-scale observational studies," PLOS ONE, Public Library of Science, vol. 14(10), pages 1-19, October.
  • Handle: RePEc:plo:pone00:0223360
    DOI: 10.1371/journal.pone.0223360
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0223360
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0223360&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0223360?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan & Ziad Obermeyer, 2015. "Prediction Policy Problems," American Economic Review, American Economic Association, vol. 105(5), pages 491-495, May.
    2. Alexander G. Nikolaev & Sheldon H. Jacobson & Wendy K. Tam Cho & Jason J. Sauppe & Edward C. Sewell, 2013. "Balance Optimization Subset Selection (BOSS): An Alternative Approach for Causal Inference with Observational Data," Operations Research, INFORMS, vol. 61(2), pages 398-412, April.
    3. Christakis, Nicholas A. & Iwashyna, Theodore J., 2003. "The health impact of health care on families: a matched cohort study of hospice use by decedents and mortality outcomes in surviving, widowed spouses," Social Science & Medicine, Elsevier, vol. 57(3), pages 465-475, August.
    4. Iacus, Stefano & King, Gary & Porro, Giuseppe, 2009. "cem: Software for Coarsened Exact Matching," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 30(i09).
    5. Hong, Guanglei & Raudenbush, Stephen W., 2006. "Evaluating Kindergarten Retention Policy: A Case Study of Causal Inference for Multilevel Observational Data," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 901-910, September.
    6. José R. Zubizarreta & Luke Keele, 2017. "Optimal Multilevel Matching in Clustered Observational Studies: A Case Study of the Effectiveness of Private Schools Under a Large-Scale Voucher System," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 547-560, April.
    7. Quinn McNemar, 1947. "Note on the sampling error of the difference between correlated proportions or percentages," Psychometrika, Springer;The Psychometric Society, vol. 12(2), pages 153-157, June.
    8. Astrid Kiil, 2012. "Does employment-based private health insurance increase the use of covered health care services? A matching estimator approach," International Journal of Health Economics and Management, Springer, vol. 12(1), pages 1-38, March.
    9. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2011. "Multivariate Matching Methods That Are Monotonic Imbalance Bounding," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 345-361.
    10. Alexis Diamond & Jasjeet S. Sekhon, 2013. "Genetic Matching for Estimating Causal Effects: A General Multivariate Matching Method for Achieving Balance in Observational Studies," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 932-945, July.
    11. Min Chen, 2018. "Reducing excess hospital readmissions: Does destination matter?," International Journal of Health Economics and Management, Springer, vol. 18(1), pages 67-82, March.
    12. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    13. King, Gary & Nielsen, Richard, 2019. "Why Propensity Scores Should Not Be Used for Matching," Political Analysis, Cambridge University Press, vol. 27(4), pages 435-454, October.
    14. José R. Zubizarreta, 2015. "Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 910-922, September.
    15. Ben B. Hansen, 2004. "Full Matching in an Observational Study of Coaching for the SAT," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 609-618, January.
    16. Kiil, Astrid, 2012. "Private health insurance and the use of health care services - a review of the theoretical literature with application to voluntary private health insurance in universal health care systems," DaCHE discussion papers 2012:1, University of Southern Denmark, Dache - Danish Centre for Health Economics.
    17. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2012. "Causal Inference without Balance Checking: Coarsened Exact Matching," Political Analysis, Cambridge University Press, vol. 20(1), pages 1-24, January.
    18. José R. Zubizarreta, 2012. "Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure After Surgery," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1360-1371, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    2. Md Saiful Islam & Md Sarowar Morshed & Md. Noor-E-Alam, 2022. "A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3023-3041, November.
    3. Jason J. Sauppe & Sheldon H. Jacobson, 2017. "The role of covariate balance in observational studies," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(4), pages 323-344, June.
    4. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    5. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    6. Marco Morucci & Md. Noor-E-Alam & Cynthia Rudin, 2022. "A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference," INFORMS Joural on Data Science, INFORMS, vol. 1(2), pages 156-171, October.
    7. Keeler, Zachary T. & Stephens, Heather M., 2020. "Valuing shale gas development in resource-dependent communities," Resources Policy, Elsevier, vol. 69(C).
    8. Kotsogiannis, Christos & Salvadori, Luca & Karangwa, John & Mukamana, Theonille, 2024. "Do tax audits have a dynamic impact? Evidence from corporate income tax administrative data," Journal of Development Economics, Elsevier, vol. 170(C).
    9. Massimo Baldini & Giovanni Gallo & Costanza Torricelli, 2020. "The scars of scarcity in the short run: an empirical investigation across Europe," Economia Politica: Journal of Analytical and Institutional Economics, Springer;Fondazione Edison, vol. 37(3), pages 1033-1069, October.
    10. Alessi, Lucia & Battiston, Stefano & Kvedaras, Virmantas, 2024. "Over with carbon? Investors’ reaction to the Paris Agreement and the US withdrawal," Journal of Financial Stability, Elsevier, vol. 71(C).
    11. Avni, Omrit & Guetzkow, Joshua & Hasisi, Badi, 2024. "Bias in prosecutorial decision making: Bridging focal concerns & group threat," Journal of Criminal Justice, Elsevier, vol. 92(C).
    12. Alessi, Lucia & Battiston, Stefano & Kvedaras, Virmantas, 2021. "Over with carbon? Investors' reaction to the Paris Agreement and the US withdrawal," JRC Working Papers in Economics and Finance 2021-12, Joint Research Centre, European Commission.
    13. Zhang, Chi & Managi, Shunsuke, 2020. "Functional social support and maternal stress: A study on the 2017 paid parental leave reform in Japan," Economic Analysis and Policy, Elsevier, vol. 65(C), pages 153-172.
    14. Christos Kotsogiannis & Luca Salvadori & John Karangwa & Innocente Murasi, 2024. "E-invoicing, Tax Audits and VAT Compliance," Working Papers 1454, Barcelona School of Economics.
    15. Leduc, Elisabeth & Tojerow, Ilan, 2020. "Subsidizing Domestic Services as a Tool to Fight Unemployment: Effectiveness and Hidden Costs," IZA Discussion Papers 13544, Institute of Labor Economics (IZA).
    16. Philipp vom Berge & Achim Schmillen, 2023. "Effects of mass layoffs on local employment—evidence from geo-referenced data," Journal of International Economic Law, Oxford University Press, vol. 23(3), pages 509-539.
    17. Sara Pavone & Elena Ragazzi & Lisa Sella, 2015. "Sostenere le imprese agro-industriali in Piemonte: un?analisi controfattuale," SCIENZE REGIONALI, FrancoAngeli Editore, vol. 2015(3 Suppl.), pages 129-143.
    18. Davidson Heath & Giorgo Sertsios, 2022. "Profitability and Financial Leverage: Evidence from a Quasi-Natural Experiment," Management Science, INFORMS, vol. 68(11), pages 8386-8410, November.
    19. Khanna, Rajat, 2021. "Aftermath of a tragedy: A star's death and coauthors’ subsequent productivity," Research Policy, Elsevier, vol. 50(2).
    20. Datta, Nirupam, 2015. "Evaluating Impacts of Watershed Development Program on Agricultural Productivity, Income, and Livelihood in Bhalki Watershed of Bardhaman District, West Bengal," World Development, Elsevier, vol. 66(C), pages 443-456.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0223360. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.