A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Baqun Zhang & Anastasios A. Tsiatis & Eric B. Laber & Marie Davidian, 2012. "A Robust Method for Estimating Optimal Treatment Regimes," Biometrics, The International Biometric Society, vol. 68(4), pages 1010-1018, December.
- Iavor Bojinov & Neil Shephard, 2019. "Time Series Experiments and Causal Estimands: Exact Randomization Tests and Trading," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(528), pages 1665-1682, October.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003.
"Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score,"
Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2000. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," NBER Technical Working Papers 0251, National Bureau of Economic Research, Inc.
- Guido Imbens, 2000. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometric Society World Congress 2000 Contributed Papers 1166, Econometric Society.
- Eric B. Laber & Nick J. Meyer & Brian J. Reich & Krishna Pacifici & Jaime A. Collazo & John M. Drake, 2018. "Optimal treatment allocations in space and time for on‐line control of an emerging infectious disease," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(4), pages 743-789, August.
- Marc Rysman, 2009. "The Economics of Two-Sided Markets," Journal of Economic Perspectives, American Economic Association, vol. 23(3), pages 125-143, Summer.
- S. A. Murphy, 2003. "Optimal dynamic treatment regimes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 331-355, May.
- Baqun Zhang & Anastasios A. Tsiatis & Eric B. Laber & Marie Davidian, 2013. "Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions," Biometrika, Biometrika Trust, vol. 100(3), pages 681-694.
- Andrei Hagiu & Julian Wright, 2019. "The status of workers and platforms in the sharing economy," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 28(1), pages 97-108, January.
- Lan Wang & Yu Zhou & Rui Song & Ben Sherwood, 2018. "Quantile-Optimal Treatment Regimes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1243-1254, July.
- Chengchun Shi & Rui Song & Wenbin Lu & Bo Fu, 2018. "Maximin projection learning for optimal treatment decision with heterogeneous individualized treatment effects," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 681-702, September.
- Stefan Wager & Susan Athey, 2018.
"Estimation and Inference of Heterogeneous Treatment Effects using Random Forests,"
Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
- Wager, Stefan & Athey, Susan, 2017. "Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests," Research Papers 3576, Stanford University, Graduate School of Business.
- Ying-Qi Zhao & Donglin Zeng & Eric B. Laber & Michael R. Kosorok, 2015. "New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 583-598, June.
- Audrey Boruvka & Daniel Almirall & Katie Witkiewitz & Susan A. Murphy, 2018. "Assessing Time-Varying Causal Effect Moderation in Mobile Health," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1112-1121, July.
- Hudgens, Michael G. & Halloran, M. Elizabeth, 2008. "Toward Causal Inference With Interference," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 832-842, June.
- Yingqi Zhao & Donglin Zeng & A. John Rush & Michael R. Kosorok, 2012. "Estimating Individualized Treatment Rules Using Outcome Weighted Learning," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1106-1118, September.
- repec:bla:biomet:v:71:y:2015:i:4:p:895-904 is not listed on IDEAS
- Roland A. Matsouaka & Junlong Li & Tianxi Cai, 2014. "Evaluating marker-guided treatment selection strategies," Biometrics, The International Biometric Society, vol. 70(3), pages 489-499, September.
- A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017.
"Program Evaluation and Causal Inference With High‐Dimensional Data,"
Econometrica, Econometric Society, vol. 85, pages 233-298, January.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fern'andez-Val & Christian Hansen, 2013. "Program Evaluation and Causal Inference with High-Dimensional Data," Papers 1311.2645, arXiv.org, revised Jan 2018.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2016. "Program evaluation and causal inference with high-dimensional data," CeMMAP working papers 13/16, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2016. "Program evaluation and causal inference with high-dimensional data," CeMMAP working papers CWP13/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Shi, Chengchun & Song, Rui & Lu, Wenbin & Fu, Bo, 2018. "Maximin projection learning for optimal treatment decision with heterogeneous individualized treatment effects," LSE Research Online Documents on Economics 102112, London School of Economics and Political Science, LSE Library.
- Shi, Chengchun & Wang, Xiaoyu & Luo, Shikai & Zhu, Hongtu & Ye, Jieping & Song, Rui, 2022. "Dynamic causal effects evaluation in A/B testing with a reinforcement learning framework," LSE Research Online Documents on Economics 113310, London School of Economics and Political Science, LSE Library.
- Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881, September.
- Ruoqing Zhu & Ying-Qi Zhao & Guanhua Chen & Shuangge Ma & Hongyu Zhao, 2017. "Greedy outcome weighted tree learning of optimal personalized treatment rules," Biometrics, The International Biometric Society, vol. 73(2), pages 391-400, June.
- Chengchun Shi & Sheng Zhang & Wenbin Lu & Rui Song, 2022. "Statistical inference of the value function for reinforcement learning in infinite‐horizon settings," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 765-793, July.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Shi, Chengchun & Luo, Shikai & Le, Yuan & Zhu, Hongtu & Song, Rui, 2022. "Statistically efficient advantage learning for offline reinforcement learning in infinite horizons," LSE Research Online Documents on Economics 115598, London School of Economics and Political Science, LSE Library.
- Zhen Li & Jie Chen & Eric Laber & Fang Liu & Richard Baumgartner, 2023. "Optimal Treatment Regimes: A Review and Empirical Comparison," International Statistical Review, International Statistical Institute, vol. 91(3), pages 427-463, December.
- Yunan Wu & Lan Wang, 2021. "Resampling‐based confidence intervals for model‐free robust inference on optimal treatment regimes," Biometrics, The International Biometric Society, vol. 77(2), pages 465-476, June.
- Gao, Yuhe & Shi, Chengchun & Song, Rui, 2023. "Deep spectral Q-learning with application to mobile health," LSE Research Online Documents on Economics 119445, London School of Economics and Political Science, LSE Library.
- Davide Viviano & Jelena Bradic, 2019. "Synthetic learner: model-free inference on treatments over time," Papers 1904.01490, arXiv.org, revised Aug 2022.
- Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021.
"Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence,"
The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.
- Knaus, Michael C. & Lechner, Michael & Strittmatter, Anthony, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," IZA Discussion Papers 12039, Institute of Labor Economics (IZA).
- Lechner, Michael & Knaus, Michael C. & Strittmatter, Anthony, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," CEPR Discussion Papers 13402, C.E.P.R. Discussion Papers.
- Knaus, Michael C. & Lechner, Michael & anthony.strittmatter@unisg.ch, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," Economics Working Paper Series 1817, University of St. Gallen, School of Economics and Political Science.
- Michael C. Knaus & Michael Lechner & Anthony Strittmatter, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," Papers 1810.13237, arXiv.org, revised Dec 2018.
- Xin Qiu & Donglin Zeng & Yuanjia Wang, 2018. "Estimation and evaluation of linear individualized treatment rules to guarantee performance," Biometrics, The International Biometric Society, vol. 74(2), pages 517-528, June.
- Michael C. Knaus & Michael Lechner & Anthony Strittmatter, 2022.
"Heterogeneous Employment Effects of Job Search Programs: A Machine Learning Approach,"
Journal of Human Resources, University of Wisconsin Press, vol. 57(2), pages 597-636.
- Knaus, Michael C. & Lechner, Michael & Strittmatter, Anthony, 2017. "Heterogeneous Employment Effects of Job Search Programmes: A Machine Learning Approach," IZA Discussion Papers 10961, Institute of Labor Economics (IZA).
- Michael Knaus & Michael Lechner & Anthony Strittmatter, 2017. "Heterogeneous Employment Effects of Job Search Programmes: A Machine Learning Approach," Papers 1709.10279, arXiv.org, revised May 2018.
- Lechner, Michael & Strittmatter, Anthony & Knaus, Michael C., 2017. "Heterogeneous Employment Effects of Job Search Programmes: A Machine Learning Approach," CEPR Discussion Papers 12224, C.E.P.R. Discussion Papers.
- Knaus, Michael C. & Lechner, Michael & Strittmatter, Anthony, 2017. "Heterogeneous Employment Effects of Job Search Programmes: A Machine Learning Approach," Economics Working Paper Series 1711, University of St. Gallen, School of Economics and Political Science.
- Davide Viviano & Jelena Bradic, 2021. "Dynamic covariate balancing: estimating treatment effects over time with potential local projections," Papers 2103.01280, arXiv.org, revised Jan 2024.
- Chengchun Shi & Sheng Zhang & Wenbin Lu & Rui Song, 2022. "Statistical inference of the value function for reinforcement learning in infinite‐horizon settings," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 765-793, July.
- Ashesh Rambachan & Neil Shephard, 2019. "Econometric analysis of potential outcomes time series: instruments, shocks, linearity and the causal response function," Papers 1903.01637, arXiv.org, revised Feb 2020.
- Li, Ting & Shi, Chengchun & Lu, Zhaohua & Li, Yi & Zhu, Hongtu, 2024. "Evaluating dynamic conditional quantile treatment effects with applications in ridesharing," LSE Research Online Documents on Economics 122488, London School of Economics and Political Science, LSE Library.
- Zhou, Yunzhe & Qi, Zhengling & Shi, Chengchun & Li, Lexin, 2023. "Optimizing pessimism in dynamic treatment regimes: a Bayesian learning approach," LSE Research Online Documents on Economics 118233, London School of Economics and Political Science, LSE Library.
- Kristin A. Linn & Eric B. Laber & Leonard A. Stefanski, 2017. "Interactive -Learning for Quantiles," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 638-649, April.
- Hyung G. Park & Danni Wu & Eva Petkova & Thaddeus Tarpey & R. Todd Ogden, 2023. "Bayesian Index Models for Heterogeneous Treatment Effects on a Binary Outcome," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(2), pages 397-418, July.
- Rebecca Hager & Anastasios A. Tsiatis & Marie Davidian, 2018. "Optimal two‐stage dynamic treatment regimes from a classification perspective with censored survival data," Biometrics, The International Biometric Society, vol. 74(4), pages 1180-1192, December.
- Baojiang Chen & Ao Yuan & Jing Qin, 2022. "Pool adjacent violators algorithm–assisted learning with application on estimating optimal individualized treatment regimes," Biometrics, The International Biometric Society, vol. 78(4), pages 1475-1488, December.
- Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018.
"High-dimensional econometrics and regularized GMM,"
CeMMAP working papers
CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-Dimensional Econometrics and Regularized GMM," Papers 1806.01888, arXiv.org, revised Jun 2018.
- Q. Clairon & R. Henderson & N. J. Young & E. D. Wilson & C. J. Taylor, 2021. "Adaptive treatment and robust control," Biometrics, The International Biometric Society, vol. 77(1), pages 223-236, March.
- Ruoqing Zhu & Ying-Qi Zhao & Guanhua Chen & Shuangge Ma & Hongyu Zhao, 2017. "Greedy outcome weighted tree learning of optimal personalized treatment rules," Biometrics, The International Biometric Society, vol. 73(2), pages 391-400, June.
More about this item
Keywords
reinforcement learning; policy evaluation; multiagent system; spatiotemporal studies; DMS-2003637; EP/W014971/1;All these keywords.
JEL classification:
- C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:117174. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.