Value enhancement of reinforcement learning via efficient and robust trust region optimization
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Linbo Wang & Eric Tchetgen Tchetgen, 2018. "Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(3), pages 531-550, June.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Mao, Lu, 2022. "Identification of the outcome distribution and sensitivity analysis under weak confounder–instrument interaction," Statistics & Probability Letters, Elsevier, vol. 189(C).
- Yan Liu, 2022. "Policy Learning under Endogeneity Using Instrumental Variables," Papers 2206.09883, arXiv.org, revised Mar 2024.
- Benjamin R. Baer & Robert L. Strawderman & Ashkan Ertefaie, 2023. "Discussion on “Instrumental variable estimation of the causal hazard ratio,” by Linbo Wang, Eric Tchetgen Tchetgen, Torben Martinussen, and Stijn Vansteelandt," Biometrics, The International Biometric Society, vol. 79(2), pages 554-558, June.
- Ting Ye & Ashkan Ertefaie & James Flory & Sean Hennessy & Dylan S. Small, 2023. "Instrumented difference‐in‐differences," Biometrics, The International Biometric Society, vol. 79(2), pages 569-581, June.
- Dingke Tang & Dehan Kong & Wenliang Pan & Linbo Wang, 2023. "Ultra‐high dimensional variable selection for doubly robust causal inference," Biometrics, The International Biometric Society, vol. 79(2), pages 903-914, June.
- Shaojie Wei & Chao Zhang & Zhi Geng & Shanshan Luo, 2024. "Identifiability and Estimation for Potential-Outcome Means with Misclassified Outcomes," Mathematics, MDPI, vol. 12(18), pages 1-19, September.
- Abhinandan Dalal & Patrick Blobaum & Shiva Kasiviswanathan & Aaditya Ramdas, 2024. "Anytime-Valid Inference for Double/Debiased Machine Learning of Causal Parameters," Papers 2408.09598, arXiv.org, revised Sep 2024.
- Linbo Wang & Eric Tchetgen Tchetgen & Torben Martinussen & Stijn Vansteelandt, 2023. "Instrumental variable estimation of the causal hazard ratio," Biometrics, The International Biometric Society, vol. 79(2), pages 539-550, June.
- Shixiao Zhang & Peisong Han & Changbao Wu, 2023. "Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference," International Statistical Review, International Statistical Institute, vol. 91(2), pages 165-192, August.
- Hongming Pu & Bo Zhang, 2021. "Estimating optimal treatment rules with an instrumental variable: A partial identification learning approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 318-345, April.
- Martin Emil Jakobsen & Jonas Peters, 2020. "Distributional robustness of K-class estimators and the PULSE," Papers 2005.03353, arXiv.org, revised Mar 2022.
- Choi, Jin-young & Lee, Goeun & Lee, Myoung-jae, 2023. "Endogenous treatment effect for any response conditional on control propensity score," Statistics & Probability Letters, Elsevier, vol. 196(C).
- Cui, Yifan & Tchetgen Tchetgen, Eric, 2021. "On a necessary and sufficient identification condition of optimal treatment regimes with an instrumental variable," Statistics & Probability Letters, Elsevier, vol. 178(C).
- Linbo Wang & Eric Tchetgen Tchetgen & Torben Martinussen & Stijn Vansteelandt, 2023. "Rejoinder to discussions on “Instrumental variable estimation of the causal hazard ratio”," Biometrics, The International Biometric Society, vol. 79(2), pages 564-568, June.
- Zhichao Jiang & Shu Yang & Peng Ding, 2022. "Multiply robust estimation of causal effects under principal ignorability," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1423-1445, September.
- Myoung‐jae Lee, 2021. "Instrument residual estimator for any response variable with endogenous binary treatment," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(3), pages 612-635, July.
- Shuxiao Chen & Bo Zhang, 2021. "Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable," Papers 2104.07822, arXiv.org.
- Haoyu Wei & Hengrui Cai & Chengchun Shi & Rui Song, 2024. "On Efficient Inference of Causal Effects with Multiple Mediators," Papers 2401.05517, arXiv.org.
More about this item
Keywords
mobile health studies; offline reinforcement learning; semi-parametric efficiency; trust region optimization;All these keywords.
JEL classification:
- C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
NEP fields
This paper has been announced in the following NEP Reports:- NEP-BIG-2024-08-26 (Big Data)
- NEP-CMP-2024-08-26 (Computational Economics)
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:122756. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.