Robust Reinforcement Learning with Dynamic Distortion Risk Measures

Author

Listed:

Anthony Coache
Sebastian Jaimungal

Abstract

In a reinforcement learning (RL) setting, the agent's optimal strategy heavily depends on her risk preferences and the underlying model dynamics of the training environment. These two aspects influence the agent's ability to make well-informed and time-consistent decisions when facing testing environments. In this work, we devise a framework to solve robust risk-aware RL problems where we simultaneously account for environmental uncertainty and risk with a class of dynamic robust distortion risk measures. Robustness is introduced by considering all models within a Wasserstein ball around a reference model. We estimate such dynamic robust risk measures using neural networks by making use of strictly consistent scoring functions, derive policy gradient formulae using the quantile representation of distortion risk measures, and construct an actor-critic algorithm to solve this class of robust risk-aware RL problems. We demonstrate the performance of our algorithm on a portfolio allocation example.
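The abstract names two ingredients that a small numerical sketch can make concrete: the quantile representation of a distortion risk measure, and a strictly consistent scoring function. The Python below is only an illustration under stated assumptions, not the authors' algorithm. It assumes the common form rho(X) = ∫ F⁻¹(u) γ(u) du, where the weight function γ is nonnegative and integrates to one, and it uses the pinball loss, a standard strictly consistent score for quantiles. The function names (`distortion_risk`, `cvar_weight`, `pinball_score`) are hypothetical, and the sketch involves no Wasserstein ball, neural network, or policy gradient.

```python
import numpy as np


def distortion_risk(samples, weight_fn):
    """Sample estimate of rho(X) = int_0^1 F^{-1}(u) * gamma(u) du.

    The sorted samples stand in for the quantile function, evaluated at
    the midpoints of n equal-width bins of (0, 1). This is an assumed
    discretisation chosen for illustration.
    """
    x = np.sort(np.asarray(samples, dtype=float))
    n = x.size
    u = (np.arange(n) + 0.5) / n      # bin midpoints in (0, 1)
    w = weight_fn(u)
    w = w / w.sum()                   # normalise so the weights sum to one
    return float(np.dot(w, x))


def cvar_weight(alpha):
    """Weight function gamma(u) = 1{u > alpha} / (1 - alpha), giving CVaR."""
    return lambda u: (u > alpha).astype(float) / (1.0 - alpha)


def pinball_score(z, samples, alpha):
    """Average pinball loss of candidate z for the alpha-quantile.

    Its expectation is minimised at an alpha-quantile, which is what makes
    it usable as a training loss for a quantile estimate.
    """
    x = np.asarray(samples, dtype=float)
    return float(np.mean(((x <= z).astype(float) - alpha) * (z - x)))


if __name__ == "__main__":
    costs = np.arange(1, 101)                      # toy costs 1, 2, ..., 100
    print(distortion_risk(costs, cvar_weight(0.9)))   # mean of the worst 10%
    print(distortion_risk(costs, np.ones_like))       # flat weight: plain mean
    print(pinball_score(90.0, costs, 0.9), pinball_score(50.0, costs, 0.9))
```

With a flat weight the estimate reduces to the sample mean, while the CVaR weight averages only the upper tail of the costs. Other distortion risk measures correspond to other choices of `weight_fn`.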

Suggested Citation

Anthony Coache & Sebastian Jaimungal, 2024. "Robust Reinforcement Learning with Dynamic Distortion Risk Measures," Papers 2409.10096, arXiv.org, revised Apr 2025.

Handle: RePEc:arx:papers:2409.10096

References listed on IDEAS

Paul Milgrom & Ilya Segal, 2002. "Envelope Theorems for Arbitrary Choice Sets," Econometrica, Econometric Society, vol. 70(2), pages 583-601, March.
Gneiting, Tilmann, 2011. "Making and Evaluating Point Forecasts," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 746-762.
Saeed Marzban & Erick Delage & Jonathan Yu-Meng Li, 2023. "Deep reinforcement learning for option pricing and hedging under dynamic expectile risk measures," Quantitative Finance, Taylor & Francis Journals, vol. 23(10), pages 1411-1430, October.
Silvana M. Pesenti & Sebastian Jaimungal & Yuri F. Saporito & Rodrigo S. Targino, 2023. "Risk Budgeting Allocation for Dynamic Risk Measures," Papers 2305.11319, arXiv.org, revised Oct 2024.
Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
Carole Bernard & Silvana M. Pesenti & Steven Vanduffel, 2024. "Robust distortion risk measures," Mathematical Finance, Wiley Blackwell, vol. 34(3), pages 774-818, July.
- Carole Bernard & Silvana M. Pesenti & Steven Vanduffel, 2022. "Robust Distortion Risk Measures," Papers 2205.08850, arXiv.org, revised Mar 2023.
Yuhong Xu, 2014. "Robust valuation and risk measurement under model uncertainty," Papers 1407.8024, arXiv.org.
David Wu & Sebastian Jaimungal, 2023. "Robust Risk-Aware Option Hedging," Applied Mathematical Finance, Taylor & Francis Journals, vol. 30(3), pages 153-174, May.
- David Wu & Sebastian Jaimungal, 2023. "Robust Risk-Aware Option Hedging," Papers 2303.15216, arXiv.org, revised Dec 2023.
Jose Blanchet & Karthyek Murthy, 2019. "Quantifying Distributional Model Risk via Optimal Transport," Mathematics of Operations Research, INFORMS, vol. 44(2), pages 565-600, May.
Paul Glasserman & Xingbo Xu, 2014. "Robust risk measurement and model risk," Quantitative Finance, Taylor & Francis Journals, vol. 14(1), pages 29-58, January.

Citations

Cited by:

Xianhua Peng & Xiang Zhou & Bo Xiao & Yi Wu, 2024. "A Risk Sensitive Contract-unified Reinforcement Learning Approach for Option Hedging," Papers 2411.09659, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Fritzsch, Simon & Timphus, Maike & Weiß, Gregor, 2024. "Marginals versus copulas: Which account for more model risk in multivariate risk forecasting?," Journal of Banking & Finance, Elsevier, vol. 158(C).
Pascal François & Geneviève Gauthier & Frédéric Godin & Carlos Octavio Pérez Mendoza, 2024. "Enhancing Deep Hedging of Options with Implied Volatility Surface Feedback Information," Papers 2407.21138, arXiv.org.
Steven Kou & Xianhua Peng, 2016. "On the Measurement of Economic Tail Risk," Operations Research, INFORMS, vol. 64(5), pages 1056-1072, October.
Kim, Sojung & Weber, Stefan, 2022. "Simulation methods for robust risk assessment and the distorted mix approach," European Journal of Operational Research, Elsevier, vol. 298(1), pages 380-398.
Mohammed Berkhouch & Fernanda Maria Müller & Ghizlane Lakhnati & Marcelo Brutti Righi, 2022. "Deviation-Based Model Risk Measures," Computational Economics, Springer;Society for Computational Economics, vol. 59(2), pages 527-547, February.
Frongillo, Rafael M. & Kash, Ian A., 2021. "General truthfulness characterizations via convex analysis," Games and Economic Behavior, Elsevier, vol. 130(C), pages 636-662.
Carole Bernard & Silvana M. Pesenti & Steven Vanduffel, 2024. "Robust distortion risk measures," Mathematical Finance, Wiley Blackwell, vol. 34(3), pages 774-818, July.
- Carole Bernard & Silvana M. Pesenti & Steven Vanduffel, 2022. "Robust Distortion Risk Measures," Papers 2205.08850, arXiv.org, revised Mar 2023.
Lux, Thibaut & Papapantoleon, Antonis, 2019. "Model-free bounds on Value-at-Risk using extreme value information and statistical distances," Insurance: Mathematics and Economics, Elsevier, vol. 86(C), pages 73-83.
Xianhua Peng & Xiang Zhou & Bo Xiao & Yi Wu, 2024. "A Risk Sensitive Contract-unified Reinforcement Learning Approach for Option Hedging," Papers 2411.09659, arXiv.org.
Aleksandrina Goeva & Henry Lam & Huajie Qian & Bo Zhang, 2019. "Optimization-Based Calibration of Simulation Input Models," Operations Research, INFORMS, vol. 67(5), pages 1362-1382, September.
Pascal François & Geneviève Gauthier & Frédéric Godin & Carlos Octavio Pérez Mendoza, 2024. "Is the difference between deep hedging and delta hedging a statistical arbitrage?," Papers 2407.14736, arXiv.org, revised Oct 2024.
Parisa Davar & Frédéric Godin & Jose Garrido, 2024. "Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients," Papers 2406.15612, arXiv.org, revised Jun 2024.
Tobias Fissler & Yannick Hoga, 2024. "How to Compare Copula Forecasts?," Papers 2410.04165, arXiv.org.
Makam, Vaishno Devi & Millossovich, Pietro & Tsanakas, Andreas, 2021. "Sensitivity analysis with χ²-divergences," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 372-383.
Rafael Frongillo, 2022. "Quantum Information Elicitation," Papers 2203.07469, arXiv.org.
Thibaut Lux & Antonis Papapantoleon, 2016. "Model-free bounds on Value-at-Risk using extreme value information and statistical distances," Papers 1610.09734, arXiv.org, revised Nov 2018.
Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
- Kajal Lahiri & Liu Yang, 2012. "Forecasting Binary Outcomes," Discussion Papers 12-09, University at Albany, SUNY, Department of Economics.
Tobias Fissler & Silvana M. Pesenti, 2022. "Sensitivity Measures Based on Scoring Functions," Papers 2203.00460, arXiv.org, revised Jul 2022.
Knüppel, Malte & Schultefrankenfeld, Guido, 2019. "Assessing the uncertainty in central banks’ inflation outlooks," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1748-1769.
- Knüppel, Malte & Schultefrankenfeld, Guido, 2018. "Assessing the uncertainty in central banks' inflation outlooks," Discussion Papers 56/2018, Deutsche Bundesbank.
Mingbin Ben Feng & Eunhye Song, 2020. "Efficient Nested Simulation Experiment Design via the Likelihood Ratio Method," Papers 2008.13087, arXiv.org, revised May 2024.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2024-10-21 (Big Data)
NEP-CMP-2024-10-21 (Computational Economics)
NEP-RMG-2024-10-21 (Risk Management)
