Markov Decision Processes under Model Uncertainty

My bibliography Save this paper

Markov Decision Processes under Model Uncertainty

Author

Listed:

Ariel Neufeld
Julian Sester
Mario v{S}iki'c

Registered:

Abstract

We introduce a general framework for Markov decision problems under model uncertainty in a discrete-time infinite horizon setting. By providing a dynamic programming principle we obtain a local-to-global paradigm, namely solving a local, i.e., a one time-step robust optimization problem leads to an optimizer of the global (i.e. infinite time-steps) robust stochastic optimal control problem, as well as to a corresponding worst-case measure. Moreover, we apply this framework to portfolio optimization involving data of the S&P 500. We present two different types of ambiguity sets; one is fully data-driven given by a Wasserstein-ball around the empirical measure, the second one is described by a parametric set of multivariate normal distributions, where the corresponding uncertainty sets of the parameters are estimated from the data. It turns out that in scenarios where the market is volatile or bearish, the optimal portfolio strategies from the corresponding robust optimization problem outperforms the ones without model uncertainty, showcasing the importance of taking model uncertainty into account.

Suggested Citation

Ariel Neufeld & Julian Sester & Mario v{S}iki'c, 2022. "Markov Decision Processes under Model Uncertainty," Papers 2206.06109, arXiv.org, revised Jan 2023.

Handle: RePEc:arx:papers:2206.06109

Download full text from publisher

References listed on IDEAS

Srisuma, Sorawoot & Linton, Oliver, 2012. "Semiparametric estimation of Markov decision processes with continuous state space," Journal of Econometrics, Elsevier, vol. 166(2), pages 320-341.
- Oliver Linton & Sorawoot Srisuma, 2010. "Semiparametric Estimation of Markov Decision Processeswith Continuous State Space," STICERD - Econometrics Paper Series 550, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Linton, Oliver & Srisuma, Sorawoot, 2010. "Semiparametric estimation of Markov decision processeswith continuous state space," LSE Research Online Documents on Economics 58187, London School of Economics and Political Science, LSE Library.
Victor Aguirregabiria & Pedro Mira, 2002. "Swapping the Nested Fixed Point Algorithm: A Class of Estimators for Discrete Markov Decision Models," Econometrica, Econometric Society, vol. 70(4), pages 1519-1543, July.
- Victor Aguirregabiria & Pedro Mira, 1999. "Swapping the Nested Fixed-Point Algorithm: a Class of Estimators for Discrete Markov Decision Models," Computing in Economics and Finance 1999 332, Society for Computational Economics.
- Víctor Aguirregabiria & Pedro Mira, 1999. "Swapping the Nested Fixed Point Algorithm: A Class of Estimators for Discrete Markov Decision Models," Working Papers wp1999_9904, CEMFI.
Nicole Bäuerle & Ulrich Rieder, 2009. "MDP algorithms for portfolio optimization problems in pure jump markets," Finance and Stochastics, Springer, vol. 13(4), pages 591-611, September.
Ariel Neufeld & Julian Sester & Daiying Yin, 2022. "Detecting data-driven robust statistical arbitrage strategies with deep neural networks," Papers 2203.03179, arXiv.org, revised Feb 2024.
Francesco Bertoluzzo & Marco Corazza, 2012. "Reinforcement Learning for automatic financial trading: Introduction and some applications," Working Papers 2012:33, Department of Economics, University of Venice "Ca' Foscari", revised 2012.
Huan Xu & Shie Mannor, 2012. "Distributionally Robust Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 37(2), pages 288-300, May.
Stephen Boyd & Enzo Busseti & Steven Diamond & Ronald N. Kahn & Kwangmoo Koh & Peter Nystrup & Jan Speth, 2017. "Multi-Period Trading via Convex Optimization," Papers 1705.00109, arXiv.org.
Charalambos D. Aliprantis & Kim C. Border, 2006. "Infinite Dimensional Analysis," Springer Books, Springer, edition 0, number 978-3-540-29587-7, March.
Angelos Filos, 2019. "Reinforcement Learning for Portfolio Management," Papers 1909.09571, arXiv.org.
Jay Cao & Jacky Chen & John Hull & Zissis Poulos, 2021. "Deep Hedging of Derivatives Using Reinforcement Learning," Papers 2103.16409, arXiv.org.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Ariel Neufeld & Matthew Ng Cheng En & Ying Zhang, 2024. "Robust SGLD algorithm for solving non-convex distributionally robust optimisation problems," Papers 2403.09532, arXiv.org, revised Mar 2025.
Ariel Neufeld & Julian Sester, 2024. "Non-concave distributionally robust stochastic control in a discrete time finite horizon setting," Papers 2404.05230, arXiv.org.
Marlon Moresco & M'elina Mailhot & Silvana M. Pesenti, 2023. "Uncertainty Propagation and Dynamic Robust Risk Measures," Papers 2308.12856, arXiv.org, revised Feb 2024.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Ariel Neufeld & Julian Sester & Mario Šikić, 2023. "Markov decision processes under model uncertainty," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 618-665, July.
Campi, Luciano & Zabaljauregui, Diego, 2020. "Optimal market making under partial information with general intensities," LSE Research Online Documents on Economics 104612, London School of Economics and Political Science, LSE Library.
Hiroyuki Kasahara & Katsumi Shimotsu, 2018. "Estimation of Discrete Choice Dynamic Programming Models," The Japanese Economic Review, Japanese Economic Association, vol. 69(1), pages 28-58, March.
- Hiroyuki Kasahara & Katsumi Shimotsu, 2018. "Estimation of Discrete Choice Dynamic Programming Models," The Japanese Economic Review, Springer, vol. 69(1), pages 28-58, March.
Jay Lu & Yao Luo & Kota Saito & Yi Xin, 2024. "Did Harold Zuercher Have Time-Separable Preferences?," Papers 2406.07809, arXiv.org.
Victor Aguirregabiria & Allan Collard-Wexler & Stephen P. Ryan, 2021. "Dynamic Games in Empirical Industrial Organization," NBER Working Papers 29291, National Bureau of Economic Research, Inc.
- Victor Aguirregabiria & Allan Collard-Wexler & Stephen P. Ryan, 2021. "Dynamic Games in Empirical Industrial Organization," Papers 2109.01725, arXiv.org, revised Sep 2021.
- Aguirregabiria, Victor & Collard-Wexler, Allan & Ryan, Stephen, 2021. "Dynamic Games in Empirical Industrial Organization," CEPR Discussion Papers 16514, C.E.P.R. Discussion Papers.
- Victor Aguirregabiria & Allan Collard-Wexler & Stephen P. Ryan, 2021. "Dynamic Games in Empirical Industrial Organization," Working Papers tecipa-706, University of Toronto, Department of Economics.
Jason R. Blevins & Wei Shi & Donald R. Haurin & Stephanie Moulton, 2020. "A Dynamic Discrete Choice Model Of Reverse Mortgage Borrower Behavior," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 61(4), pages 1437-1477, November.
Taisuke Otsu & Martin Pesendorfer, 2021. "Equilibrium multiplicity in dynamic games: testing and estimation," STICERD - Econometrics Paper Series 618, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Otsu, Taisuke & Pesendorfer, Martin, 2023. "Equilibrium multiplicity in dynamic games: testing and estimation," LSE Research Online Documents on Economics 113588, London School of Economics and Political Science, LSE Library.
repec:spo:wpmain:info:hdl:2441/7svo6civd6959qvmn4965cth1d is not listed on IDEAS
Khai Xiang Chiong & Alfred Galichon & Matt Shum, 2021. "Duality in dynamic discrete-choice models," Papers 2102.06076, arXiv.org, revised Feb 2021.
Ruan Pretorius & Terence van Zyl, 2022. "Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management," Papers 2203.11318, arXiv.org.
Fabio A. Miessi Sanches & Daniel Silva Junior, Sorawoot Srisuma, 2014. "Ordinary Least Squares Estimation for a Dynamic Game," Working Papers, Department of Economics 2014_19, University of São Paulo (FEA-USP), revised 23 Feb 2015.
Khai Chiong & Alfred Galichon & Matt Shum, 2015. "Duality in Dynamic Discrete Choice Models," Post-Print hal-03568184, HAL.
Diego Zabaljauregui & Luciano Campi, 2019. "Optimal market making under partial information with general intensities," Papers 1902.01157, arXiv.org, revised Apr 2020.
Otero, Karina V., 2016. "Nonparametric identification of dynamic multinomial choice games: unknown payoffs and shocks without interchangeability," MPRA Paper 86784, University Library of Munich, Germany.
Taisuke Otsu & Martin Pesendorfer, 2023. "Equilibrium multiplicity in dynamic games: Testing and estimation," The Econometrics Journal, Royal Economic Society, vol. 26(1), pages 26-42.
repec:hal:spmain:info:hdl:2441/7svo6civd6959qvmn4965cth1d is not listed on IDEAS
Diego Zabaljauregui, 2020. "Optimal market making under partial information and numerical methods for impulse control games with applications," Papers 2009.06521, arXiv.org.
Kaido, Hiroaki, 2017. "Asymptotically Efficient Estimation Of Weighted Average Derivatives With An Interval Censored Variable," Econometric Theory, Cambridge University Press, vol. 33(5), pages 1218-1241, October.
- Hiroaki Kaido, 2013. "Asymptotically Efficient Estimation of Weighted Average Derivatives with an Inverval Censored Variable," Boston University - Department of Economics - Working Papers Series 2013-022, Boston University - Department of Economics.
- Hiroaki Kaido, 2014. "Asymptotically efficient estimation of weighted average derivatives with an interval censored variable," CeMMAP working papers 03/14, Institute for Fiscal Studies.
- Hiroaki Kaido, 2014. "Asymptotically efficient estimation of weighted average derivatives with an interval censored variable," CeMMAP working papers CWP03/14, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Victor Aguirregabiria, 2006. "Another Look at the Identification of Dynamic Discrete Decision Processes: With an Application to Retirement Behavior," 2006 Meeting Papers 169, Society for Economic Dynamics.
- Victor Aguirregabiria, 2007. "Another Look at the Identification of Dynamic Discrete Decision Processes: With an Application to Retirement Behavior," Working Papers tecipa-282, University of Toronto, Department of Economics.
Andrea Attar & Thomas Mariotti & François Salanié, 2021. "Entry-Proofness and Discriminatory Pricing under Adverse Selection," American Economic Review, American Economic Association, vol. 111(8), pages 2623-2659, August.
- Attar, Andrea & Mariotti, Thomas & Salanié, François, 2017. "Entry-Proofness and Discriminatory Pricing under Adverse Selection," TSE Working Papers 17-788, Toulouse School of Economics (TSE), revised Jan 2021.
- Andrea Attar & Thomas Mariotti & François Salanié, 2021. "Entry-proofness and discriminatory pricing under adverse selection," Post-Print hal-03353054, HAL.
- Andrea Attar & Thomas Mariotti & François Salanié, 2021. "Entry-proofness and discriminatory pricing under adverse selection," Working Papers hal-03485384, HAL.
Maria Casanova-Rivas, 2008. "Dynamic Complementarities: A Computational and Empirical Analysis of Couples' Retirement Decisions," 2008 Meeting Papers 1073, Society for Economic Dynamics.
Askoura, Youcef & Billot, Antoine, 2021. "Social decision for a measure society," Journal of Mathematical Economics, Elsevier, vol. 94(C).
- Youcef Askoura & Antoine Billot, 2021. "Social decision for a measure society," Post-Print hal-04120433, HAL.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2206.06109. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Markov Decision Processes under Model Uncertainty

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data