Deep learning for high-dimensional continuous-time stochastic optimal control without explicit solution

Author

Dupret, Jean-Loup (Université catholique de Louvain, LIDAM/ISBA, Belgium)
Hainaut, Donatien (Université catholique de Louvain, LIDAM/ISBA, Belgium)

Abstract

This paper introduces the GPI-PINN algorithm, a novel numerical scheme for solving continuous-time stochastic optimal control problems in high dimensions when the optimal control does not admit an explicit solution. Combining Physics-Informed Neural Networks with an Actor-Critic structure built upon the Generalized Policy Iteration technique, this successive deep learning algorithm employs two separate neural networks to approximate both the value function and the multidimensional optimal control. This way, the GPI-PINN algorithm manages to achieve a global approximation of the optimal solution across all time and space, which can be evaluated online rapidly. The optimality and convergence of the scheme are demonstrated theoretically and its accuracy and efficacy are shown empirically based on two numerical examples. In particular, we generalize the standard Almgren-Chriss model arising from optimal liquidation in finance by allowing for a price impact model with fully nonlinear temporary and permanent impact functions and by considering a multidimensional setting with numerous co-integrated assets.
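For context on the baseline the abstract generalizes: in the standard Almgren-Chriss setting with linear temporary impact, the continuous-time liquidation trajectory is known in closed form, x(t) = x0 * sinh(kappa * (T - t)) / sinh(kappa * T) with kappa = sqrt(lam * sigma^2 / eta). The sketch below evaluates that textbook formula only; it is not the GPI-PINN algorithm, and the function name, parameter names (`lam` for risk aversion, `sigma` for volatility, `eta` for the temporary impact coefficient) and numerical values are illustrative assumptions, not taken from the paper.

```python
import math


def almgren_chriss_holdings(x0, horizon, lam, sigma, eta, n_steps=10):
    """Remaining inventory on a uniform time grid for the classical
    linear-impact Almgren-Chriss problem (continuous-time form).

    All argument names are hypothetical; they follow common textbook
    notation rather than the notation of the paper.
    """
    kappa = math.sqrt(lam * sigma ** 2 / eta)
    times = [horizon * k / n_steps for k in range(n_steps + 1)]
    if kappa == 0.0:
        # No risk aversion: the trajectory degenerates to a straight line.
        return [x0 * (1.0 - t / horizon) for t in times]
    denom = math.sinh(kappa * horizon)
    return [x0 * math.sinh(kappa * (horizon - t)) / denom for t in times]


# Illustrative (made-up) parameters: liquidate 1,000,000 shares over one day.
holdings = almgren_chriss_holdings(
    x0=1_000_000, horizon=1.0, lam=2e-6, sigma=0.95, eta=2.5e-6, n_steps=5
)
```

The trajectory starts at the full position, ends at zero, and decreases monotonically. A closed form of this kind is what is no longer available once the impact functions are fully nonlinear and several co-integrated assets are traded jointly, which is the case the abstract targets with a numerical scheme.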

Suggested Citation

Dupret, Jean-Loup & Hainaut, Donatien, 2024. "Deep learning for high-dimensional continuous-time stochastic optimal control without explicit solution," LIDAM Discussion Papers ISBA 2024016, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).

Handle: RePEc:aiz:louvad:2024016

References listed on IDEAS

Jim Gatheral, 2010. "No-dynamic-arbitrage and market impact," Quantitative Finance, Taylor & Francis Journals, vol. 10(7), pages 749-759.
Glau, Kathrin & Wunderlich, Linus, 2022. "The deep parametric PDE method and applications to option pricing," Applied Mathematics and Computation, Elsevier, vol. 432(C).
Robert Almgren, 2003. "Optimal execution with nonlinear impact functions and trading-enhanced risk," Applied Mathematical Finance, Taylor & Francis Journals, vol. 10(1), pages 1-18.
Justin Sirignano & Konstantinos Spiliopoulos, 2017. "DGM: A deep learning algorithm for solving partial differential equations," Papers 1708.07469, arXiv.org, revised Sep 2018.
Hainaut, Donatien, 2023. "Valuation of guaranteed minimum accumulation benefits (GMAB) with physics inspired neural networks," LIDAM Discussion Papers ISBA 2023029, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Gianbiagio Curato & Jim Gatheral & Fabrizio Lillo, 2014. "Optimal execution with nonlinear transient market impact," Papers 1412.4839, arXiv.org.
Olivier Guéant & Charles-Albert Lehalle, 2015. "General Intensity Shapes In Optimal Liquidation," Mathematical Finance, Wiley Blackwell, vol. 25(3), pages 457-495, July.
- Olivier Guéant & Charles-Albert Lehalle, 2012. "General Intensity Shapes in Optimal Liquidation," Papers 1204.0148, arXiv.org, revised Jun 2013.
Fengpei Li & Vitalii Ihnatiuk & Ryan Kinnear & Anderson Schneider & Yuriy Nevmyvaka, 2022. "Do price trajectory data increase the efficiency of market impact estimation?," Papers 2205.13423, arXiv.org, revised Mar 2023.
J. Doyne Farmer & Austin Gerig & Fabrizio Lillo & Henri Waelbroeck, 2013. "How efficiency shapes market impact," Quantitative Finance, Taylor & Francis Journals, vol. 13(11), pages 1743-1758, November.
- J. Doyne Farmer & Austin Gerig & Fabrizio Lillo & Henri Waelbroeck, 2011. "How efficiency shapes market impact," Papers 1102.5457, arXiv.org, revised Sep 2013.
Seungki Min & Costis Maglaras & Ciamac C. Moallemi, 2018. "Cross-Sectional Variation of Intraday Liquidity, Cross-Impact, and their Effect on Portfolio Execution," Papers 1811.05524, arXiv.org.
Olivier Guéant, 2016. "The Financial Mathematics of Market Liquidity: From Optimal Execution to Market Making," Post-Print hal-01393136, HAL.
Olivier Guéant, 2013. "Permanent market impact can be nonlinear," Papers 1305.0413, arXiv.org, revised Mar 2014.
Antje Fruth & Torsten Schöneborn & Mikhail Urusov, 2014. "Optimal Trade Execution And Price Manipulation In Order Books With Time-Varying Liquidity," Mathematical Finance, Wiley Blackwell, vol. 24(4), pages 651-695, October.
- Antje Fruth & Torsten Schoeneborn & Mikhail Urusov, 2011. "Optimal trade execution and price manipulation in order books with time-varying liquidity," Papers 1109.2631, arXiv.org.
Eyal Neuman & Alexander Schied & Chengguo Weng & Xiaole Xue, 2020. "A central bank strategy for defending a currency peg," Papers 2008.00470, arXiv.org.
Hyeong-Ohk Bae & Seunggu Kang & Muhyun Lee, 2024. "Option Pricing and Local Volatility Surface by Physics-Informed Neural Network," Computational Economics, Springer;Society for Computational Economics, vol. 64(5), pages 3143-3159, November.
Nikolay A. Andreev, 2015. "Worst-Case Approach To Strategic Optimal Portfolio Selection Under Transaction Costs And Trading Limits," HSE Working papers WP BRP 45/FE/2015, National Research University Higher School of Economics.
Charles-Albert Lehalle & Charafeddine Mouzouni, 2019. "A Mean Field Game of Portfolio Trading and Its Consequences On Perceived Correlations," Papers 1902.09606, arXiv.org.
- Charles-Albert Lehalle & Charafeddine Mouzouni, 2019. "A mean field game of portfolio trading and its consequences on perceived correlations," Working Papers hal-02003143, HAL.
M. Schneider & F. Lillo, 2019. "Cross-impact and no-dynamic-arbitrage," Quantitative Finance, Taylor & Francis Journals, vol. 19(1), pages 137-154, January.
- Michael Schneider & Fabrizio Lillo, 2016. "Cross-impact and no-dynamic-arbitrage," Papers 1612.07742, arXiv.org, revised Aug 2017.
Sadoghi, Amirhossein & Vecer, Jan, 2022. "Optimal liquidation problem in illiquid markets," European Journal of Operational Research, Elsevier, vol. 296(3), pages 1050-1066.
Amirhossein Sadoghi & Jan Vecer, 2022. "Optimal liquidation problem in illiquid markets," Post-Print hal-03696768, HAL.
Donatien Hainaut & Alex Casas, 2024. "Option pricing in the Heston model with physics inspired neural networks," Annals of Finance, Springer, vol. 20(3), pages 353-376, September.
Alexander Schied & Elias Strehle, 2017. "On the minimizers of energy forms with completely monotone kernel," Papers 1706.04844, arXiv.org, revised Aug 2018.
Arne Lokka & Junwei Xu, 2020. "Optimal liquidation trajectories for the Almgren-Chriss model with Levy processes," Papers 2002.03376, arXiv.org, revised Sep 2020.
Olivier Guéant & Jean-Michel Lasry & Jiang Pu, 2014. "A convex duality method for optimal liquidation with participation constraints," Papers 1407.4614, arXiv.org, revised Dec 2014.
Philippe Bergault & Fayçal Drissi & Olivier Guéant, 2021. "Multi-asset optimal execution and statistical arbitrage strategies under Ornstein-Uhlenbeck dynamics," Papers 2103.13773, arXiv.org, revised Mar 2022.

More about this item

Keywords

Machine learning; Stochastic optimal control; Deep learning; Physics-Informed Neural Networks; Optimal liquidation

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2024-11-11 (Big Data)
NEP-CMP-2024-11-11 (Computational Economics)
