IDEAS home Printed from https://ideas.repec.org/p/aiz/louvad/2024016.html
   My bibliography  Save this paper

Deep learning for high-dimensional continuous-time stochastic optimal control without explicit solution

Author

Listed:
  • Dupret, Jean-Loup

    (Université catholique de Louvain, LIDAM/ISBA, Belgium)

  • Hainaut, Donatien

    (Université catholique de Louvain, LIDAM/ISBA, Belgium)

Abstract

This paper introduces the GPI-PINN algorithm, a novel numerical scheme for solving continuous-time stochastic optimal control problems in high dimensions when the optimal control does not admit an explicit solution. Combining Physics-Informed Neural Networks with an Actor-Critic structure built upon the Generalized Policy Iteration technique, this successive deep learning algorithm employs two separate neural networks to approximate both the value function and the multidimensional optimal control. This way, the GPI-PINN algorithm manages to achieve a global approximation of the optimal solution across all time and space, which can be evaluated online rapidly. The optimality and convergence of the scheme are demonstrated theoretically and its accuracy and efficacy are shown empirically based on two numerical examples. In particular, we generalize the standard Almgren-Chriss model arising from optimal liquidation in finance by allowing for a price impact model with fully nonlinear temporary and permanent impact functions and by considering a multidimensional setting with numerous co-integrated assets.

Suggested Citation

  • Dupret, Jean-Loup & Hainaut, Donatien, 2024. "Deep learning for high-dimensional continuous-time stochastic optimal control without explicit solution," LIDAM Discussion Papers ISBA 2024016, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
  • Handle: RePEc:aiz:louvad:2024016
    as

    Download full text from publisher

    File URL: https://dial.uclouvain.be/pr/boreal/en/object/boreal%3A287848/datastream/PDF_01/view
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Glau, Kathrin & Wunderlich, Linus, 2022. "The deep parametric PDE method and applications to option pricing," Applied Mathematics and Computation, Elsevier, vol. 432(C).
    2. Robert Almgren, 2003. "Optimal execution with nonlinear impact functions and trading-enhanced risk," Applied Mathematical Finance, Taylor & Francis Journals, vol. 10(1), pages 1-18.
    3. Justin Sirignano & Konstantinos Spiliopoulos, 2017. "DGM: A deep learning algorithm for solving partial differential equations," Papers 1708.07469, arXiv.org, revised Sep 2018.
    4. Jim Gatheral, 2010. "No-dynamic-arbitrage and market impact," Quantitative Finance, Taylor & Francis Journals, vol. 10(7), pages 749-759.
    5. Hainaut, Donatien, 2023. "Valuation of guaranteed minimum accumulation benefits (GMAB) with physics inspired neural networks," LIDAM Discussion Papers ISBA 2023029, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gianbiagio Curato & Jim Gatheral & Fabrizio Lillo, 2014. "Optimal execution with nonlinear transient market impact," Papers 1412.4839, arXiv.org.
    2. Olivier Guéant & Charles-Albert Lehalle, 2015. "General Intensity Shapes In Optimal Liquidation," Mathematical Finance, Wiley Blackwell, vol. 25(3), pages 457-495, July.
    3. Fengpei Li & Vitalii Ihnatiuk & Ryan Kinnear & Anderson Schneider & Yuriy Nevmyvaka, 2022. "Do price trajectory data increase the efficiency of market impact estimation?," Papers 2205.13423, arXiv.org, revised Mar 2023.
    4. Charles-Albert Lehalle & Charafeddine Mouzouni, 2019. "A mean field game of portfolio trading and its consequences on perceived correlations," Working Papers hal-02003143, HAL.
    5. M. Schneider & F. Lillo, 2019. "Cross-impact and no-dynamic-arbitrage," Quantitative Finance, Taylor & Francis Journals, vol. 19(1), pages 137-154, January.
    6. Sadoghi, Amirhossein & Vecer, Jan, 2022. "Optimal liquidation problem in illiquid markets," European Journal of Operational Research, Elsevier, vol. 296(3), pages 1050-1066.
    7. Amirhossein Sadoghi & Jan Vecer, 2022. "Optimal liquidation problem in illiquid markets," Post-Print hal-03696768, HAL.
    8. J. Doyne Farmer & Austin Gerig & Fabrizio Lillo & Henri Waelbroeck, 2013. "How efficiency shapes market impact," Quantitative Finance, Taylor & Francis Journals, vol. 13(11), pages 1743-1758, November.
    9. Donatien Hainaut & Alex Casas, 2024. "Option pricing in the Heston model with physics inspired neural networks," Annals of Finance, Springer, vol. 20(3), pages 353-376, September.
    10. Antje Fruth & Torsten Schöneborn & Mikhail Urusov, 2014. "Optimal Trade Execution And Price Manipulation In Order Books With Time-Varying Liquidity," Mathematical Finance, Wiley Blackwell, vol. 24(4), pages 651-695, October.
    11. Seungki Min & Costis Maglaras & Ciamac C. Moallemi, 2018. "Cross-Sectional Variation of Intraday Liquidity, Cross-Impact, and their Effect on Portfolio Execution," Papers 1811.05524, arXiv.org.
    12. Olivier Guéant, 2016. "The Financial Mathematics of Market Liquidity: From Optimal Execution to Market Making," Post-Print hal-01393136, HAL.
    13. Olivier Gu'eant, 2013. "Permanent market impact can be nonlinear," Papers 1305.0413, arXiv.org, revised Mar 2014.
    14. Alexander Schied & Elias Strehle, 2017. "On the minimizers of energy forms with completely monotone kernel," Papers 1706.04844, arXiv.org, revised Aug 2018.
    15. Arne Lokka & Junwei Xu, 2020. "Optimal liquidation trajectories for the Almgren-Chriss model with Levy processes," Papers 2002.03376, arXiv.org, revised Sep 2020.
    16. Olivier Gu'eant & Jean-Michel Lasry & Jiang Pu, 2014. "A convex duality method for optimal liquidation with participation constraints," Papers 1407.4614, arXiv.org, revised Dec 2014.
    17. Philippe Bergault & Fayc{c}al Drissi & Olivier Gu'eant, 2021. "Multi-asset optimal execution and statistical arbitrage strategies under Ornstein-Uhlenbeck dynamics," Papers 2103.13773, arXiv.org, revised Mar 2022.
    18. Olivier Gu'eant, 2012. "Optimal execution and block trade pricing: a general framework," Papers 1210.6372, arXiv.org, revised Dec 2014.
    19. Nico Achtsis & Dirk Nuyens, 2013. "A Monte Carlo method for optimal portfolio executions," Papers 1312.5919, arXiv.org.
    20. Samuel Drapeau & Peng Luo & Alexander Schied & Dewen Xiong, 2019. "An FBSDE approach to market impact games with stochastic parameters," Papers 2001.00622, arXiv.org.

    More about this item

    Keywords

    Machine learning ; Stochastic optimal control ; Deep learning ; Physics-Informed Neural Networks ; Optimal liquidation;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aiz:louvad:2024016. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Nadja Peiffer (email available below). General contact details of provider: https://edirc.repec.org/data/isuclbe.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.