IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2208.00765.html
   My bibliography  Save this paper

Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise

Author

Listed:
  • Leonardo Kanashiro Felizardo
  • Elia Matsumoto
  • Emilio Del-Moral-Hernandez

Abstract

The optimal stopping problem is a category of decision problems with a specific constrained configuration. It is relevant to various real-world applications such as finance and management. To solve the optimal stopping problem, state-of-the-art algorithms in dynamic programming, such as the least-squares Monte Carlo (LSMC), are employed. This type of algorithm relies on path simulations using only the last price of the underlying asset as a state representation. Also, the LSMC was thinking for option valuation where risk-neutral probabilities can be employed to account for uncertainty. However, the general optimal stopping problem goals may not fit the requirements of the LSMC showing auto-correlated prices. We employ a data-driven method that uses Monte Carlo simulation to train and test artificial neural networks (ANN) to solve the optimal stopping problem. Using ANN to solve decision problems is not entirely new. We propose a different architecture that uses convolutional neural networks (CNN) to deal with the dimensionality problem that arises when we transform the whole history of prices into a Markovian state. We present experiments that indicate that our proposed architecture improves results over the previous implementations under specific simulated time series function sets. Lastly, we employ our proposed method to compare the optimal exercise of the financial options problem with the LSMC algorithm. Our experiments show that our method can capture more accurate exercise opportunities when compared to the LSMC. We have outstandingly higher (above 974\% improvement) expected payoff from these exercise policies under the many Monte Carlo simulations that used the real-world return database on the out-of-sample (test) data.

Suggested Citation

  • Leonardo Kanashiro Felizardo & Elia Matsumoto & Emilio Del-Moral-Hernandez, 2022. "Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise," Papers 2208.00765, arXiv.org.
  • Handle: RePEc:arx:papers:2208.00765
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2208.00765
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Longstaff, Francis A & Schwartz, Eduardo S, 2001. "Valuing American Options by Simulation: A Simple Least-Squares Approach," The Review of Financial Studies, Society for Financial Studies, vol. 14(1), pages 113-147.
    2. Bulan, Laarni & Mayer, Christopher & Somerville, C. Tsuriel, 2009. "Irreversible investment, real options, and competition: Evidence from real estate development," Journal of Urban Economics, Elsevier, vol. 65(3), pages 237-251, May.
    3. Tze Leung Lai & Tiong Wee Lim, 2004. "Exercise Regions And Efficient Valuation Of American Lookback Options," Mathematical Finance, Wiley Blackwell, vol. 14(2), pages 249-269, April.
    4. Dirk Hackbarth & Erwan Morellec, 2008. "Stock Returns in Mergers and Acquisitions," Journal of Finance, American Finance Association, vol. 63(3), pages 1213-1252, June.
    5. Carriere, Jacques F., 1996. "Valuation of the early-exercise price for options using simulations and nonparametric regression," Insurance: Mathematics and Economics, Elsevier, vol. 19(1), pages 19-30, December.
    6. Gonzalo Cortazar & Eduardo S. Schwartz & Marcelo Salinas, 1998. "Evaluating Environmental Investments: A Real Options Approach," Management Science, INFORMS, vol. 44(8), pages 1059-1070, August.
    7. Stephane Villeneuve, 1999. "Exercise regions of American options on several assets," Finance and Stochastics, Springer, vol. 3(3), pages 295-322.
    8. Lander, Diane M. & Pinches, George E., 1998. "Challenges to the Practical Implementation of Modeling and Valuing Real Options," The Quarterly Review of Economics and Finance, Elsevier, vol. 38(3, Part 2), pages 537-567.
    9. Li, Yong, 2008. "Duration analysis of venture capital staging: A real options perspective," Journal of Business Venturing, Elsevier, vol. 23(5), pages 497-512, September.
    10. Kamrad, Bardia & Lele, Shreevardhan S. & Siddique, Akhtar & Thomas, Robert J., 2005. "Innovation diffusion uncertainty, advertising and pricing policies," European Journal of Operational Research, Elsevier, vol. 164(3), pages 829-850, August.
    11. Ludovic Goudenège & Andrea Molent & Antonino Zanette, 2020. "Machine learning for pricing American options in high-dimensional Markovian and non-Markovian models," Quantitative Finance, Taylor & Francis Journals, vol. 20(4), pages 573-591, April.
    12. Pennings, Enrico & Lint, Onno, 1997. "The option value of advanced R & D," European Journal of Operational Research, Elsevier, vol. 103(1), pages 83-94, November.
    13. Nadarajah, Selvaprabu & Margot, François & Secomandi, Nicola, 2017. "Comparison of least squares Monte Carlo methods with applications to energy real options," European Journal of Operational Research, Elsevier, vol. 256(1), pages 196-204.
    14. Broadie, Mark & Glasserman, Paul, 1997. "Pricing American-style securities using simulation," Journal of Economic Dynamics and Control, Elsevier, vol. 21(8-9), pages 1323-1352, June.
    15. Longstaff, Francis A & Schwartz, Eduardo S, 2001. "Valuing American Options by Simulation: A Simple Least-Squares Approach," University of California at Los Angeles, Anderson Graduate School of Management qt43n1k4jb, Anderson Graduate School of Management, UCLA.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bradley Sturt, 2021. "A nonparametric algorithm for optimal stopping based on robust optimization," Papers 2103.03300, arXiv.org, revised Mar 2023.
    2. Juri Hinz & Tanya Tarnopolskaya & Jeremy Yee, 2020. "Efficient algorithms of pathwise dynamic programming for decision optimization in mining operations," Annals of Operations Research, Springer, vol. 286(1), pages 583-615, March.
    3. Mark Broadie & Jerome B. Detemple, 2004. "ANNIVERSARY ARTICLE: Option Pricing: Valuation Models and Applications," Management Science, INFORMS, vol. 50(9), pages 1145-1177, September.
    4. repec:hum:wpaper:sfb649dp2006-051 is not listed on IDEAS
    5. Denis Belomestny & Grigori Milstein & Vladimir Spokoiny, 2009. "Regression methods in pricing American and Bermudan options using consumption processes," Quantitative Finance, Taylor & Francis Journals, vol. 9(3), pages 315-327.
    6. Nicholas Davey & Nicolas Langrené & Wen Chen & Jonathan R. Rhodes & Simon Dunstall & Saman Halgamuge, 2023. "Designing higher value roads to preserve species at risk by optimally controlling traffic flow," Annals of Operations Research, Springer, vol. 320(2), pages 663-693, January.
    7. Lars Stentoft, 2004. "Convergence of the Least Squares Monte Carlo Approach to American Option Valuation," Management Science, INFORMS, vol. 50(9), pages 1193-1203, September.
    8. Garcia, Diego, 2003. "Convergence and Biases of Monte Carlo estimates of American option prices using a parametric exercise rule," Journal of Economic Dynamics and Control, Elsevier, vol. 27(10), pages 1855-1879, August.
    9. A. -S. Chen & P. -F. Shen, 2003. "Computational complexity analysis of least-squares Monte Carlo (LSM) for pricing US derivatives," Applied Economics Letters, Taylor & Francis Journals, vol. 10(4), pages 223-229.
    10. Ammann, Manuel & Kind, Axel & Wilde, Christian, 2008. "Simulation-based pricing of convertible bonds," Journal of Empirical Finance, Elsevier, vol. 15(2), pages 310-331, March.
    11. Ascione, Giacomo & Mehrdoust, Farshid & Orlando, Giuseppe & Samimi, Oldouz, 2023. "Foreign Exchange Options on Heston-CIR Model Under Lévy Process Framework," Applied Mathematics and Computation, Elsevier, vol. 446(C).
    12. Calypso Herrera & Louis Paulot, 2014. "Parallel American Monte Carlo," Papers 1404.1180, arXiv.org.
    13. Pascal Létourneau & Lars Stentoft, 2019. "Bootstrapping the Early Exercise Boundary in the Least-Squares Monte Carlo Method," JRFM, MDPI, vol. 12(4), pages 1-21, December.
    14. M. Martin Boyer & Lars Stentoft, 2017. "Yes We Can (Price Derivatives on Survivor Indices)," Risk Management and Insurance Review, American Risk and Insurance Association, vol. 20(1), pages 37-62, March.
    15. Manuel Moreno & Javier Navas, 2003. "On the Robustness of Least-Squares Monte Carlo (LSM) for Pricing American Derivatives," Review of Derivatives Research, Springer, vol. 6(2), pages 107-128, May.
    16. Lee, Sangmin & Boomsma, Trine Krogh, 2022. "An approximate dynamic programming algorithm for short-term electric vehicle fleet operation under uncertainty," Applied Energy, Elsevier, vol. 325(C).
    17. Martin B. Haugh & Leonid Kogan, 2004. "Pricing American Options: A Duality Approach," Operations Research, INFORMS, vol. 52(2), pages 258-270, April.
    18. Lars Stentoft, 2013. "American option pricing using simulation with an application to the GARCH model," Chapters, in: Adrian R. Bell & Chris Brooks & Marcel Prokopczuk (ed.), Handbook of Research Methods and Applications in Empirical Finance, chapter 5, pages 114-147, Edward Elgar Publishing.
    19. Mark Broadie & Menghui Cao, 2008. "Improved lower and upper bound algorithms for pricing American options by simulation," Quantitative Finance, Taylor & Francis Journals, vol. 8(8), pages 845-861.
    20. Zbigniew Palmowski & Tomasz Serafin, 2020. "A Note on Simulation Pricing of π -Options," Risks, MDPI, vol. 8(3), pages 1-19, August.
    21. Gagliardini, Patrick & Ronchetti, Diego, 2013. "Semi-parametric estimation of American option prices," Journal of Econometrics, Elsevier, vol. 173(1), pages 57-82.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2208.00765. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.