IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2307.11685.html
   My bibliography  Save this paper

Towards Generalizable Reinforcement Learning for Trade Execution

Author

Listed:
  • Chuheng Zhang
  • Yitong Duan
  • Xiaoyu Chen
  • Jianyu Chen
  • Jian Li
  • Li Zhao

Abstract

Optimized trade execution is to sell (or buy) a given amount of assets in a given time with the lowest possible trading cost. Recently, reinforcement learning (RL) has been applied to optimized trade execution to learn smarter policies from market data. However, we find that many existing RL methods exhibit considerable overfitting which prevents them from real deployment. In this paper, we provide an extensive study on the overfitting problem in optimized trade execution. First, we model the optimized trade execution as offline RL with dynamic context (ORDC), where the context represents market variables that cannot be influenced by the trading policy and are collected in an offline manner. Under this framework, we derive the generalization bound and find that the overfitting issue is caused by large context space and limited context samples in the offline setting. Accordingly, we propose to learn compact representations for context to address the overfitting problem, either by leveraging prior knowledge or in an end-to-end manner. To evaluate our algorithms, we also implement a carefully designed simulator based on historical limit order book (LOB) data to provide a high-fidelity benchmark for different algorithms. Our experiments on the high-fidelity simulator demonstrate that our algorithms can effectively alleviate overfitting and achieve better performance.

Suggested Citation

  • Chuheng Zhang & Yitong Duan & Xiaoyu Chen & Jianyu Chen & Jian Li & Li Zhao, 2023. "Towards Generalizable Reinforcement Learning for Trade Execution," Papers 2307.11685, arXiv.org.
  • Handle: RePEc:arx:papers:2307.11685
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2307.11685
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hans Degryse & Frank De Jong & Maarten Van Ravenswaaij & Gunther Wuyts, 2005. "Aggressive Orders and the Resiliency of a Limit Order Market," Review of Finance, European Finance Association, vol. 9(2), pages 201-242.
    2. Olivier Gu'eant & Charles-Albert Lehalle & Joaquin Fernandez Tapia, 2011. "Optimal Portfolio Liquidation with Limit Orders," Papers 1106.3279, arXiv.org, revised Jul 2012.
    3. Brian Bulthuis & Julio Concha & Tim Leung & Brian Ward, 2017. "Optimal execution of limit and market orders with trade director, speed limiter, and fill uncertainty," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 4(02n03), pages 1-29, June.
    4. James Richard Cummings & Alex Frino, 2010. "Further analysis of the speed of response to large trades in interest rate futures," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 30(8), pages 705-724, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Anil Sharma & Freeman Chen & Jaesun Noh & Julio DeJesus & Mario Schlener, 2024. "Hedging and Pricing Structured Products Featuring Multiple Underlying Assets," Papers 2411.01121, arXiv.org.
    2. Chuqiao Zong & Chaojie Wang & Molei Qin & Lei Feng & Xinrun Wang & Bo An, 2024. "MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading," Papers 2406.14537, arXiv.org.
    3. Molei Qin & Shuo Sun & Wentao Zhang & Haochong Xia & Xinrun Wang & Bo An, 2023. "EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading," Papers 2309.12891, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthias Schnaubelt & Jonas Rende & Christopher Krauss, 2019. "Testing Stylized Facts of Bitcoin Limit Order Books," JRFM, MDPI, vol. 12(1), pages 1-30, February.
    2. Schnaubelt, Matthias, 2020. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," FAU Discussion Papers in Economics 05/2020, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    3. Peter Gomber & Uwe Schweickert & Erik Theissen, 2015. "Liquidity Dynamics in an Electronic Open Limit Order Book: an Event Study Approach," European Financial Management, European Financial Management Association, vol. 21(1), pages 52-78, January.
    4. Campi, Luciano & Zabaljauregui, Diego, 2020. "Optimal market making under partial information with general intensities," LSE Research Online Documents on Economics 104612, London School of Economics and Political Science, LSE Library.
    5. Fengpei Li & Vitalii Ihnatiuk & Ryan Kinnear & Anderson Schneider & Yuriy Nevmyvaka, 2022. "Do price trajectory data increase the efficiency of market impact estimation?," Papers 2205.13423, arXiv.org, revised Mar 2023.
    6. Large, Jeremy, 2011. "Estimating quadratic variation when quoted prices change by a constant increment," Journal of Econometrics, Elsevier, vol. 160(1), pages 2-11, January.
    7. Christopher Lorenz & Alexander Schied, 2013. "Drift dependence of optimal trade execution strategies under transient price impact," Finance and Stochastics, Springer, vol. 17(4), pages 743-770, October.
    8. Aur'elien Alfonsi & Alexander Schied & Florian Klock, 2013. "Multivariate transient price impact and matrix-valued positive definite functions," Papers 1310.4471, arXiv.org, revised Sep 2015.
    9. Ryan Donnelly & Zi Li, 2022. "Dynamic Inventory Management with Mean-Field Competition," Papers 2210.17208, arXiv.org.
    10. Sofiene El Aoud & Frédéric Abergel, 2015. "A stochastic control approach for options market making," Post-Print hal-01061852, HAL.
    11. Thierry Foucault & Ohad Kadan & Eugene Kandel, 2005. "Limit Order Book as a Market for Liquidity," The Review of Financial Studies, Society for Financial Studies, vol. 18(4), pages 1171-1217.
    12. Wuyts, Gunther, 2008. "The impact of liquidity shocks through the limit order book," CFS Working Paper Series 2008/53, Center for Financial Studies (CFS).
    13. Broto, Carmen & Lamas, Matías, 2020. "Is market liquidity less resilient after the financial crisis? Evidence for US Treasuries," Economic Modelling, Elsevier, vol. 93(C), pages 217-229.
    14. Hendershott, Terrence & Menkveld, Albert J., 2014. "Price pressures," Journal of Financial Economics, Elsevier, vol. 114(3), pages 405-423.
    15. N Baradel & B Bouchard & Ngoc Minh Dang, 2016. "Optimal trading with online parameters revisions," Working Papers hal-01304019, HAL.
    16. O’Sullivan, Conall & Papavassiliou, Vassilios G. & Wafula, Ronald Wekesa & Boubaker, Sabri, 2024. "New insights into liquidity resiliency," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 90(C).
    17. Philippe Bergault & David Evangelista & Olivier Gu'eant & Douglas Vieira, 2018. "Closed-form approximations in multi-asset market making," Papers 1810.04383, arXiv.org, revised Sep 2022.
    18. Luca Lalor & Anatoliy Swishchuk, 2024. "Market Simulation under Adverse Selection," Papers 2409.12721, arXiv.org.
    19. Adam Blazejewski & Richard Coggins, 2004. "A piecewise linear model for trade sign inference," Finance 0412012, University Library of Munich, Germany.
    20. Sensoy, Ahmet, 2017. "Firm size, ownership structure, and systematic liquidity risk: The case of an emerging market," Journal of Financial Stability, Elsevier, vol. 31(C), pages 62-80.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2307.11685. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.