IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2206.10736.html
   My bibliography  Save this paper

Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO

Author

Listed:
  • Jin Fang
  • Jiacheng Weng
  • Yi Xiang
  • Xinwen Zhang

Abstract

A novel framework for solving the optimal execution and placement problems using reinforcement learning (RL) with imitation was proposed. The RL agents trained from the proposed framework consistently outperformed the industry benchmark time-weighted average price (TWAP) strategy in execution cost and showed great generalization across out-of-sample trading dates and tickers. The impressive performance was achieved from three aspects. First, our RL network architecture called Dual-window Denoise PPO enabled efficient learning in a noisy market environment. Second, a reward scheme with imitation learning was designed, and a comprehensive set of market features was studied. Third, our flexible action formulation allowed the RL agent to tackle optimal execution and placement collectively resulting in better performance than solving individual problems separately. The RL agent's performance was evaluated in our multi-agent realistic historical limit order book simulator in which price impact was accurately assessed. In addition, ablation studies were also performed, confirming the superiority of our framework.

Suggested Citation

  • Jin Fang & Jiacheng Weng & Yi Xiang & Xinwen Zhang, 2022. "Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO," Papers 2206.10736, arXiv.org.
  • Handle: RePEc:arx:papers:2206.10736
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2206.10736
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Michael Karpe & Jin Fang & Zhongyao Ma & Chen Wang, 2020. "Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation," Papers 2006.05574, arXiv.org, revised Sep 2020.
    2. Obizhaeva, Anna A. & Wang, Jiang, 2013. "Optimal trading strategy and supply/demand dynamics," Journal of Financial Markets, Elsevier, vol. 16(1), pages 1-32.
    3. Charles Cao & Oliver Hansch & Xiaoxin Wang, 2009. "The information content of an open limit‐order book," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 29(1), pages 16-41, January.
    4. Jinliang Li & Chunchi Wu, 2006. "Daily Return Volatility, Bid-Ask Spreads, and Information Flow: Analyzing the Information Content of Volume," The Journal of Business, University of Chicago Press, vol. 79(5), pages 2697-2740, September.
    5. Schnaubelt, Matthias, 2022. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," European Journal of Operational Research, Elsevier, vol. 296(3), pages 993-1006.
    6. Huang, Roger D. & Stoll, Hans R., 1996. "Dealer versus auction markets: A paired comparison of execution costs on NASDAQ and the NYSE," Journal of Financial Economics, Elsevier, vol. 41(3), pages 313-357, July.
    7. He, Hua & Mamaysky, Harry, 2005. "Dynamic trading policies with price impact," Journal of Economic Dynamics and Control, Elsevier, vol. 29(5), pages 891-930, May.
    8. Ranaldo, Angelo, 2004. "Order aggressiveness in limit order book markets," Journal of Financial Markets, Elsevier, vol. 7(1), pages 53-74, January.
    9. Cohen, Kalman J & Maier, Steven F & Schwartz, Robert A & Whitcomb, David K, 1981. "Transaction Costs, Order Placement Strategy, and Existence of the Bid-Ask Spread," Journal of Political Economy, University of Chicago Press, vol. 89(2), pages 287-305, April.
    10. Bertsimas, Dimitris & Lo, Andrew W., 1998. "Optimal control of execution costs," Journal of Financial Markets, Elsevier, vol. 1(1), pages 1-50, April.
    11. Avellaneda, Marco & Reed, Josh & Stoikov, Sasha, 2011. "Forecasting prices from level-I quotes in the presence of hidden liquidity," Algorithmic Finance, IOS Press, vol. 1(1), pages 35-43.
    12. James P. Weston, 2000. "Competition on the Nasdaq and the Impact of Recent Market Reforms," Journal of Finance, American Finance Association, vol. 55(6), pages 2565-2598, December.
    13. Bessembinder, Hendrik, 1997. "The degree of price resolution and equity trading costs," Journal of Financial Economics, Elsevier, vol. 45(1), pages 9-34, July.
    14. Lee, Charles M C & Ready, Mark J, 1991. "Inferring Trade Direction from Intraday Data," Journal of Finance, American Finance Association, vol. 46(2), pages 733-746, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dimitri Vayanos & Jiang Wang, 2012. "Market Liquidity -- Theory and Empirical Evidence," NBER Working Papers 18251, National Bureau of Economic Research, Inc.
    2. Schnaubelt, Matthias, 2022. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," European Journal of Operational Research, Elsevier, vol. 296(3), pages 993-1006.
    3. Olivier Guéant, 2016. "The Financial Mathematics of Market Liquidity: From Optimal Execution to Market Making," Post-Print hal-01393136, HAL.
    4. Siu, Chi Chung & Guo, Ivan & Zhu, Song-Ping & Elliott, Robert J., 2019. "Optimal execution with regime-switching market resilience," Journal of Economic Dynamics and Control, Elsevier, vol. 101(C), pages 17-40.
    5. Vayanos, Dimitri & Wang, Jiang, 2013. "Market Liquidity—Theory and Empirical Evidence ," Handbook of the Economics of Finance, in: G.M. Constantinides & M. Harris & R. M. Stulz (ed.), Handbook of the Economics of Finance, volume 2, chapter 0, pages 1289-1361, Elsevier.
    6. Murphy Jun Jie Lee, 2013. "The Microstructure of Trading Processes on the Singapore Exchange," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 2-2013, January-A.
    7. Fishe, Raymond P. H. & Robe, Michel A., 2004. "The impact of illegal insider trading in dealer and specialist markets: evidence from a natural experiment," Journal of Financial Economics, Elsevier, vol. 71(3), pages 461-488, March.
    8. Cebiroğlu, Gökhan & Horst, Ulrich, 2015. "Optimal order display in limit order markets with liquidity competition," Journal of Economic Dynamics and Control, Elsevier, vol. 58(C), pages 81-100.
    9. Schnaubelt, Matthias, 2020. "Deep reinforcement learning for the optimal placement of cryptocurrency limit orders," FAU Discussion Papers in Economics 05/2020, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    10. Murphy Jun Jie Lee, 2013. "The Microstructure of Trading Processes on the Singapore Exchange," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 4, July-Dece.
    11. Sim, Min Kyu & Deng, Shijie, 2020. "Estimation of level-I hidden liquidity using the dynamics of limit order-book," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 540(C).
    12. Olivier Guéant & Charles-Albert Lehalle, 2015. "General Intensity Shapes In Optimal Liquidation," Mathematical Finance, Wiley Blackwell, vol. 25(3), pages 457-495, July.
    13. Alexander, Gordon J. & Peterson, Mark A., 2007. "An analysis of trade-size clustering and its relation to stealth trading," Journal of Financial Economics, Elsevier, vol. 84(2), pages 435-471, May.
    14. Yamamoto, Ryuichi, 2019. "Dynamic Predictor Selection And Order Splitting In A Limit Order Market," Macroeconomic Dynamics, Cambridge University Press, vol. 23(5), pages 1757-1792, July.
    15. Boulatov, Alex & Hatch, Brian C. & Johnson, Shane A. & Lei, Adam Y.C., 2009. "Dealer attention, the speed of quote adjustment to information, and net dealer revenue," Journal of Banking & Finance, Elsevier, vol. 33(8), pages 1531-1542, August.
    16. Wei Cui & Anthony Brabazon & Michael O'Neill, 2011. "Dynamic trade execution: a grammatical evolution approach," International Journal of Financial Markets and Derivatives, Inderscience Enterprises Ltd, vol. 2(1/2), pages 4-31.
    17. Chakravarty, Sugato & Harris, Fredreck H. deB. & Wood, Roger A., 2001. "Do Bid-Ask Spreads or Bid and Ask Depths Convey New Information First?," Purdue University Economics Working Papers 1149, Purdue University, Department of Economics.
    18. Comerton-Forde, Carole & Tang, Kar Mei, 2009. "Anonymity, liquidity and fragmentation," Journal of Financial Markets, Elsevier, vol. 12(3), pages 337-367, August.
    19. Jie-Haun Lee & Whei-May Fan, 2014. "Investors’ perception of corporate governance: a spillover effect of Taiwan corporate scandals," Review of Quantitative Finance and Accounting, Springer, vol. 43(1), pages 97-119, July.
    20. Martin D. Gould & Mason A. Porter & Stacy Williams & Mark McDonald & Daniel J. Fenn & Sam D. Howison, 2010. "Limit Order Books," Papers 1012.0349, arXiv.org, revised Apr 2013.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2206.10736. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.