IDEAS home Printed from https://ideas.repec.org/a/eee/tefoso/v198y2024ics0040162523006297.html
   My bibliography  Save this article

Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Author

Listed:
  • Cui, Tianxiang
  • Du, Nanjiang
  • Yang, Xiaoying
  • Ding, Shusheng

Abstract

Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors’ appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data-driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.

Suggested Citation

  • Cui, Tianxiang & Du, Nanjiang & Yang, Xiaoying & Ding, Shusheng, 2024. "Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
  • Handle: RePEc:eee:tefoso:v:198:y:2024:i:c:s0040162523006297
    DOI: 10.1016/j.techfore.2023.122944
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040162523006297
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.techfore.2023.122944?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Owen A. Lamont & Richard H. Thaler, 2003. "Can the Market Add and Subtract? Mispricing in Tech Stock Carve-outs," Journal of Political Economy, University of Chicago Press, vol. 111(2), pages 227-268, April.
    2. Merton, Robert C, 1973. "An Intertemporal Capital Asset Pricing Model," Econometrica, Econometric Society, vol. 41(5), pages 867-887, September.
    3. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    4. Harry Markowitz, 1952. "Portfolio Selection," Journal of Finance, American Finance Association, vol. 7(1), pages 77-91, March.
    5. Kathryn Tunyasuvunakool & Jonas Adler & Zachary Wu & Tim Green & Michal Zielinski & Augustin Žídek & Alex Bridgland & Andrew Cowie & Clemens Meyer & Agata Laydon & Sameer Velankar & Gerard J. Kleywegt, 2021. "Highly accurate protein structure prediction for the human proteome," Nature, Nature, vol. 596(7873), pages 590-596, August.
    6. Peng, Ling & Kloeden, Peter E., 2021. "Time-consistent portfolio optimization," European Journal of Operational Research, Elsevier, vol. 288(1), pages 183-193.
    7. Laffont, Jean-Jacques & Maskin, Eric S, 1990. "The Efficient Market Hypothesis and Insider Trading on the Stock Market," Journal of Political Economy, University of Chicago Press, vol. 98(1), pages 70-93, February.
    8. P. Bonami & M. A. Lejeune, 2009. "An Exact Solution Approach for Portfolio Optimization Problems Under Stochastic and Integer Constraints," Operations Research, INFORMS, vol. 57(3), pages 650-670, June.
    9. Andrew Ang & Geert Bekaert, 2007. "Stock Return Predictability: Is it There?," The Review of Financial Studies, Society for Financial Studies, vol. 20(3), pages 651-707.
    10. Eachempati, Prajwal & Srivastava, Praveen Ranjan & Kumar, Ajay & Tan, Kim Hua & Gupta, Shivam, 2021. "Validating the impact of accounting disclosures on stock market: A deep neural network approach," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
    11. Bodnar, Taras & Parolya, Nestor & Schmid, Wolfgang, 2018. "Estimation of the global minimum variance portfolio in high dimensions," European Journal of Operational Research, Elsevier, vol. 266(1), pages 371-390.
    12. Nikolaus Hautsch & Lada M. Kyj & Peter Malec, 2015. "Do High‐Frequency Data Improve High‐Dimensional Portfolio Allocations?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(2), pages 263-290, March.
    13. Pierre Bonami & Miguel A. Lejeune, 2009. "An Exact Solution Approach for Integer Constrained Portfolio Optimization Problems Under Stochastic Constraints," Post-Print hal-00421756, HAL.
    14. Campbell, John Y. & Giglio, Stefano & Polk, Christopher & Turley, Robert, 2018. "An intertemporal CAPM with stochastic volatility," Journal of Financial Economics, Elsevier, vol. 128(2), pages 207-233.
    15. Md Shajalal & Petr Hajek & Mohammad Zoynul Abedin, 2023. "Product backorder prediction using deep neural network on imbalanced data," International Journal of Production Research, Taylor & Francis Journals, vol. 61(1), pages 302-319, January.
    16. Yunan Ye & Hengzhi Pei & Boxin Wang & Pin-Yu Chen & Yada Zhu & Jun Xiao & Bo Li, 2020. "Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States," Papers 2002.05780, arXiv.org.
    17. Chu, Jeffrey & Zhang, Yuanyuan & Chan, Stephen, 2019. "The adaptive market hypothesis in the high frequency cryptocurrency market," International Review of Financial Analysis, Elsevier, vol. 64(C), pages 221-231.
    18. Dimitris Bertsimas & Romy Shioda, 2009. "Algorithm for cardinality-constrained quadratic optimization," Computational Optimization and Applications, Springer, vol. 43(1), pages 1-22, May.
    19. Pun, Chi Seng, 2018. "Time-consistent mean-variance portfolio selection with only risky assets," Economic Modelling, Elsevier, vol. 75(C), pages 281-292.
    20. Ahmed, Leena & Mumford, Christine & Kheiri, Ahmed, 2019. "Solving urban transit route design problem using selection hyper-heuristics," European Journal of Operational Research, Elsevier, vol. 274(2), pages 545-559.
    21. Cui, Tianxiang & Ding, Shusheng & Jin, Huan & Zhang, Yongmin, 2023. "Portfolio constructions in cryptocurrency market: A CVaR-based deep reinforcement learning approach," Economic Modelling, Elsevier, vol. 119(C).
    22. Wu, Qun & Liu, Xinwang & Qin, Jindong & Zhou, Ligang & Mardani, Abbas & Deveci, Muhammet, 2022. "An integrated multi-criteria decision-making and multi-objective optimization model for socially responsible portfolio selection," Technological Forecasting and Social Change, Elsevier, vol. 184(C).
    23. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    24. Crama, Y. & Schyns, M., 2003. "Simulated annealing for complex portfolio selection problems," European Journal of Operational Research, Elsevier, vol. 150(3), pages 546-571, November.
    25. Woodside-Oriakhi, M. & Lucas, C. & Beasley, J.E., 2011. "Heuristic algorithms for the cardinality constrained efficient frontier," European Journal of Operational Research, Elsevier, vol. 213(3), pages 538-550, September.
    26. Ma, Yechi & Ahmad, Ferhana & Liu, Miao & Wang, Zilong, 2020. "Portfolio optimization in the era of digital financialization using cryptocurrencies," Technological Forecasting and Social Change, Elsevier, vol. 161(C).
    27. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    28. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    29. Gilbert-Saad, Antoine & Siedlok, Frank & McNaughton, Rod B., 2023. "Entrepreneurial heuristics: Making strategic decisions in highly uncertain environments," Technological Forecasting and Social Change, Elsevier, vol. 189(C).
    30. Tao, Ran & Su, Chi-Wei & Xiao, Yidong & Dai, Ke & Khalid, Fahad, 2021. "Robo advisors, algorithmic trading and investment management: Wonders of fourth industrial revolution in financial markets," Technological Forecasting and Social Change, Elsevier, vol. 163(C).
    31. Richard Bellman, 1957. "On a Dynamic Programming Approach to the Caterer Problem--I," Management Science, INFORMS, vol. 3(3), pages 270-278, April.
    32. John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
    33. Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
    34. Rahimian, Erfan & Akartunalı, Kerem & Levine, John, 2017. "A hybrid Integer Programming and Variable Neighbourhood Search algorithm to solve Nurse Rostering Problems," European Journal of Operational Research, Elsevier, vol. 258(2), pages 411-423.
    35. Edmund K. Burke & Matthew R. Hyde & Graham Kendall & Gabriela Ochoa & Ender Özcan & John R. Woodward, 2019. "A Classification of Hyper-Heuristic Approaches: Revisited," International Series in Operations Research & Management Science, in: Michel Gendreau & Jean-Yves Potvin (ed.), Handbook of Metaheuristics, edition 3, chapter 0, pages 453-477, Springer.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Woodside-Oriakhi, M. & Lucas, C. & Beasley, J.E., 2011. "Heuristic algorithms for the cardinality constrained efficient frontier," European Journal of Operational Research, Elsevier, vol. 213(3), pages 538-550, September.
    2. Mansini, Renata & Ogryczak, Wlodzimierz & Speranza, M. Grazia, 2014. "Twenty years of linear programming based portfolio optimization," European Journal of Operational Research, Elsevier, vol. 234(2), pages 518-535.
    3. Zhou, Zhongbao & Jin, Qianying & Xiao, Helu & Wu, Qian & Liu, Wenbin, 2018. "Estimation of cardinality constrained portfolio efficiency via segmented DEA," Omega, Elsevier, vol. 76(C), pages 28-37.
    4. Wei Xu & Jie Tang & Ka Fai Cedric Yiu & Jian Wen Peng, 2024. "An Efficient Global Optimal Method for Cardinality Constrained Portfolio Optimization," INFORMS Journal on Computing, INFORMS, vol. 36(2), pages 690-704, March.
    5. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    6. Massol, Olivier & Banal-Estañol, Albert, 2014. "Export diversification through resource-based industrialization: The case of natural gas," European Journal of Operational Research, Elsevier, vol. 237(3), pages 1067-1082.
    7. Xuan-Kun Li & Jian-Xu Ma & Xiang-Yu Li & Jun-Jie Hu & Chuan-Yang Ding & Feng-Kai Han & Xiao-Min Guo & Xi Tan & Xian-Min Jin, 2024. "High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    8. Xiaojin Zheng & Xiaoling Sun & Duan Li & Jie Sun, 2014. "Successive convex approximations to cardinality-constrained convex programs: a piecewise-linear DC approach," Computational Optimization and Applications, Springer, vol. 59(1), pages 379-397, October.
    9. Committee, Nobel Prize, 2013. "Understanding Asset Prices," Nobel Prize in Economics documents 2013-1, Nobel Prize Committee.
    10. Ralph Steuer & Markus Hirschberger & Kalyanmoy Deb, 2016. "Extracting from the relaxed for large-scale semi-continuous variable nondominated frontiers," Journal of Global Optimization, Springer, vol. 64(1), pages 33-48, January.
    11. X. Cui & X. Zheng & S. Zhu & X. Sun, 2013. "Convex relaxations and MIQCQP reformulations for a class of cardinality-constrained portfolio selection problems," Journal of Global Optimization, Springer, vol. 56(4), pages 1409-1423, August.
    12. Fuinhas, José Alberto & Marques, António Cardoso & Nogueira, David Coito, 2014. "Análise VAR dos índices bolsistas SP500, FTSE100, PSI20, HSI e IBOVESPA [Integration of the indexes SP500, FTSE100, PSI20, HSI and IBOVESPA: A VAR approach]," MPRA Paper 62092, University Library of Munich, Germany, revised 10 Feb 2015.
    13. Stefan Nagel, 2013. "Empirical Cross-Sectional Asset Pricing," Annual Review of Financial Economics, Annual Reviews, vol. 5(1), pages 167-199, November.
    14. He, Xue-Zhong & Li, Youwei, 2015. "Testing of a market fraction model and power-law behaviour in the DAX 30," Journal of Empirical Finance, Elsevier, vol. 31(C), pages 1-17.
    15. Guo, Hui & Jiang, Xiaowen, 2021. "Aggregate Distress Risk and Equity Returns," Journal of Banking & Finance, Elsevier, vol. 133(C).
    16. Robert J. Shiller, 2003. "From Efficient Markets Theory to Behavioral Finance," Journal of Economic Perspectives, American Economic Association, vol. 17(1), pages 83-104, Winter.
    17. Wang, Wenzhao & Duxbury, Darren, 2021. "Institutional investor sentiment and the mean-variance relationship: Global evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 415-441.
    18. Lu Zhang, 2017. "The Investment CAPM," European Financial Management, European Financial Management Association, vol. 23(4), pages 545-603, September.
    19. Ian Martin, 2017. "What is the Expected Return on the Market?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(1), pages 367-433.
    20. Baker, Malcolm & Wurgler, Jeffrey & Yuan, Yu, 2012. "Global, local, and contagious investor sentiment," Journal of Financial Economics, Elsevier, vol. 104(2), pages 272-287.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:tefoso:v:198:y:2024:i:c:s0040162523006297. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.sciencedirect.com/science/journal/00401625 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.