Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

My bibliography Save this article

Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Author

Listed:

Cui, Tianxiang
Du, Nanjiang
Yang, Xiaoying
Ding, Shusheng

Registered:

Abstract

Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors’ appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data-driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.

Suggested Citation

Cui, Tianxiang & Du, Nanjiang & Yang, Xiaoying & Ding, Shusheng, 2024. "Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach," Technological Forecasting and Social Change, Elsevier, vol. 198(C).

Handle: RePEc:eee:tefoso:v:198:y:2024:i:c:s0040162523006297
DOI: 10.1016/j.techfore.2023.122944

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Nikolaus Hautsch & Lada M. Kyj & Peter Malec, 2015. "Do High‐Frequency Data Improve High‐Dimensional Portfolio Allocations?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(2), pages 263-290, March.
- Hautsch, Nikolaus & Kyj, Lada. M. & Malec, Peter, 2013. "Do high-frequency data improve high-dimensional portfolio allocations?," SFB 649 Discussion Papers 2013-014, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
Kathryn Tunyasuvunakool & Jonas Adler & Zachary Wu & Tim Green & Michal Zielinski & Augustin Žídek & Alex Bridgland & Andrew Cowie & Clemens Meyer & Agata Laydon & Sameer Velankar & Gerard J. Kleywegt, 2021. "Highly accurate protein structure prediction for the human proteome," Nature, Nature, vol. 596(7873), pages 590-596, August.
Laffont, Jean-Jacques & Maskin, Eric S, 1990. "The Efficient Market Hypothesis and Insider Trading on the Stock Market," Journal of Political Economy, University of Chicago Press, vol. 98(1), pages 70-93, February.
Andrew Ang & Geert Bekaert, 2007. "Stock Return Predictability: Is it There?," The Review of Financial Studies, Society for Financial Studies, vol. 20(3), pages 651-707.
- Andrew Ang & Geert Bekaert, 2001. "Stock Return Predictability: Is it There?," NBER Working Papers 8207, National Bureau of Economic Research, Inc.
Pierre Bonami & Miguel A. Lejeune, 2009. "An Exact Solution Approach for Integer Constrained Portfolio Optimization Problems Under Stochastic Constraints," Post-Print hal-00421756, HAL.
Dimitris Bertsimas & Romy Shioda, 2009. "Algorithm for cardinality-constrained quadratic optimization," Computational Optimization and Applications, Springer, vol. 43(1), pages 1-22, May.
Bodnar, Taras & Parolya, Nestor & Schmid, Wolfgang, 2018. "Estimation of the global minimum variance portfolio in high dimensions," European Journal of Operational Research, Elsevier, vol. 266(1), pages 371-390.
- Taras Bodnar & Nestor Parolya & Wolfgang Schmid, 2014. "Estimation of the Global Minimum Variance Portfolio in High Dimensions," Papers 1406.0437, arXiv.org, revised Nov 2015.
Pun, Chi Seng, 2018. "Time-consistent mean-variance portfolio selection with only risky assets," Economic Modelling, Elsevier, vol. 75(C), pages 281-292.
Ahmed, Leena & Mumford, Christine & Kheiri, Ahmed, 2019. "Solving urban transit route design problem using selection hyper-heuristics," European Journal of Operational Research, Elsevier, vol. 274(2), pages 545-559.
Campbell, John Y. & Giglio, Stefano & Polk, Christopher & Turley, Robert, 2018. "An intertemporal CAPM with stochastic volatility," Journal of Financial Economics, Elsevier, vol. 128(2), pages 207-233.
- John Y. Campbell & Stefano Giglio & Christopher Polk & Robert Turley, 2012. "An Intertemporal CAPM with Stochastic Volatility," NBER Working Papers 18411, National Bureau of Economic Research, Inc.
- Campbell, John Y & Polk, Christopher & Giglio, Stefano & Turley, Robert, 2015. "An Intertemporal CAPM with Stochastic Volatility," CEPR Discussion Papers 10681, C.E.P.R. Discussion Papers.
- Campbell, John Y. & Giglio, Stefano & Polk, Christopher & Turley, Robert, 2018. "An Intertemporal CAPM with stochastic volatility," LSE Research Online Documents on Economics 69634, London School of Economics and Political Science, LSE Library.
Crama, Y. & Schyns, M., 2003. "Simulated annealing for complex portfolio selection problems," European Journal of Operational Research, Elsevier, vol. 150(3), pages 546-571, November.
Ma, Yechi & Ahmad, Ferhana & Liu, Miao & Wang, Zilong, 2020. "Portfolio optimization in the era of digital financialization using cryptocurrencies," Technological Forecasting and Social Change, Elsevier, vol. 161(C).
Richard Bellman, 1957. "On a Dynamic Programming Approach to the Caterer Problem--I," Management Science, INFORMS, vol. 3(3), pages 270-278, April.
John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
Owen A. Lamont & Richard H. Thaler, 2003. "Can the Market Add and Subtract? Mispricing in Tech Stock Carve-outs," Journal of Political Economy, University of Chicago Press, vol. 111(2), pages 227-268, April.
- Owen A. Lamont & Richard H. Thaler, "undated". "Can the Market Add and Subtract? Mispricing in Tech Stock Carve-outs," CRSP working papers 528, Center for Research in Security Prices, Graduate School of Business, University of Chicago.
- Owen A. Lamont & Richard H. Thaler, 2001. "Can the Market Add and Subtract? Mispricing in Tech Stock Carve-Outs," NBER Working Papers 8302, National Bureau of Economic Research, Inc.
Merton, Robert C, 1973. "An Intertemporal Capital Asset Pricing Model," Econometrica, Econometric Society, vol. 41(5), pages 867-887, September.
Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
Harry Markowitz, 1952. "Portfolio Selection," Journal of Finance, American Finance Association, vol. 7(1), pages 77-91, March.
Peng, Ling & Kloeden, Peter E., 2021. "Time-consistent portfolio optimization," European Journal of Operational Research, Elsevier, vol. 288(1), pages 183-193.
P. Bonami & M. A. Lejeune, 2009. "An Exact Solution Approach for Portfolio Optimization Problems Under Stochastic and Integer Constraints," Operations Research, INFORMS, vol. 57(3), pages 650-670, June.
Eachempati, Prajwal & Srivastava, Praveen Ranjan & Kumar, Ajay & Tan, Kim Hua & Gupta, Shivam, 2021. "Validating the impact of accounting disclosures on stock market: A deep neural network approach," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
Md Shajalal & Petr Hajek & Mohammad Zoynul Abedin, 2023. "Product backorder prediction using deep neural network on imbalanced data," International Journal of Production Research, Taylor & Francis Journals, vol. 61(1), pages 302-319, January.
Yunan Ye & Hengzhi Pei & Boxin Wang & Pin-Yu Chen & Yada Zhu & Jun Xiao & Bo Li, 2020. "Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States," Papers 2002.05780, arXiv.org.
Chu, Jeffrey & Zhang, Yuanyuan & Chan, Stephen, 2019. "The adaptive market hypothesis in the high frequency cryptocurrency market," International Review of Financial Analysis, Elsevier, vol. 64(C), pages 221-231.
Li, Xiaoyue & Uysal, A. Sinem & Mulvey, John M., 2022. "Multi-period portfolio optimization using model predictive control with mean-variance and risk parity frameworks," European Journal of Operational Research, Elsevier, vol. 299(3), pages 1158-1176.
Cui, Tianxiang & Ding, Shusheng & Jin, Huan & Zhang, Yongmin, 2023. "Portfolio constructions in cryptocurrency market: A CVaR-based deep reinforcement learning approach," Economic Modelling, Elsevier, vol. 119(C).
Wu, Qun & Liu, Xinwang & Qin, Jindong & Zhou, Ligang & Mardani, Abbas & Deveci, Muhammet, 2022. "An integrated multi-criteria decision-making and multi-objective optimization model for socially responsible portfolio selection," Technological Forecasting and Social Change, Elsevier, vol. 184(C).
Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
Woodside-Oriakhi, M. & Lucas, C. & Beasley, J.E., 2011. "Heuristic algorithms for the cardinality constrained efficient frontier," European Journal of Operational Research, Elsevier, vol. 213(3), pages 538-550, September.
David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
Gilbert-Saad, Antoine & Siedlok, Frank & McNaughton, Rod B., 2023. "Entrepreneurial heuristics: Making strategic decisions in highly uncertain environments," Technological Forecasting and Social Change, Elsevier, vol. 189(C).
Tao, Ran & Su, Chi-Wei & Xiao, Yidong & Dai, Ke & Khalid, Fahad, 2021. "Robo advisors, algorithmic trading and investment management: Wonders of fourth industrial revolution in financial markets," Technological Forecasting and Social Change, Elsevier, vol. 163(C).
Rahimian, Erfan & Akartunalı, Kerem & Levine, John, 2017. "A hybrid Integer Programming and Variable Neighbourhood Search algorithm to solve Nurse Rostering Problems," European Journal of Operational Research, Elsevier, vol. 258(2), pages 411-423.
Edmund K. Burke & Matthew R. Hyde & Graham Kendall & Gabriela Ochoa & Ender Özcan & John R. Woodward, 2019. "A Classification of Hyper-Heuristic Approaches: Revisited," International Series in Operations Research & Management Science, in: Michel Gendreau & Jean-Yves Potvin (ed.), Handbook of Metaheuristics, edition 3, chapter 0, pages 453-477, Springer.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Jin, Jiahuan & Cui, Tianxiang & Bai, Ruibin & Qu, Rong, 2024. "Container port truck dispatching optimization using Real2Sim based deep reinforcement learning," European Journal of Operational Research, Elsevier, vol. 315(1), pages 161-175.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Woodside-Oriakhi, M. & Lucas, C. & Beasley, J.E., 2011. "Heuristic algorithms for the cardinality constrained efficient frontier," European Journal of Operational Research, Elsevier, vol. 213(3), pages 538-550, September.
Mansini, Renata & Ogryczak, Wlodzimierz & Speranza, M. Grazia, 2014. "Twenty years of linear programming based portfolio optimization," European Journal of Operational Research, Elsevier, vol. 234(2), pages 518-535.
Zhou, Zhongbao & Jin, Qianying & Xiao, Helu & Wu, Qian & Liu, Wenbin, 2018. "Estimation of cardinality constrained portfolio efficiency via segmented DEA," Omega, Elsevier, vol. 76(C), pages 28-37.
Jin, Jiahuan & Cui, Tianxiang & Bai, Ruibin & Qu, Rong, 2024. "Container port truck dispatching optimization using Real2Sim based deep reinforcement learning," European Journal of Operational Research, Elsevier, vol. 315(1), pages 161-175.
Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
Massol, Olivier & Banal-Estañol, Albert, 2014. "Export diversification through resource-based industrialization: The case of natural gas," European Journal of Operational Research, Elsevier, vol. 237(3), pages 1067-1082.
Xiaojin Zheng & Xiaoling Sun & Duan Li & Jie Sun, 2014. "Successive convex approximations to cardinality-constrained convex programs: a piecewise-linear DC approach," Computational Optimization and Applications, Springer, vol. 59(1), pages 379-397, October.
Committee, Nobel Prize, 2013. "Understanding Asset Prices," Nobel Prize in Economics documents 2013-1, Nobel Prize Committee.
Wang, Jianzhou & Lv, Mengzheng & Wang, Shuai & Gao, Jialu & Zhao, Yang & Wang, Qiangqiang, 2024. "Can multi-period auto-portfolio systems improve returns? Evidence from Chinese and U.S. stock markets," International Review of Financial Analysis, Elsevier, vol. 95(PB).
Wei Xu & Jie Tang & Ka Fai Cedric Yiu & Jian Wen Peng, 2024. "An Efficient Global Optimal Method for Cardinality Constrained Portfolio Optimization," INFORMS Journal on Computing, INFORMS, vol. 36(2), pages 690-704, March.
Zhenchong Mo & Lin Gong & Mingren Zhu & Junde Lan, 2024. "The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning," Sustainability, MDPI, vol. 16(22), pages 1-34, November.
Xuan-Kun Li & Jian-Xu Ma & Xiang-Yu Li & Jun-Jie Hu & Chuan-Yang Ding & Feng-Kai Han & Xiao-Min Guo & Xi Tan & Xian-Min Jin, 2024. "High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
Ralph Steuer & Markus Hirschberger & Kalyanmoy Deb, 2016. "Extracting from the relaxed for large-scale semi-continuous variable nondominated frontiers," Journal of Global Optimization, Springer, vol. 64(1), pages 33-48, January.
X. Cui & X. Zheng & S. Zhu & X. Sun, 2013. "Convex relaxations and MIQCQP reformulations for a class of cardinality-constrained portfolio selection problems," Journal of Global Optimization, Springer, vol. 56(4), pages 1409-1423, August.
Stefan Nagel, 2013. "Empirical Cross-Sectional Asset Pricing," Annual Review of Financial Economics, Annual Reviews, vol. 5(1), pages 167-199, November.
- Stefan Nagel, 2012. "Empirical Cross-Sectional Asset Pricing," NBER Working Papers 18554, National Bureau of Economic Research, Inc.
- Nagel, Stefan, 2012. "Empirical Cross-Sectional Asset Pricing," CEPR Discussion Papers 9227, C.E.P.R. Discussion Papers.
Guo, Hui & Jiang, Xiaowen, 2021. "Aggregate Distress Risk and Equity Returns," Journal of Banking & Finance, Elsevier, vol. 133(C).
Robert J. Shiller, 2003. "From Efficient Markets Theory to Behavioral Finance," Journal of Economic Perspectives, American Economic Association, vol. 17(1), pages 83-104, Winter.
- Robert J. Shiller, 2002. "From Efficient Market Theory to Behavioral Finance," Cowles Foundation Discussion Papers 1385, Cowles Foundation for Research in Economics, Yale University.
Ian Martin, 2017. "What is the Expected Return on the Market?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(1), pages 367-433.
- Martin, Ian, 2015. "What is the Expected Return on the Market?," CEPR Discussion Papers 10715, C.E.P.R. Discussion Papers.
- Martin, Ian, 2016. "What is the expected return on the market?," LSE Research Online Documents on Economics 119013, London School of Economics and Political Science, LSE Library.
- Martin, Ian, 2017. "What is the expected return on the market?," LSE Research Online Documents on Economics 67036, London School of Economics and Political Science, LSE Library.
Dimitris Bertsimas & Ryan Cory-Wright, 2022. "A Scalable Algorithm for Sparse Portfolio Selection," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1489-1511, May.
Yang, Kaiyuan & Huang, Houjing & Vandans, Olafs & Murali, Adithya & Tian, Fujia & Yap, Roland H.C. & Dai, Liang, 2023. "Applying deep reinforcement learning to the HP model for protein structure prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).

More about this item

Keywords

Portfolio optimization; Deep reinforcement learning; Hyper-heuristic; Decision making; Uncertainty;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:tefoso:v:198:y:2024:i:c:s0040162523006297. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.sciencedirect.com/science/journal/00401625 .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data