IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v321y2025i1p243-256.html
   My bibliography  Save this article

Integration of prediction and optimization for smart stock portfolio selection

Author

Listed:
  • Sarkar, Puja
  • Khanapuri, Vivekanand B.
  • Tiwari, Manoj Kumar

Abstract

Machine learning (ML) algorithms pose significant challenges in predicting unknown parameters for optimization models in decision-making scenarios. Conventionally, prediction models are optimized independently in decision-making processes, whereas ML algorithms primarily focus on minimizing prediction errors, neglecting the role of decision-making in downstream optimization tasks. The pursuit of high prediction accuracy may not always align with the goal of reducing decision errors. The idea of reducing decision errors has been proposed to address this limitation. This paper introduces an optimization process that integrates predictive regression models within a mean–variance optimization setting. This innovative technique introduces a general loss function to capture decision errors. Consequently, the predictive model not only focuses on forecasting unknown optimization parameters but also emphasizes the predicted values that minimize decision errors. This approach prioritizes decision accuracy over the potential accuracy of unknown parameter prediction. In contrast to traditional ML approaches that minimize standard loss functions such as mean squared error, our proposed model seeks to minimize the objective value derived directly from the decision-making problem. Furthermore, this strategy is validated by developing an optimization-based regression tree model for predicting stock returns and reducing decision errors. Empirical evaluations of our framework reveal its superiority over conventional regression tree methods, demonstrating enhanced decision quality. The computational experiments are conducted on a stock market dataset to compare the effectiveness of the proposed framework with the conventional regression tree-based approach. Remarkably, the results confirm the strengths inherent in this holistic approach.

Suggested Citation

  • Sarkar, Puja & Khanapuri, Vivekanand B. & Tiwari, Manoj Kumar, 2025. "Integration of prediction and optimization for smart stock portfolio selection," European Journal of Operational Research, Elsevier, vol. 321(1), pages 243-256.
  • Handle: RePEc:eee:ejores:v:321:y:2025:i:1:p:243-256
    DOI: 10.1016/j.ejor.2024.08.027
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221724006672
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2024.08.027?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fama, Eugene F & French, Kenneth R, 1992. "The Cross-Section of Expected Stock Returns," Journal of Finance, American Finance Association, vol. 47(2), pages 427-465, June.
    2. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    3. Václavík, Roman & Novák, Antonín & Šůcha, Přemysl & Hanzálek, Zdeněk, 2018. "Accelerating the Branch-and-Price Algorithm Using Machine Learning," European Journal of Operational Research, Elsevier, vol. 271(3), pages 1055-1069.
    4. Dieter, Peter & Caron, Matthew & Schryen, Guido, 2023. "Integrating driver behavior into last-mile delivery routing: Combining machine learning and optimization in a hybrid decision support framework," European Journal of Operational Research, Elsevier, vol. 311(1), pages 283-300.
    5. Kallestad, Jakob & Hasibi, Ramin & Hemmati, Ahmad & Sörensen, Kenneth, 2023. "A general deep reinforcement learning hyperheuristic framework for solving combinatorial optimization problems," European Journal of Operational Research, Elsevier, vol. 309(1), pages 446-468.
    6. Bodnar, Taras & Mazur, Stepan & Okhrin, Yarema, 2017. "Bayesian estimation of the global minimum variance portfolio," European Journal of Operational Research, Elsevier, vol. 256(1), pages 292-307.
    7. Miten Mistry & Dimitrios Letsios & Gerhard Krennrich & Robert M. Lee & Ruth Misener, 2021. "Mixed-Integer Convex Nonlinear Optimization with Gradient-Boosted Trees Embedded," INFORMS Journal on Computing, INFORMS, vol. 33(3), pages 1103-1119, July.
    8. Dehaybe, Henri & Catanzaro, Daniele & Chevalier, Philippe, 2024. "Deep Reinforcement Learning for inventory optimization with non-stationary uncertain demand," European Journal of Operational Research, Elsevier, vol. 314(2), pages 433-445.
    9. Yu, Jing-Rung & Paul Chiou, Wan-Jiun & Lee, Wen-Yi & Lin, Shun-Ji, 2020. "Portfolio models with return forecasting and transaction costs," International Review of Economics & Finance, Elsevier, vol. 66(C), pages 118-130.
    10. Shen, Yunzhuang & Sun, Yuan & Li, Xiaodong & Eberhard, Andrew & Ernst, Andreas, 2023. "Adaptive solution prediction for combinatorial optimization," European Journal of Operational Research, Elsevier, vol. 309(3), pages 1392-1408.
    11. Fischer, Thomas & Krauss, Christopher, 2018. "Deep learning with long short-term memory networks for financial market predictions," European Journal of Operational Research, Elsevier, vol. 270(2), pages 654-669.
    12. Harry Markowitz, 1952. "Portfolio Selection," Journal of Finance, American Finance Association, vol. 7(1), pages 77-91, March.
    13. Dimitris Bertsimas & Ryan Cory-Wright, 2022. "A Scalable Algorithm for Sparse Portfolio Selection," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1489-1511, May.
    14. Dimitris Bertsimas & Cheol Woo Kim, 2023. "A Prescriptive Machine Learning Approach to Mixed-Integer Convex Optimization," INFORMS Journal on Computing, INFORMS, vol. 35(6), pages 1225-1241, November.
    15. Yilmaz, Dogacan & Büyüktahtakın, İ. Esra, 2024. "An expandable machine learning-optimization framework to sequential decision-making," European Journal of Operational Research, Elsevier, vol. 314(1), pages 280-296.
    16. Andrew Butler & Roy H. Kwon, 2023. "Integrating prediction in mean-variance portfolio optimization," Quantitative Finance, Taylor & Francis Journals, vol. 23(3), pages 429-452, March.
    17. Junming Liu & Weiwei Chen & Jingyuan Yang & Hui Xiong & Can Chen, 2022. "Iterative Prediction-and-Optimization for E-Logistics Distribution Network Design," INFORMS Journal on Computing, INFORMS, vol. 34(2), pages 769-789, March.
    18. Jose Blanchet & Lin Chen & Xun Yu Zhou, 2022. "Distributionally Robust Mean-Variance Portfolio Selection with Wasserstein Distances," Management Science, INFORMS, vol. 68(9), pages 6382-6410, September.
    19. Platanakis, Emmanouil & Sutcliffe, Charles & Ye, Xiaoxia, 2021. "Horses for courses: Mean-variance for asset allocation and 1/N for stock selection," European Journal of Operational Research, Elsevier, vol. 288(1), pages 302-317.
    20. Corazza, Marco, 2021. "A note on “Portfolio selection under possibilistic mean-variance utility and a SMO algorithm”," European Journal of Operational Research, Elsevier, vol. 288(1), pages 343-345.
    21. Karimi-Mamaghan, Maryam & Mohammadi, Mehrdad & Meyer, Patrick & Karimi-Mamaghan, Amir Mohammad & Talbi, El-Ghazali, 2022. "Machine learning at the service of meta-heuristics for solving combinatorial optimization problems: A state-of-the-art," European Journal of Operational Research, Elsevier, vol. 296(2), pages 393-422.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tony Guida & Guillaume Coqueret, 2019. "Ensemble Learning Applied to Quant Equity: Gradient Boosting in a Multifactor Framework," Post-Print hal-02311104, HAL.
    2. Wang, Jianzhou & Lv, Mengzheng & Wang, Shuai & Gao, Jialu & Zhao, Yang & Wang, Qiangqiang, 2024. "Can multi-period auto-portfolio systems improve returns? Evidence from Chinese and U.S. stock markets," International Review of Financial Analysis, Elsevier, vol. 95(PB).
    3. Kentaro Imajo & Kentaro Minami & Katsuya Ito & Kei Nakagawa, 2020. "Deep Portfolio Optimization via Distributional Prediction of Residual Factors," Papers 2012.07245, arXiv.org.
    4. Vitor Azevedo & Christopher Hoegner, 2023. "Enhancing stock market anomalies with machine learning," Review of Quantitative Finance and Accounting, Springer, vol. 60(1), pages 195-230, January.
    5. Uddin, Ajim & Yu, Dantong, 2020. "Latent factor model for asset pricing," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    6. Ko, Hyungjin & Son, Bumho & Lee, Jaewook, 2024. "A novel integration of the Fama–French and Black–Litterman models to enhance portfolio management," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 91(C).
    7. Abada, Ibrahim & Lambin, Xavier & Tchakarov, Nikolay, 2024. "Collusion by mistake: Does algorithmic sophistication drive supra-competitive profits?," European Journal of Operational Research, Elsevier, vol. 318(3), pages 927-953.
    8. Chen, Wei & Zhang, Haoyu & Jia, Lifen, 2022. "A novel two-stage method for well-diversified portfolio construction based on stock return prediction using machine learning," The North American Journal of Economics and Finance, Elsevier, vol. 63(C).
    9. Priya Singh & Manoj Jha, 2024. "Portfolio Optimization Using Novel EW-MV Method in Conjunction with Asset Preselection," Computational Economics, Springer;Society for Computational Economics, vol. 64(6), pages 3683-3712, December.
    10. Schnaubelt, Matthias & Seifert, Oleg, 2020. "Valuation ratios, surprises, uncertainty or sentiment: How does financial machine learning predict returns from earnings announcements?," FAU Discussion Papers in Economics 04/2020, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    11. Kobayashi, Ken & Takano, Yuichi & Nakata, Kazuhide, 2023. "Cardinality-constrained distributionally robust portfolio optimization," European Journal of Operational Research, Elsevier, vol. 309(3), pages 1173-1182.
    12. Muhinyuza, Stanislas & Bodnar, Taras & Lindholm, Mathias, 2020. "A test on the location of the tangency portfolio on the set of feasible portfolios," Applied Mathematics and Computation, Elsevier, vol. 386(C).
    13. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    14. Zhong, Angel, 2018. "Idiosyncratic volatility in the Australian equity market," Pacific-Basin Finance Journal, Elsevier, vol. 50(C), pages 105-125.
    15. Klaus Schredelseker, 2012. "Finanzkrise — Mitschuld der Theorie?," Schmalenbach Journal of Business Research, Springer, vol. 64(8), pages 833-845, December.
    16. Silva, A. Christian & Prange, Richard E., 2007. "Virtual volatility," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 376(C), pages 507-516.
    17. Hu, Xiao & Kang, Siqin & Ren, Long & Zhu, Shaokeng, 2024. "Interactive preference analysis: A reinforcement learning framework," European Journal of Operational Research, Elsevier, vol. 319(3), pages 983-998.
    18. Taufiq Choudhry & Ranadeva Jayasekera, 2015. "Level of efficiency in the UK equity market: empirical study of the effects of the global financial crisis," Review of Quantitative Finance and Accounting, Springer, vol. 44(2), pages 213-242, February.
    19. Cathy W. S. Chen & Richard H. Gerlach & Ann M. H. Lin, 2011. "Multi-regime nonlinear capital asset pricing models," Quantitative Finance, Taylor & Francis Journals, vol. 11(9), pages 1421-1438, April.
    20. Bartosz Bieganowski & Robert 'Slepaczuk, 2024. "Supervised Autoencoders with Fractionally Differentiated Features and Triple Barrier Labelling Enhance Predictions on Noisy Data," Papers 2411.12753, arXiv.org, revised Nov 2024.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:321:y:2025:i:1:p:243-256. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.