IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2403.06779.html
   My bibliography  Save this paper

From Factor Models to Deep Learning: Machine Learning in Reshaping Empirical Asset Pricing

Author

Listed:
  • Junyi Ye
  • Bhaskar Goswami
  • Jingyi Gu
  • Ajim Uddin
  • Guiling Wang

Abstract

This paper comprehensively reviews the application of machine learning (ML) and AI in finance, specifically in the context of asset pricing. It starts by summarizing the traditional asset pricing models and examining their limitations in capturing the complexities of financial markets. It explores how 1) ML models, including supervised, unsupervised, semi-supervised, and reinforcement learning, provide versatile frameworks to address these complexities, and 2) the incorporation of advanced ML algorithms into traditional financial models enhances return prediction and portfolio optimization. These methods can adapt to changing market dynamics by modeling structural changes and incorporating heterogeneous data sources, such as text and images. In addition, this paper explores challenges in applying ML in asset pricing, addressing the growing demand for explainability in decision-making and mitigating overfitting in complex models. This paper aims to provide insights into novel methodologies showcasing the potential of ML to reshape the future of quantitative finance.

Suggested Citation

  • Junyi Ye & Bhaskar Goswami & Jingyi Gu & Ajim Uddin & Guiling Wang, 2024. "From Factor Models to Deep Learning: Machine Learning in Reshaping Empirical Asset Pricing," Papers 2403.06779, arXiv.org.
  • Handle: RePEc:arx:papers:2403.06779
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2403.06779
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kelly, Bryan T. & Pruitt, Seth & Su, Yinan, 2019. "Characteristics are covariances: A unified model of risk and return," Journal of Financial Economics, Elsevier, vol. 134(3), pages 501-524.
    2. Stefano Giglio & Bryan Kelly & Dacheng Xiu, 2022. "Factor Models, Machine Learning, and Asset Pricing," Annual Review of Financial Economics, Annual Reviews, vol. 14(1), pages 337-368, November.
    3. Zhiguo He & Arvind Krishnamurthy, 2013. "Intermediary Asset Pricing," American Economic Review, American Economic Association, vol. 103(2), pages 732-770, April.
    4. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    5. Leippold, Markus & Wang, Qian & Zhou, Wenyu, 2022. "Machine learning in the Chinese stock market," Journal of Financial Economics, Elsevier, vol. 145(2), pages 64-82.
    6. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    7. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    8. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    9. Stefano Giglio & Dacheng Xiu, 2021. "Asset Pricing with Omitted Factors," Journal of Political Economy, University of Chicago Press, vol. 129(7), pages 1947-1990.
    10. repec:bla:jfinan:v:59:y:2004:i:4:p:1481-1509 is not listed on IDEAS
    11. William F. Sharpe, 1964. "Capital Asset Prices: A Theory Of Market Equilibrium Under Conditions Of Risk," Journal of Finance, American Finance Association, vol. 19(3), pages 425-442, September.
    12. Ludovic Goudenège & Andrea Molent & Antonino Zanette, 2020. "Machine learning for pricing American options in high-dimensional Markovian and non-Markovian models," Quantitative Finance, Taylor & Francis Journals, vol. 20(4), pages 573-591, April.
    13. Eugene F. Fama & Kenneth R. French, 2016. "Dissecting Anomalies with a Five-Factor Model," The Review of Financial Studies, Society for Financial Studies, vol. 29(1), pages 69-103.
    14. Xiao-Yang Liu & Ziyi Xia & Jingyang Rui & Jiechao Gao & Hongyang Yang & Ming Zhu & Christina Dan Wang & Zhaoran Wang & Jian Guo, 2022. "FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning," Papers 2211.03107, arXiv.org.
    15. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.
    16. Ajim Uddin & Xinyuan Tao & Chia-Ching Chou & Dantong Yu, 2022. "Are missing values important for earnings forecasts? A machine learning perspective," Quantitative Finance, Taylor & Francis Journals, vol. 22(6), pages 1113-1132, June.
    17. Beckmeyer, Heiner & Wiedemann, Timo, 2022. "Recovering Missing Firm Characteristics with Attention-Based Machine Learning," VfS Annual Conference 2022 (Basel): Big Data in Economics 264135, Verein für Socialpolitik / German Economic Association.
    18. Raehyun Kim & Chan Ho So & Minbyul Jeong & Sanghoon Lee & Jinkyu Kim & Jaewoo Kang, 2019. "HATS: A Hierarchical Graph Attention Network for Stock Movement Prediction," Papers 1908.07999, arXiv.org, revised Nov 2019.
    19. Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
    20. Uddin, Ajim & Yu, Dantong, 2020. "Latent factor model for asset pricing," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    21. Connor, Gregory & Korajczyk, Robert A., 1986. "Performance measurement with the arbitrage pricing theory : A new framework for analysis," Journal of Financial Economics, Elsevier, vol. 15(3), pages 373-394, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Uddin, Ajim & Tao, Xinyuan & Yu, Dantong, 2023. "Attention based dynamic graph neural network for asset pricing," Global Finance Journal, Elsevier, vol. 58(C).
    2. Constantinos Kardaras & Hyeng Keun Koo & Johannes Ruf, 2022. "Estimation of growth in fund models," Papers 2208.02573, arXiv.org.
    3. Christian Fieberg & Daniel Metko & Thorsten Poddig & Thomas Loy, 2023. "Machine learning techniques for cross-sectional equity returns’ prediction," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 289-323, March.
    4. Ma, Tian & Leong, Wen Jun & Jiang, Fuwei, 2023. "A latent factor model for the Chinese stock market," International Review of Financial Analysis, Elsevier, vol. 87(C).
    5. Wolfgang Drobetz & Tizian Otto, 2021. "Empirical asset pricing via machine learning: evidence from the European stock market," Journal of Asset Management, Palgrave Macmillan, vol. 22(7), pages 507-538, December.
    6. Vafai, Nima & Rakowski, David, 2024. "The sources of portfolio volatility and mutual fund performance," International Review of Financial Analysis, Elsevier, vol. 91(C).
    7. Xiao, Xiang & Hua, Xia & Qin, Kexin, 2024. "A self-attention based cross-sectional return forecasting model with evidence from the Chinese market," Finance Research Letters, Elsevier, vol. 62(PA).
    8. Langlois, Hugues, 2023. "What matters in a characteristic?," Journal of Financial Economics, Elsevier, vol. 149(1), pages 52-72.
    9. Bandi, Federico M. & Chaudhuri, Shomesh E. & Lo, Andrew W. & Tamoni, Andrea, 2021. "Spectral factor models," Journal of Financial Economics, Elsevier, vol. 142(1), pages 214-238.
    10. Uddin, Ajim & Yu, Dantong, 2020. "Latent factor model for asset pricing," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    11. Svetlana Bryzgalova & Jiantao Huang & Christian Julliard, 2023. "Bayesian Solutions for the Factor Zoo: We Just Ran Two Quadrillion Models," Journal of Finance, American Finance Association, vol. 78(1), pages 487-557, February.
    12. De Nard, Gianluca & Zhao, Zhao, 2023. "Using, taming or avoiding the factor zoo? A double-shrinkage estimator for covariance matrices," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 23-35.
    13. Söhnke M. Bartram & Harald Lohre & Peter F. Pope & Ananthalakshmi Ranganathan, 2021. "Navigating the factor zoo around the world: an institutional investor perspective," Journal of Business Economics, Springer, vol. 91(5), pages 655-703, July.
    14. Ai He & Guofu Zhou, 2023. "Diagnostics for asset pricing models," Financial Management, Financial Management Association International, vol. 52(4), pages 617-642, December.
    15. Weichuan Deng & Pawel Polak & Abolfazl Safikhani & Ronakdilip Shah, 2023. "A Unified Framework for Fast Large-Scale Portfolio Optimization," Papers 2303.12751, arXiv.org, revised Nov 2023.
    16. Cakici, Nusret & Shahzad, Syed Jawad Hussain & Będowska-Sójka, Barbara & Zaremba, Adam, 2024. "Machine learning and the cross-section of cryptocurrency returns," International Review of Financial Analysis, Elsevier, vol. 94(C).
    17. Dashan Huang & Fuwei Jiang & Kunpeng Li & Guoshi Tong & Guofu Zhou, 2022. "Scaled PCA: A New Approach to Dimension Reduction," Management Science, INFORMS, vol. 68(3), pages 1678-1695, March.
    18. van Binsbergen, Jules H. & Boons, Martijn & Opp, Christian C. & Tamoni, Andrea, 2023. "Dynamic asset (mis)pricing: Build-up versus resolution anomalies," Journal of Financial Economics, Elsevier, vol. 147(2), pages 406-431.
    19. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    20. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2403.06779. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.