IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2106.07103.html
   My bibliography  Save this paper

A News-based Machine Learning Model for Adaptive Asset Pricing

Author

Listed:
  • Liao Zhu
  • Haoxuan Wu
  • Martin T. Wells

Abstract

The paper proposes a new asset pricing model -- the News Embedding UMAP Selection (NEUS) model, to explain and predict the stock returns based on the financial news. Using a combination of various machine learning algorithms, we first derive a company embedding vector for each basis asset from the financial news. Then we obtain a collection of the basis assets based on their company embedding. After that for each stock, we select the basis assets to explain and predict the stock return with high-dimensional statistical methods. The new model is shown to have a significantly better fitting and prediction power than the Fama-French 5-factor model.

Suggested Citation

  • Liao Zhu & Haoxuan Wu & Martin T. Wells, 2021. "A News-based Machine Learning Model for Adaptive Asset Pricing," Papers 2106.07103, arXiv.org.
  • Handle: RePEc:arx:papers:2106.07103
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2106.07103
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    2. Liao Zhu & Robert A. Jarrow & Martin T. Wells, 2021. "Time-Invariance Coefficients Tests with the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 11(04), pages 1-30, December.
    3. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    4. Robert A. Jarrow & Rinald Murataj & Martin T. Wells & Liao Zhu, 2023. "The Low-Volatility Anomaly And The Adaptive Multi-Factor Model," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 26(04n05), pages 1-33, August.
    5. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    6. Marta Banbura & Domenico Giannone & Lucrezia Reichlin, 2010. "Large Bayesian vector auto regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 71-92.
    7. Liao Zhu & Sumanta Basu & Robert A. Jarrow & Martin T. Wells, 2020. "High-Dimensional Estimation, Basis Assets, and the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 10(04), pages 1-52, December.
    8. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    9. Bien, Jacob & Tibshirani, Robert, 2011. "Hierarchical Clustering With Prototypes via Minimax Linkage," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1075-1084.
    10. Yanci Zhang & Tianming Du & Yujie Sun & Lawrence Donohue & Rui Dai, 2021. "Form 10-Q Itemization," Papers 2104.11783, arXiv.org, revised Oct 2021.
    11. Robert Jarrow, 2016. "Bubbles And Multiple-Factor Asset Pricing Models," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 19(01), pages 1-19, February.
    12. Song, Song & Bickel, Peter J., 2011. "Large vector auto regressions," SFB 649 Discussion Papers 2011-048, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    13. repec:ulb:ulbeco:2013/13388 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liao Zhu, 2021. "The Adaptive Multi-Factor Model and the Financial Market," Papers 2107.14410, arXiv.org, revised Aug 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liao Zhu, 2021. "The Adaptive Multi-Factor Model and the Financial Market," Papers 2107.14410, arXiv.org, revised Aug 2021.
    2. Liao Zhu & Ningning Sun & Martin T. Wells, 2022. "Clustering Structure of Microstructure Measures," Applied Economics and Finance, Redfame publishing, vol. 9(1), pages 85-95, December.
    3. Camehl, Annika, 2023. "Penalized estimation of panel vector autoregressive models: A panel LASSO approach," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1185-1204.
    4. Kascha, Christian & Trenkler, Carsten, 2015. "Forecasting VARs, model selection, and shrinkage," Working Papers 15-07, University of Mannheim, Department of Economics.
    5. Zachary F. Fisher & Younghoon Kim & Barbara L. Fredrickson & Vladas Pipiras, 2022. "Penalized Estimation and Forecasting of Multiple Subject Intensive Longitudinal Data," Psychometrika, Springer;The Psychometric Society, vol. 87(2), pages 1-29, June.
    6. Alain Hecq & Luca Margaritella & Stephan Smeekes, 2023. "Granger Causality Testing in High-Dimensional VARs: A Post-Double-Selection Procedure," Journal of Financial Econometrics, Oxford University Press, vol. 21(3), pages 915-958.
    7. Smeekes, Stephan & Wijler, Etienne, 2018. "Macroeconomic forecasting using penalized regression methods," International Journal of Forecasting, Elsevier, vol. 34(3), pages 408-430.
    8. Robert A. Jarrow & Rinald Murataj & Martin T. Wells & Liao Zhu, 2023. "The Low-Volatility Anomaly And The Adaptive Multi-Factor Model," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 26(04n05), pages 1-33, August.
    9. Liao Zhu & Sumanta Basu & Robert A. Jarrow & Martin T. Wells, 2020. "High-Dimensional Estimation, Basis Assets, and the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 10(04), pages 1-52, December.
    10. Andrei Dubovik & Adam Elbourne & Bram Hendriks & Mark Kattenberg, 2022. "Forecasting World Trade Using Big Data and Machine Learning Techniques," CPB Discussion Paper 441, CPB Netherlands Bureau for Economic Policy Analysis.
    11. Skripnikov, A. & Michailidis, G., 2019. "Joint estimation of multiple network Granger causal models," Econometrics and Statistics, Elsevier, vol. 10(C), pages 120-133.
    12. Denis Shibitov & Mariam Mamedli, 2021. "Forecasting Russian Cpi With Data Vintages And Machine Learning Techniques," Bank of Russia Working Paper Series wps70, Bank of Russia.
    13. Bork, Lasse, 2009. "Estimating US Monetary Policy Shocks Using a Factor-Augmented Vector Autoregression: An EM Algorithm Approach," Finance Research Group Working Papers F-2009-03, University of Aarhus, Aarhus School of Business, Department of Business Studies.
    14. Battulga Gankhuu, 2023. "Parameter Estimation Methods of Required Rate of Return," Papers 2305.19708, arXiv.org, revised Aug 2023.
    15. Liao Zhu & Robert A. Jarrow & Martin T. Wells, 2021. "Time-Invariance Coefficients Tests with the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 11(04), pages 1-30, December.
    16. Hadjiantoni, Stella & Kontoghiorghes, Erricos John, 2022. "An alternative numerical method for estimating large-scale time-varying parameter seemingly unrelated regressions models," Econometrics and Statistics, Elsevier, vol. 21(C), pages 1-18.
    17. Pettenuzzo, Davide & Timmermann, Allan & Valkanov, Rossen, 2014. "Forecasting stock returns under economic constraints," Journal of Financial Economics, Elsevier, vol. 114(3), pages 517-553.
    18. Tesi Aliaj & Milos Ciganovic & Massimiliano Tancioni, 2023. "Nowcasting inflation with Lasso‐regularized vector autoregressions and mixed frequency data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(3), pages 464-480, April.
    19. Daniel Felix Ahelegbey & Monica Billio & Roberto Casarin, 2016. "Sparse Graphical Vector Autoregression: A Bayesian Approach," Annals of Economics and Statistics, GENES, issue 123-124, pages 333-361.
    20. Creal, Drew & Kim, Jaeho, 2024. "Bayesian estimation of cluster covariance matrices of unknown form," Journal of Econometrics, Elsevier, vol. 241(1).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2106.07103. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.