IDEAS home Printed from https://ideas.repec.org/a/spr/digfin/v6y2024i3d10.1007_s42521-024-00107-2.html
   My bibliography  Save this article

Learning deep news sentiment representations for macro-finance

Author

Listed:
  • Axel Groß-Klußmann

    (Quoniam Asset Management GmbH)

Abstract

This paper introduces custom neural network techniques to the problem of latent economic factor extraction for voluminous news analytics data. In the context of macro-financial news, we derive low-dimensional representations of time series that arise in textual sentiment analyses spanning various topics. We explore three applications for compressed news sentiment data: nowcasting GDP growth, explaining asset class returns in a panel data analysis, and time series momentum investment. Our empirical study shows that nonlinear data representations based on supervised autoencoder architectures compare favorably to alternatives across all applications. In specific, we demonstrate that augmenting autoencoders with supervision tasks based on common asset class returns and market characteristics disciplines the dimension reduction and naturally supports the transparency of resulting representations. Taken together, our findings position supervised autoencoders as attractive competitor models alongside PCA and PLS approaches.

Suggested Citation

  • Axel Groß-Klußmann, 2024. "Learning deep news sentiment representations for macro-finance," Digital Finance, Springer, vol. 6(3), pages 341-377, September.
  • Handle: RePEc:spr:digfin:v:6:y:2024:i:3:d:10.1007_s42521-024-00107-2
    DOI: 10.1007/s42521-024-00107-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42521-024-00107-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s42521-024-00107-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kelly, Bryan T. & Pruitt, Seth & Su, Yinan, 2019. "Characteristics are covariances: A unified model of risk and return," Journal of Financial Economics, Elsevier, vol. 134(3), pages 501-524.
    2. Peter R. Hansen & Asger Lunde & James M. Nason, 2011. "The Model Confidence Set," Econometrica, Econometric Society, vol. 79(2), pages 453-497, March.
    3. John C. Driscoll & Aart C. Kraay, 1998. "Consistent Covariance Matrix Estimation With Spatially Dependent Panel Data," The Review of Economics and Statistics, MIT Press, vol. 80(4), pages 549-560, November.
    4. Calomiris, Charles W. & Mamaysky, Harry, 2019. "How news and its context drive risk and returns around the world," Journal of Financial Economics, Elsevier, vol. 133(2), pages 299-336.
    5. Elena Andreou & Eric Ghysels & Andros Kourtellos, 2013. "Should Macroeconomic Forecasters Use Daily Financial Data and How?," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(2), pages 240-251, April.
    6. Vegard Høghaug Larsen & Leif Anders Thorsrud, 2022. "Asset returns, news topics, and media effects," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 838-868, July.
    7. Ghysels, Eric & Santa-Clara, Pedro & Valkanov, Rossen, 2005. "There is a risk-return trade-off after all," Journal of Financial Economics, Elsevier, vol. 76(3), pages 509-548, June.
    8. Eric Ghysels & Arthur Sinko & Rossen Valkanov, 2007. "MIDAS Regressions: Further Results and New Directions," Econometric Reviews, Taylor & Francis Journals, vol. 26(1), pages 53-90.
    9. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    10. M. Hashem Pesaran, 2006. "Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error Structure," Econometrica, Econometric Society, vol. 74(4), pages 967-1012, July.
    11. Leif Anders Thorsrud, 2020. "Words are the New Numbers: A Newsy Coincident Index of the Business Cycle," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 393-409, April.
    12. Bair, Eric & Hastie, Trevor & Paul, Debashis & Tibshirani, Robert, 2006. "Prediction by Supervised Principal Components," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 119-137, March.
    13. Aparicio, Diego & López de Prado, Marcos, 2018. "How hard is it to pick the right model? MCS and backtest overfitting," Algorithmic Finance, IOS Press, vol. 7(1-2), pages 53-61.
    14. Hansen, Peter Reinhard, 2005. "A Test for Superior Predictive Ability," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 365-380, October.
    15. Andreou, Elena & Ghysels, Eric & Kourtellos, Andros, 2010. "Regression models with mixed sampling frequencies," Journal of Econometrics, Elsevier, vol. 158(2), pages 246-261, October.
    16. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.
    17. Ghysels, Eric & Kvedaras, Virmantas & Zemlys, Vaidotas, 2016. "Mixed Frequency Data Sampling Regression Models: The R Package midasr," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 72(i04).
    18. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    19. Manuel Arellano & Stephen Bond, 1991. "Some Tests of Specification for Panel Data: Monte Carlo Evidence and an Application to Employment Equations," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 58(2), pages 277-297.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Degiannakis, Stavros & Filis, George, 2018. "Forecasting oil prices: High-frequency financial data are indeed useful," Energy Economics, Elsevier, vol. 76(C), pages 388-402.
    2. Nuttanan Wichitaksorn, 2020. "Analyzing and Forecasting Thai Macroeconomic Data using Mixed-Frequency Approach," PIER Discussion Papers 146, Puey Ungphakorn Institute for Economic Research.
    3. Daniel Borup & Jorge Wolfgang Hansen & Benjamin Dybro Liengaard & Erik Christian Montes Schütte, 2023. "Quantifying investor narratives and their role during COVID‐19," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(4), pages 512-532, June.
    4. Ioannis Chalkiadakis & Gareth W. Peters & Matthew Ames, 2023. "Hybrid ARDL-MIDAS-Transformer time-series regressions for multi-topic crypto market sentiment driven by price and technology factors," Digital Finance, Springer, vol. 5(2), pages 295-365, June.
    5. Dhaene, Geert & Wu, Jianbin, 2020. "Incorporating overnight and intraday returns into multivariate GARCH volatility models," Journal of Econometrics, Elsevier, vol. 217(2), pages 471-495.
    6. Qian Chen & Xiang Gao & Shan Xie & Li Sun & Shuairu Tian & Shigeyuki Hamori, 2021. "On the Predictability of China Macro Indicator with Carbon Emissions Trading," Energies, MDPI, vol. 14(5), pages 1-24, February.
    7. Wichitaksorn, Nuttanan, 2022. "Analyzing and forecasting Thai macroeconomic data using mixed-frequency approach," Journal of Asian Economics, Elsevier, vol. 78(C).
    8. Santiago Etchegaray Alvarez, 2022. "Proyecciones macroeconómicas con datos en frecuencias mixtas. Modelos ADL-MIDAS, U-MIDAS y TF-MIDAS con aplicaciones para Uruguay," Documentos de trabajo 2022004, Banco Central del Uruguay.
    9. Stavros Degiannakis, 2023. "The D-model for GDP nowcasting," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-33, December.
    10. Nava, Consuelo R. & Osti, Linda & Zoia, Maria Grazia, 2022. "Forecasting Domestic Tourism across Regional Destinations through MIDAS Regressions," Department of Economics and Statistics Cognetti de Martiis. Working Papers 202207, University of Turin.
    11. Sarun Kamolthip, 2021. "Macroeconomic Forecasting with LSTM and Mixed Frequency Time Series Data," PIER Discussion Papers 165, Puey Ungphakorn Institute for Economic Research.
    12. Valadkhani, Abbas & Smyth, Russell, 2018. "Asymmetric responses in the timing, and magnitude, of changes in Australian monthly petrol prices to daily oil price changes," Energy Economics, Elsevier, vol. 69(C), pages 89-100.
    13. Schumacher, Christian, 2016. "A comparison of MIDAS and bridge equations," International Journal of Forecasting, Elsevier, vol. 32(2), pages 257-270.
    14. Bjørn Eraker & Ching Wai (Jeremy) Chiu & Andrew T. Foerster & Tae Bong Kim & Hernán D. Seoane, 2015. "Bayesian Mixed Frequency VARs," Journal of Financial Econometrics, Oxford University Press, vol. 13(3), pages 698-721.
    15. Kuzin, Vladimir & Marcellino, Massimiliano & Schumacher, Christian, 2011. "MIDAS vs. mixed-frequency VAR: Nowcasting GDP in the euro area," International Journal of Forecasting, Elsevier, vol. 27(2), pages 529-542.
    16. Vegard H. Larsen & Leif Anders Thorsrud, 2018. "Business cycle narratives," Working Paper 2018/3, Norges Bank.
    17. Foroni, Claudia & Marcellino, Massimiliano & Schumacher, Christian, 2011. "U-MIDAS: MIDAS regressions with unrestricted lag polynomials," Discussion Paper Series 1: Economic Studies 2011,35, Deutsche Bundesbank.
    18. Hanan Naser, 2015. "Estimating and forecasting Bahrain quarterly GDP growth using simple regression and factor-based methods," Empirical Economics, Springer, vol. 49(2), pages 449-479, September.
    19. Mogliani, Matteo & Simoni, Anna, 2021. "Bayesian MIDAS penalized regressions: Estimation, selection, and prediction," Journal of Econometrics, Elsevier, vol. 222(1), pages 833-860.

    More about this item

    Keywords

    Neural networks; Sentiment analysis; Semi-structured data; Dimension reduction; Latent factor extraction; High-frequency macro-data;
    All these keywords.

    JEL classification:

    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C58 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Financial Econometrics
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:digfin:v:6:y:2024:i:3:d:10.1007_s42521-024-00107-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.