IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2105.09154.html

Using four different online media sources to forecast the crude oil price

Author

Listed:
  • M. Elshendy
  • A. Fronzetti Colladon
  • E. Battistoni
  • P. A. Gloor

Abstract

This study looks for signals of economic awareness on online social media and tests their significance for economic predictions. Over a period of two years, it analyses the relationship between the daily West Texas Intermediate crude oil price and multiple predictors extracted from Twitter, Google Trends, Wikipedia, and the Global Database of Events, Language, and Tone (GDELT). Semantic analysis is applied to study the sentiment, emotionality and complexity of the language used. Autoregressive Integrated Moving Average with Explanatory Variable (ARIMAX) models are used to make predictions and to confirm the value of the study variables. Results show that the combined analysis of the four media platforms carries valuable information for financial forecasting. Twitter language complexity, the number of GDELT articles and Wikipedia page reads have the highest predictive power. The study also compares the foresight of each platform, measured as how many days ahead a platform can signal a price movement before it happens. Compared with previous work, the joint analysis combines more media sources and more dimensions of the interaction and of the language used.
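The idea of measuring "how many days ahead" a platform signals a price movement can be illustrated with a simple lead-lag scan. The sketch below is not the authors' ARIMAX procedure: it is a minimal stand-in that uses plain lagged Pearson correlation, and the data, the function names `pearson` and `best_lead`, and the 3-day lead are all hypothetical choices made for illustration.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def best_lead(predictor, target, max_lead):
    """Scan leads 0..max_lead and return the (lead, correlation) pair
    with the largest |corr(predictor[t], target[t + lead])|."""
    scores = {}
    for lead in range(max_lead + 1):
        # Align predictor at day t with target at day t + lead.
        x = predictor[: len(predictor) - lead] if lead else predictor
        y = target[lead:]
        scores[lead] = pearson(x, y)
    lead = max(scores, key=lambda k: abs(scores[k]))
    return lead, scores[lead]


# Synthetic data (NOT from the paper): the target echoes the
# predictor three days later, plus noise.
random.seed(0)
signal = [random.gauss(0, 1) for _ in range(300)]
true_lead = 3
price_change = [0.0] * true_lead + [
    0.8 * s + random.gauss(0, 0.3) for s in signal[:-true_lead]
]

lead, corr = best_lead(signal, price_change, max_lead=7)
print(f"strongest association at a lead of {lead} day(s), r = {corr:.2f}")
```

In an actual ARIMAX setting the lagged predictor would enter the model as an explanatory variable alongside the autoregressive and moving-average terms, so a scan like this is at most a first screening step.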

Suggested Citation

  • M. Elshendy & A. Fronzetti Colladon & E. Battistoni & P. A. Gloor, 2021. "Using four different online media sources to forecast the crude oil price," Papers 2105.09154, arXiv.org.
  • Handle: RePEc:arx:papers:2105.09154

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2105.09154
    File Function: Latest version
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tuhkuri, Joonas, 2016. "Forecasting Unemployment with Google Searches," ETLA Working Papers 35, The Research Institute of the Finnish Economy.
    2. Tuhkuri, Joonas, 2016. "ETLAnow: A Model for Forecasting with Big Data – Forecasting Unemployment with Google Searches in Europe," ETLA Reports 54, The Research Institute of the Finnish Economy.
    3. Lang, Korbinian & Auer, Benjamin R., 2020. "The economic and financial properties of crude oil: A review," The North American Journal of Economics and Finance, Elsevier, vol. 52(C).
    4. Laurent Ferrara & Anna Simoni, 2023. "When are Google Data Useful to Nowcast GDP? An Approach via Preselection and Shrinkage," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(4), pages 1188-1202, October.
    5. David Coble & Pablo Pincheira, 2021. "Forecasting building permits with Google Trends," Empirical Economics, Springer, vol. 61(6), pages 3315-3345, December.
    6. Pérez, Fernando, 2018. "Nowcasting Peruvian GDP using Leading Indicators and Bayesian Variable Selection," Working Papers 2018-010, Banco Central de Reserva del Perú.
    7. Leif Anders Thorsrud, 2016. "Nowcasting using news topics Big Data versus big bank," Working Papers No 6/2016, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.
    8. James Chapman & Ajit Desai, 2021. "Using Payments Data to Nowcast Macroeconomic Variables During the Onset of COVID-19," Staff Working Papers 21-2, Bank of Canada.
    9. Degiannakis, Stavros & Filis, George, 2017. "Forecasting oil price realized volatility using information channels from other asset classes," Journal of International Money and Finance, Elsevier, vol. 76(C), pages 28-49.
    10. Coble, David & Pincheira, Pablo, 2017. "Nowcasting Building Permits with Google Trends," MPRA Paper 76514, University Library of Munich, Germany.
    11. Philip ME Garboden, 2019. "Sources and Types of Big Data for Macroeconomic Forecasting," Working Papers 2019-3, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    12. James T. E. Chapman & Ajit Desai, 2023. "Macroeconomic Predictions Using Payments Data and Machine Learning," Forecasting, MDPI, vol. 5(4), pages 1-32, November.
    13. Qadan, Mahmoud & Nama, Hazar, 2018. "Investor sentiment and the price of oil," Energy Economics, Elsevier, vol. 69(C), pages 42-58.
    14. Benedikt Maas, 2020. "Short‐term forecasting of the US unemployment rate," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(3), pages 394-411, April.
    15. Jean-Charles Bricongne & Baptiste Meunier & Raquel Caldeira, 2024. "Should Central Banks Care About Text Mining? A Literature Review," Working papers 950, Banque de France.
    16. D'Ecclesia, Rita L. & Magrini, Emiliano & Montalbano, Pierluigi & Triulzi, Umberto, 2014. "Understanding recent oil price dynamics: A novel empirical approach," Energy Economics, Elsevier, vol. 46(S1), pages 11-17.
    17. Manel Hamdi & Chaker Aloui, 2015. "Forecasting Crude Oil Price Using Artificial Neural Networks: A Literature Survey," Economics Bulletin, AccessEcon, vol. 35(2), pages 1339-1359.
    18. Meng, Fanyi & Liu, Li, 2019. "Analyzing the economic sources of oil price volatility: An out-of-sample perspective," Energy, Elsevier, vol. 177(C), pages 476-486.
    19. Ferrari, Davide & Ravazzolo, Francesco & Vespignani, Joaquin, 2021. "Forecasting energy commodity prices: A large global dataset sparse approach," Energy Economics, Elsevier, vol. 98(C).
    20. Matteo Barigozzi & Matteo Luciani, 2019. "Quasi Maximum Likelihood Estimation and Inference of Large Approximate Dynamic Factor Models via the EM algorithm," Papers 1910.03821, arXiv.org, revised Sep 2024.
