Exploiting Financial News and Social Media Opinions for Stock Market Analysis using MCMC Bayesian Inference

Authors

  • Manolis Maragoudakis (University of the Aegean)
  • Dimitrios Serpanos (Qatar Computing Research Institute (QCRI))

Abstract

Stock market analysis using Information and Communication Technology methods is a dynamic and volatile domain. In recent years, the development of modeling tools has received increasing attention, especially where the expected outcomes promise significant profits for investors’ portfolios. In today's globalized economy, the available resources are growing steadily more plentiful and are therefore difficult to analyze with standard statistical tools. Thus far, a number of research papers have relied solely on past data from stock and bond prices and other technical indicators. In recent studies, however, prediction also draws on textual information, under the reasonable assumption that the course of a stock price can also be affected by news articles and perhaps by public opinions posted on various Web 2.0 platforms. Despite recent advances in Natural Language Processing and Data Mining, many mining algorithms face significant difficulties as data grow in both the number of records and the number of attributes, resulting in poor forecasting ability. This study proposes a potential answer to the problem: a Markov Chain Monte Carlo (MCMC) Bayesian Inference approach that estimates conditional probability distributions in structures obtained from a Tree-Augmented Naïve Bayes (TAN) algorithm. The novelty of the study rests on the observation that technical analysis captures the event but not the cause of the change, whereas textual data may explain that cause. The paper considers a large number of technical indices together with features extracted by a text mining methodology from financial news articles and from opinions posted on different social media platforms. Previous research has demonstrated that, owing to the high dimensionality and sparseness of such data, most widely used Data Mining algorithms suffer from either convergence or accuracy problems. Results from the experimental phase, including a virtual trading experiment, are promising. Because it is tedious for a human investor to read all the daily news and other financial information concerning a company, a prediction system that can analyze such textual resources and relate them to price movements at future time frames is valuable.
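
To make the phrase "MCMC estimation of conditional probability distributions" more concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a single binary node (price moves up or not) under one fixed parent configuration of a TAN structure, a Beta(1, 1) prior, and a random-walk Metropolis sampler. The function names, the counts (30 "up" days out of 40) and the tuning values are all hypothetical.

```python
import math
import random


def log_posterior(theta, ups, total, a=1.0, b=1.0):
    """Unnormalised log posterior of theta = P(up | one parent configuration),
    assuming a Beta(a, b) prior and a binomial likelihood (illustrative choice)."""
    if not 0.0 < theta < 1.0:
        return -math.inf
    return ((ups + a - 1.0) * math.log(theta)
            + (total - ups + b - 1.0) * math.log(1.0 - theta))


def metropolis_cpt_entry(ups, total, n_samples=20000, burn_in=2000,
                         step=0.1, seed=0):
    """Random-walk Metropolis sampler for one conditional probability entry."""
    rng = random.Random(seed)
    theta = 0.5
    current = log_posterior(theta, ups, total)
    samples = []
    for i in range(n_samples + burn_in):
        proposal = theta + rng.gauss(0.0, step)
        candidate = log_posterior(proposal, ups, total)
        # Symmetric proposal: accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, candidate - current)):
            theta, current = proposal, candidate
        if i >= burn_in:
            samples.append(theta)
    return samples


# Hypothetical counts: 40 days matched this parent configuration, 30 moved up.
samples = metropolis_cpt_entry(ups=30, total=40)
posterior_mean = sum(samples) / len(samples)
```

In this toy setting the posterior is known in closed form (Beta(31, 11), mean 31/42), so the sampled mean can be checked against it. Sampling becomes useful when the distributions in the network are not available analytically.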

Suggested Citation

  • Manolis Maragoudakis & Dimitrios Serpanos, 2016. "Exploiting Financial News and Social Media Opinions for Stock Market Analysis using MCMC Bayesian Inference," Computational Economics, Springer;Society for Computational Economics, vol. 47(4), pages 589-622, April.
  • Handle: RePEc:kap:compec:v:47:y:2016:i:4:d:10.1007_s10614-015-9492-9
    DOI: 10.1007/s10614-015-9492-9

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-015-9492-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    Citations

    Cited by:

    1. Tom Marty & Bruce Vanstone & Tobias Hahn, 2020. "News media analytics in finance: a survey," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(2), pages 1385-1434, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dhanya Jothimani & Ravi Shankar & Surendra S. Yadav, 2016. "Discrete Wavelet Transform-Based Prediction of Stock Index: A Study on National Stock Exchange Fifty Index," Papers 1605.07278, arXiv.org.
    2. Jasleen Kaur & Khushdeep Dharni, 2022. "Application and performance of data mining techniques in stock market: A review," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(4), pages 219-241, October.
    3. Matheus José Silva de Souza & Danilo Guimarães Franco Ramos & Marina Garcia Pena & Vinicius Amorim Sobreiro & Herbert Kimura, 2018. "Examination of the profitability of technical analysis based on moving average strategies in BRICS," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 4(1), pages 1-18, December.
    4. Fernando Rubio, 2005. "Eficiencia De Mercado, Administracion De Carteras De Fondos Y Behavioural Finance," Finance 0503028, University Library of Munich, Germany, revised 23 Jul 2005.
    5. Qing Zhou & Robert Faff, 2017. "The complementary role of cross-sectional and time-series information in forecasting stock returns," Australian Journal of Management, Australian School of Business, vol. 42(1), pages 113-139, February.
    6. Alagidede, Paul & Panagiotidis, Theodore, 2009. "Modelling stock returns in Africa's emerging equity markets," International Review of Financial Analysis, Elsevier, vol. 18(1-2), pages 1-11, March.
    7. Choi, Gahyun & Park, Kwangyeol & Yi, Eojin & Ahn, Kwangwon, 2023. "Price fairness: Clean energy stocks and the overall market," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
    8. Neely, Christopher J. & Weller, Paul, 2000. "Predictability in International Asset Returns: A Reexamination," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 35(4), pages 601-620, December.
    9. Lo, Andrew W & MacKinlay, A Craig, 1990. "When Are Contrarian Profits Due to Stock Market Overreaction?," The Review of Financial Studies, Society for Financial Studies, vol. 3(2), pages 175-205.
    10. Eero Pätäri & Timo Leivo, 2017. "A Closer Look At Value Premium: Literature Review And Synthesis," Journal of Economic Surveys, Wiley Blackwell, vol. 31(1), pages 79-168, February.
    11. Cornelis A. Los, 2004. "Nonparametric Efficiency Testing of Asian Stock Markets Using Weekly Data," Finance 0409033, University Library of Munich, Germany.
    12. Jitka Veselá & Alžběta Zíková, 2022. "Are the Czech, Polish, German and Dutch markets taking a random walk? [Konají český, polský, německý a nizozemský trh náhodnou procházku?]," Český finanční a účetní časopis, Prague University of Economics and Business, vol. 2022(2), pages 19-38.
    13. Immonen, Eero, 2015. "A quantitative description for efficient financial markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 433(C), pages 171-181.
    14. Jiang, Yonghong & Nie, He & Ruan, Weihua, 2018. "Time-varying long-term memory in Bitcoin market," Finance Research Letters, Elsevier, vol. 25(C), pages 280-284.
    15. Eli Kraizberg & Mitchell Kellman, 1999. "The U-Shape Autocorrelation Pattern in International Stock Markets," The American Economist, Sage Publications, vol. 43(2), pages 36-48, October.
    16. Bariviera, Aurelio F. & Font-Ferrer, Alejandro & Sorrosal-Forradellas, M. Teresa & Rosso, Osvaldo A., 2019. "An information theory perspective on the informational efficiency of gold price," The North American Journal of Economics and Finance, Elsevier, vol. 50(C).
    17. Zhong, Meirui & Zhang, Rui & Ren, Xiaohang, 2023. "The time-varying effects of liquidity and market efficiency of the European Union carbon market: Evidence from the TVP-SVAR-SV approach," Energy Economics, Elsevier, vol. 123(C).
    18. Cristi Spulbar & Ramona Birau & Lucian Florin Spulbar, 2021. "A Critical Survey on Efficient Market Hypothesis (EMH), Adaptive Market Hypothesis (AMH) and Fractal Markets Hypothesis (FMH) Considering Their Implication on Stock Markets Behavior," Ovidius University Annals, Economic Sciences Series, Ovidius University of Constantza, Faculty of Economic Sciences, vol. 0(2), pages 1161-1165, December.
    19. Chen, Yong & Kelly, Bryan & Wu, Wei, 2020. "Sophisticated investors and market efficiency: Evidence from a natural experiment," Journal of Financial Economics, Elsevier, vol. 138(2), pages 316-341.
    20. Benjamin Miranda Tabak, 2003. "The random walk hypothesis and the behaviour of foreign capital portfolio flows: the Brazilian stock market case," Applied Financial Economics, Taylor & Francis Journals, vol. 13(5), pages 369-378.
