Printed from https://ideas.repec.org/p/hal/journl/emse-03953759.html

Text mining methodologies with R: An application to central bank texts

Author

Listed:
  • Jonathan Benchimol

    (BoI - Bank of Israel)

  • Sophia Kazinnik

    (Federal Reserve Bank of Richmond)

  • Yossi Saadon

    (BoI - Bank of Israel)

Abstract

We review several existing text analysis methodologies and explain their formal application processes using the open-source software R and relevant packages. Several text mining applications to analyze central bank texts are presented.

Suggested Citation

  • Jonathan Benchimol & Sophia Kazinnik & Yossi Saadon, 2022. "Text mining methodologies with R: An application to central bank texts," Post-Print emse-03953759, HAL.
  • Handle: RePEc:hal:journl:emse-03953759
    DOI: 10.1016/j.mlwa.2022.100286

    References listed on IDEAS

    1. Jonathan Benchimol & Sophia Kazinnik & Yossi Saadon, 2021. "Federal Reserve Communication and the COVID-19 Pandemic," Bank of Israel Working Papers 2021.15, Bank of Israel.
    2. Hansen, Stephen & McMahon, Michael & Tong, Matthew, 2019. "The long-run information effect of central bank communication," Journal of Monetary Economics, Elsevier, vol. 108(C), pages 185-202.
    3. Hansen, Stephen & McMahon, Michael, 2016. "Shocking language: Understanding the macroeconomic effects of central bank communication," Journal of International Economics, Elsevier, vol. 99(S1), pages 114-133.
    4. David Bholat & Stephen Hansen & Pedro Santos & Cheryl Schonhardt-Bailey, 2015. "Text mining for central banks," Handbooks, Centre for Central Banking Studies, Bank of England, number 33, April.
    5. Saskia Ter Ellen & Vegard H. Larsen & Leif Anders Thorsrud, 2022. "Narrative Monetary Policy Surprises and the Media," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 54(5), pages 1525-1549, August.
    6. Feinerer, Ingo & Hornik, Kurt & Meyer, David, 2008. "Text Mining Infrastructure in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 25(i05).
    7. Michael Ehrmann & Marcel Fratzscher, 2007. "Communication by Central Bank Committee Members: Different Strategies, Same Effectiveness?," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 39(2‐3), pages 509-541, March.
    8. Laver, Michael & Benoit, Kenneth & Garry, John, 2003. "Extracting Policy Positions from Political Texts Using Words as Data," American Political Science Review, Cambridge University Press, vol. 97(2), pages 311-331, May.
    9. Tim Loughran & Bill McDonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    10. Bholat, David & Broughton, Nida & Ter Meer, Janna & Walczak, Eryk, 2019. "Enhancing central bank communications using simple and relatable information," Journal of Monetary Economics, Elsevier, vol. 108(C), pages 1-15.
    11. Scott Deerwester & Susan T. Dumais & George W. Furnas & Thomas K. Landauer & Richard Harshman, 1990. "Indexing by latent semantic analysis," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 41(6), pages 391-407, September.

    Citations

    Cited by:

    1. Dimitrios Kanelis & Pierre L. Siklos, 2022. "Emotion in Euro Area Monetary Policy Communication and Bond Yields: The Draghi Era," CQE Working Papers 10322, Center for Quantitative Economics (CQE), University of Muenster.
    2. Kurowski, Łukasz & Smaga, Paweł, 2023. "Analysing financial stability reports as crisis predictors with the use of text-mining," The Journal of Economic Asymmetries, Elsevier, vol. 28(C).
    3. Martina Dattilo & Fabio Padovano, 2023. "Evaluating the quality of UNESCO World Heritage List: a comparison with the Baedeker's guidebooks," Post-Print hal-04388046, HAL.
    4. Vyshnevskyi, Iegor & Jombo, Wytone & Sohn, Wook, 2024. "The clarity of monetary policy communication and financial market volatility in developing economies," Emerging Markets Review, Elsevier, vol. 59(C).
    5. Bogner Alexandra & Jerger Jürgen, 2023. "Big data in monetary policy analysis—a critical assessment," Economics and Business Review, Sciendo, vol. 9(2), pages 27-40, April.
    6. Binbin Yang & Sang-Do Park, 2023. "Who Drives Carbon Neutrality in China? Text Mining and Network Analysis," Sustainability, MDPI, vol. 15(6), pages 1-24, March.
    7. Yuqian Zhang, 2023. "Using Google Trends to track the global interest in International Financial Reporting Standards: Evidence from big data," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 30(2), pages 87-100, April.
    8. Curti, Filippo & Kazinnik, Sophia, 2023. "Central bank communication and website characteristics," Journal of Economic Behavior & Organization, Elsevier, vol. 212(C), pages 1216-1241.
    9. Erkan Işığıçok & Sadullah Çelik & Dilek Özdemir Yılmaz, 2023. "Analysis of Skills and Qualifications Required in Data Scientist Job Postings Based on the Pareto Analysis Perspective Using Text Mining," EKOIST Journal of Econometrics and Statistics, Istanbul University, Faculty of Economics, vol. 0(39), pages 10-25, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Baumgaertner & Johannes Zahner, 2021. "Whatever it takes to understand a central banker - Embedding their words using neural networks," MAGKS Papers on Economics 202130, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    2. Nicolò Fraccaroli & Alessandro Giovannini & Jean-François Jamet & Eric Persson, 2023. "Central Banks in Parliaments: A Text Analysis of the Parliamentary Hearings of the Bank of England, the European Central Bank, and the Federal Reserve," International Journal of Central Banking, International Journal of Central Banking, vol. 19(2), pages 543-600, June.
    3. Jonne Lehtimäki & Marianne Palmu, 2022. "Who Should You Listen to in a Crisis? Differences in Communication of Central Bank Policymakers," Journal of Central Banking Theory and Practice, Central bank of Montenegro, vol. 11(3), pages 33-57.
    4. Curti, Filippo & Kazinnik, Sophia, 2023. "Central bank communication and website characteristics," Journal of Economic Behavior & Organization, Elsevier, vol. 212(C), pages 1216-1241.
    5. Baranowski, Paweł & Doryń, Wirginia & Łyziak, Tomasz & Stanisławska, Ewa, 2021. "Words and deeds in managing expectations: Empirical evidence from an inflation targeting economy," Economic Modelling, Elsevier, vol. 95(C), pages 49-67.
    6. Parle, Conor, 2022. "The financial market impact of ECB monetary policy press conferences — A text based approach," European Journal of Political Economy, Elsevier, vol. 74(C).
    7. Giacomo Caterini, 2020. "La comunicazione della Banca Centrale dei Caraibi Orientali: un'analisi testuale (On the communication of the Eastern Caribbeans Central Bank: A textual analysis)," Moneta e Credito, Economia civile, vol. 73(289), pages 57-82.
    8. Michael Ehrmann & Sarah Holton & Danielle Kedan & Gillian Phelan, 2024. "Monetary Policy Communication: Perspectives from Former Policymakers at the ECB," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 56(4), pages 837-864, June.
    9. Schmeling, Maik & Wagner, Christian, 2019. "Does Central Bank Tone Move Asset Prices?," CEPR Discussion Papers 13490, C.E.P.R. Discussion Papers.
    10. Miguel Acosta, 2015. "FOMC Responses to Calls for Transparency," Finance and Economics Discussion Series 2015-60, Board of Governors of the Federal Reserve System (U.S.).
    11. Kawamura, Kohei & Kobashi, Yohei & Shizume, Masato & Ueda, Kozo, 2019. "Strategic central bank communication: Discourse analysis of the Bank of Japan’s Monthly Report," Journal of Economic Dynamics and Control, Elsevier, vol. 100(C), pages 230-250.
    12. Munday, Tim & Brookes, James, 2021. "Mark my words: the transmission of central bank communication to the general public via the print media," Bank of England working papers 944, Bank of England.
    13. repec:hal:spmain:info:hdl:2441/3mgbd73vkp9f9oje7utooe7vpg is not listed on IDEAS
    14. Leif Anders Thorsrud, 2016. "Nowcasting using news topics Big Data versus big bank," Working Papers No 6/2016, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.
    15. Paul Hubert & Fabien Labondance, 2016. "Central Bank Sentiment and Policy Expectations," Working Papers hal-03459227, HAL.
    16. Christopher S. Sutherland, 2020. "Forward Guidance and Expectation Formation: A Narrative Approach," Staff Working Papers 20-40, Bank of Canada.
    17. Necmettin Alpay Koçak, 2020. "The Role of ECB Speeches in Nowcasting German GDP," European Financial and Accounting Journal, Prague University of Economics and Business, vol. 2020(2), pages 05-20.
    18. Youngjoon Lee & Soohyon Kim & Ki Young Park, 2018. "Deciphering Monetary Policy Committee Minutes with Text Mining Approach: A Case of South Korea," Working papers 2018rwp-132, Yonsei University, Yonsei Economics Research Institute.
    19. Antón Sarabia Arturo & Bazdresch Santiago & Lelo-de-Larrea Alejandra, 2023. "The Influence of Central Bank's Projections and Economic Narrative on Professional Forecasters' Expectations: Evidence from Mexico," Working Papers 2023-21, Banco de México.
    20. Paul Hubert & Fabien Labondance, 2019. "Central bank tone and the dispersion of views within monetary policy committees," SciencePo Working papers Main hal-03403256, HAL.
    21. Leonardo N. Ferreira, 2021. "Forecasting with VAR-teXt and DFM-teXt Models: exploring the predictive power of central bank communication," Working Papers Series 559, Central Bank of Brazil, Research Department.
