IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2407.11765.html
   My bibliography  Save this paper

Nowcasting R&D Expenditures: A Machine Learning Approach

Author

Listed:
  • Atin Aboutorabi
  • Ga'etan de Rassenfosse

Abstract

Macroeconomic data are crucial for monitoring countries' performance and driving policy. However, traditional data acquisition processes are slow, subject to delays, and performed at a low frequency. We address this 'ragged-edge' problem with a two-step framework. The first step is a supervised learning model predicting observed low-frequency figures. We propose a neural-network-based nowcasting model that exploits mixed-frequency, high-dimensional data. The second step uses the elasticities derived from the previous step to interpolate unobserved high-frequency figures. We apply our method to nowcast countries' yearly research and development (R&D) expenditure series. These series are collected through infrequent surveys, making them ideal candidates for this task. We exploit a range of predictors, chiefly Internet search volume data, and document the relevance of these data in improving out-of-sample predictions. Furthermore, we leverage the high frequency of our data to derive monthly estimates of R&D expenditures, which are currently unobserved. We compare our results with those obtained from the classical regression-based and the sparse temporal disaggregation methods. Finally, we validate our results by reporting a strong correlation with monthly R&D employment data.

Suggested Citation

  • Atin Aboutorabi & Ga'etan de Rassenfosse, 2024. "Nowcasting R&D Expenditures: A Machine Learning Approach," Papers 2407.11765, arXiv.org.
  • Handle: RePEc:arx:papers:2407.11765
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2407.11765
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zhaoxin Dai & Yunfeng Hu & Guanhua Zhao, 2017. "The Suitability of Different Nighttime Light Data for GDP Estimation at Different Spatial Scales and Regional Levels," Sustainability, MDPI, vol. 9(2), pages 1-15, February.
    2. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    3. Laurent Ferrara & Anna Simoni, 2023. "When are Google Data Useful to Nowcast GDP? An Approach via Preselection and Shrinkage," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(4), pages 1188-1202, October.
    4. Giannone, Domenico & Reichlin, Lucrezia & Small, David, 2008. "Nowcasting: The real-time informational content of macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 55(4), pages 665-676, May.
    5. Nick Bloom, 2007. "Uncertainty and the Dynamics of R&D," American Economic Review, American Economic Association, vol. 97(2), pages 250-255, May.
    6. Alberto Cavallo & Roberto Rigobon, 2016. "The Billion Prices Project: Using Online Prices for Measurement and Research," Journal of Economic Perspectives, American Economic Association, vol. 30(2), pages 151-178, Spring.
    7. Nicolas Woloszko, 2020. "Tracking activity in real time with Google Trends," OECD Economics Department Working Papers 1634, OECD Publishing.
    8. Bangwen Cheng & Rong He & Hongjin Yang & Jun Yang, 2005. "Quantitative method and model for forecasting R&D expenditures in China," Research Evaluation, Oxford University Press, vol. 14(1), pages 51-56, April.
    9. Diebold, Francis X. & Göbel, Maximilian & Goulet Coulombe, Philippe & Rudebusch, Glenn D. & Zhang, Boyuan, 2021. "Optimal combination of Arctic sea ice extent measures: A dynamic factor modeling approach," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1509-1519.
    10. Jakob Edler & Jan Fagerberg, 2017. "Innovation policy: what, why, and how," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 33(1), pages 2-23.
    11. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.
    12. Martin D. D. Evans, 2005. "Where Are We Now? Real-Time Estimates of the Macroeconomy," International Journal of Central Banking, International Journal of Central Banking, vol. 1(2), September.
    13. Romer, Paul M, 1990. "Endogenous Technological Change," Journal of Political Economy, University of Chicago Press, vol. 98(5), pages 71-102, October.
    14. Luke Mosley & Idris Eckley & Alex Gibberd, 2021. "Sparse Temporal Disaggregation," Papers 2108.05783, arXiv.org, revised Oct 2022.
    15. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    16. J. Vernon Henderson & Adam Storeygard & David N. Weil, 2012. "Measuring Economic Growth from Outer Space," American Economic Review, American Economic Association, vol. 102(2), pages 994-1028, April.
    17. Luke Mosley & Idris A. Eckley & Alex Gibberd, 2022. "Sparse temporal disaggregation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2203-2233, October.
    18. Daniel Borup & Erik Christian Montes Schütte, 2022. "In Search of a Job: Forecasting Employment Growth Using Google Trends," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 186-200, January.
    19. Ghysels, Eric & Santa-Clara, Pedro & Valkanov, Rossen, 2004. "The MIDAS Touch: Mixed Data Sampling Regression Models," University of California at Los Angeles, Anderson Graduate School of Management qt9mf223rs, Anderson Graduate School of Management, UCLA.
    20. Hyunyoung Choi & Hal Varian, 2012. "Predicting the Present with Google Trends," The Economic Record, The Economic Society of Australia, vol. 88(s1), pages 2-9, June.
    21. Sax, Christoph & Steiner, Peter, 2013. "Temporal Disaggregation of Time Series," MPRA Paper 53389, University Library of Munich, Germany.
    22. Claudia Foroni & Massimiliano Marcellino & Christian Schumacher, 2015. "Unrestricted mixed data sampling (MIDAS): MIDAS regressions with unrestricted lag polynomials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(1), pages 57-82, January.
    23. Borup, Daniel & Rapach, David E. & Schütte, Erik Christian Montes, 2023. "Mixed-frequency machine learning: Nowcasting and backcasting weekly initial claims with daily internet search volume data," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1122-1144.
    24. Paul J. J. Welfens, 2008. "ICT – productivity and economic growth in Europe," Springer Books, in: Paul J. J. Welfens & Ellen Walther-Klaus (ed.), Digital Excellence, pages 13-39, Springer.
    25. Marcelo C. Medeiros & Henrique F. Pires, 2021. "The Proper Use of Google Trends in Forecasting Models," Papers 2104.03065, arXiv.org, revised Apr 2021.
    26. Reichlin, Lucrezia & Giannone, Domenico & Small, David, 2005. "Nowcasting GDP and Inflation: The Real Time Informational Content of Macroeconomic Data Releases," CEPR Discussion Papers 5178, C.E.P.R. Discussion Papers.
    27. de Rassenfosse, Gaétan & Jaffe, Adam B., 2018. "Econometric evidence on the depreciation of innovations," European Economic Review, Elsevier, vol. 101(C), pages 625-642.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Laurent Ferrara & Anna Simoni, 2023. "When are Google Data Useful to Nowcast GDP? An Approach via Preselection and Shrinkage," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(4), pages 1188-1202, October.
    2. Bantis, Evripidis & Clements, Michael P. & Urquhart, Andrew, 2023. "Forecasting GDP growth rates in the United States and Brazil using Google Trends," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1909-1924.
    3. George Kapetanios & Fotis Papailias, 2018. "Big Data & Macroeconomic Nowcasting: Methodological Review," Economic Statistics Centre of Excellence (ESCoE) Discussion Papers ESCoE DP-2018-12, Economic Statistics Centre of Excellence (ESCoE).
    4. Sarun Kamolthip, 2021. "Macroeconomic Forecasting with LSTM and Mixed Frequency Time Series Data," PIER Discussion Papers 165, Puey Ungphakorn Institute for Economic Research.
    5. David Kohns & Arnab Bhattacharjee, 2020. "Nowcasting Growth using Google Trends Data: A Bayesian Structural Time Series Model," Papers 2011.00938, arXiv.org, revised May 2022.
    6. Danilo Cascaldi-Garcia & Matteo Luciani & Michele Modugno, 2023. "Lessons from Nowcasting GDP across the World," International Finance Discussion Papers 1385, Board of Governors of the Federal Reserve System (U.S.).
    7. David Kohns & Arnab Bhattacharjee, 2019. "Interpreting Big Data in the Macro Economy: A Bayesian Mixed Frequency Estimator," CEERP Working Paper Series 010, Centre for Energy Economics Research and Policy, Heriot-Watt University.
    8. Zheng, Tingguo & Fan, Xinyue & Jin, Wei & Fang, Kuangnan, 2024. "Words or numbers? Macroeconomic nowcasting with textual and macroeconomic data," International Journal of Forecasting, Elsevier, vol. 40(2), pages 746-761.
    9. Jeffrey C. Chen & Abe Dunn & Kyle Hood & Alexander Driessen & Andrea Batch, 2019. "Off to the Races: A Comparison of Machine Learning and Alternative Data for Predicting Economic Indicators," NBER Chapters, in: Big Data for Twenty-First-Century Economic Statistics, pages 373-402, National Bureau of Economic Research, Inc.
    10. Knut Are Aastveit & Tuva Marie Fastbø & Eleonora Granziera & Kenneth Sæterhagen Paulsen & Kjersti Næss Torstensen, 2020. "Nowcasting Norwegian household consumption with debit card transaction data," Working Paper 2020/17, Norges Bank.
    11. Kohns, David & Bhattacharjee, Arnab, 2023. "Nowcasting growth using Google Trends data: A Bayesian Structural Time Series model," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1384-1412.
    12. Marina Diakonova & Luis Molina & Hannes Mueller & Javier J. Pérez & Cristopher Rauh, 2022. "The information content of conflict, social unrest and policy uncertainty measures for macroeconomic forecasting," Working Papers 2232, Banco de España.
    13. Claudia Foroni & Massimiliano Marcellino, 2013. "A survey of econometric methods for mixed-frequency data," Economics Working Papers ECO2013/02, European University Institute.
    14. Aruoba, S. BoraÄŸan & Diebold, Francis X. & Scotti, Chiara, 2009. "Real-Time Measurement of Business Conditions," Journal of Business & Economic Statistics, American Statistical Association, vol. 27(4), pages 417-427.
    15. Richard Schnorrenberger & Aishameriane Schmidt & Guilherme Valle Moura, 2024. "Harnessing Machine Learning for Real-Time Inflation Nowcasting," Working Papers 806, DNB.
    16. Chien-jung Ting & Yi-Long Hsiao, 2022. "Nowcasting the GDP in Taiwan and the Real-Time Tourism Data," Advances in Management and Applied Economics, SCIENPRESS Ltd, vol. 12(3), pages 1-2.
    17. Cebrián, Eduardo & Domenech, Josep, 2024. "Addressing Google Trends inconsistencies," Technological Forecasting and Social Change, Elsevier, vol. 202(C).
    18. Peter Fuleky & Carl S. Bonham, 2013. "Forecasting with Mixed Frequency Samples: The Case of Common Trends," Working Papers 201316, University of Hawaii at Manoa, Department of Economics.
    19. Lahiri, Kajal & Monokroussos, George, 2013. "Nowcasting US GDP: The role of ISM business surveys," International Journal of Forecasting, Elsevier, vol. 29(4), pages 644-658.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2407.11765. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.