IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v126y2021i1d10.1007_s11192-020-03744-7.html
   My bibliography  Save this article

Understanding the temporal evolution of COVID-19 research through machine learning and natural language processing

Author

Listed:
  • Ashkan Ebadi

    (National Research Council Canada
    Concordia University)

  • Pengcheng Xi

    (National Research Council Canada)

  • Stéphane Tremblay

    (National Research Council Canada)

  • Bruce Spencer

    (National Research Council Canada
    University of New Brunswick)

  • Raman Pall

    (National Research Council Canada)

  • Alexander Wong

    (University of Waterloo
    Waterloo Artificial Intelligence Institute)

Abstract

The outbreak of the novel coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been continuously affecting human lives and communities around the world in many ways, from cities under lockdown to new social experiences. Although in most cases COVID-19 results in mild illness, it has drawn global attention due to the extremely contagious nature of SARS-CoV-2. Governments and healthcare professionals, along with people and society as a whole, have taken any measures to break the chain of transition and flatten the epidemic curve. In this study, we used multiple data sources, i.e., PubMed and ArXiv, and built several machine learning models to characterize the landscape of current COVID-19 research by identifying the latent topics and analyzing the temporal evolution of the extracted research themes, publications similarity, and sentiments, within the time-frame of January–May 2020. Our findings confirm the types of research available in PubMed and ArXiv differ significantly, with the former exhibiting greater diversity in terms of COVID-19 related issues and the latter focusing more on intelligent systems/tools to predict/diagnose COVID-19. The special attention of the research community to the high-risk groups and people with complications was also confirmed.

Suggested Citation

  • Ashkan Ebadi & Pengcheng Xi & Stéphane Tremblay & Bruce Spencer & Raman Pall & Alexander Wong, 2021. "Understanding the temporal evolution of COVID-19 research through machine learning and natural language processing," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 725-739, January.
  • Handle: RePEc:spr:scient:v:126:y:2021:i:1:d:10.1007_s11192-020-03744-7
    DOI: 10.1007/s11192-020-03744-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-020-03744-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-020-03744-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lucas, Christopher & Nielsen, Richard A. & Roberts, Margaret E. & Stewart, Brandon M. & Storer, Alex & Tingley, Dustin, 2015. "Computer-Assisted Text Analysis for Comparative Politics," Political Analysis, Cambridge University Press, vol. 23(2), pages 254-277, April.
    2. Margaret Roberts & Brandon Stewart & Tingley, Dustin, 2014. "stm: R Package for Structural Topic Models," Working Paper 176291, Harvard University OpenScholar.
    3. Ebadi, Ashkan & Tremblay, Stéphane & Goutte, Cyril & Schiffauerova, Andrea, 2020. "Application of machine learning techniques to assess the trends and alignment of the funded research output," Journal of Informetrics, Elsevier, vol. 14(2).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Seyyed Reza Taher Harikandeh & Sadegh Aliakbary & Soroush Taheri, 2023. "An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomains," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(3), pages 1567-1582, March.
    2. Thavavel Vaiyapuri & Sharath Kumar Jagannathan & Mohammed Altaf Ahmed & K. C. Ramya & Gyanendra Prasad Joshi & Soojeong Lee & Gangseong Lee, 2023. "Sustainable Artificial Intelligence-Based Twitter Sentiment Analysis on COVID-19 Pandemic," Sustainability, MDPI, vol. 15(8), pages 1-15, April.
    3. W. Benedikt Schmal, 2024. "Academic Knowledge: Does it Reflect the Combinatorial Growth of Technology?," Papers 2409.20282, arXiv.org.
    4. Wadim Strielkowski & Svetlana Zenchenko & Anna Tarasova & Yana Radyukova, 2022. "Management of Smart and Sustainable Cities in the Post-COVID-19 Era: Lessons and Implications," Sustainability, MDPI, vol. 14(12), pages 1-17, June.
    5. Breno Santana Santos & Ivanovitch Silva & Luciana Lima & Patricia Takako Endo & Gisliany Alves & Marcel da Câmara Ribeiro-Dantas, 2022. "Discovering temporal scientometric knowledge in COVID-19 scholarly production," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1609-1642, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Endre Tvinnereim & Xiaozi Liu & Eric M. Jamelske, 2017. "Public perceptions of air pollution and climate change: different manifestations, similar causes, and concerns," Climatic Change, Springer, vol. 140(3), pages 399-412, February.
    2. Ebadi, Ashkan & Tremblay, Stéphane & Goutte, Cyril & Schiffauerova, Andrea, 2020. "Application of machine learning techniques to assess the trends and alignment of the funded research output," Journal of Informetrics, Elsevier, vol. 14(2).
    3. Dean Neu & Gregory D. Saxton & Abu S. Rahaman, 2022. "Social Accountability, Ethics, and the Occupy Wall Street Protests," Journal of Business Ethics, Springer, vol. 180(1), pages 17-31, September.
    4. Dehler-Holland, Joris & Schumacher, Kira & Fichtner, Wolf, 2021. "Topic Modeling Uncovers Shifts in Media Framing of the German Renewable Energy Act," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 2(1).
    5. Maschke, Andreas, 2024. "Talking exports: The representation of Germany's current account in newspaper media," MPIfG Discussion Paper 24/1, Max Planck Institute for the Study of Societies.
    6. Fernandez Martinez, Roberto & Lostado Lorza, Ruben & Santos Delgado, Ana Alexandra & Piedra, Nelson, 2021. "Use of classification trees and rule-based models to optimize the funding assignment to research projects: A case study of UTPL," Journal of Informetrics, Elsevier, vol. 15(1).
    7. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    8. Florian Rabitz & Alin Olteanu & Jurgita Jurkevičienė & Agnė Budžytė, 2021. "A topic network analysis of the system turn in the environmental sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 2107-2140, March.
    9. Mujtaba Ali Isani, 2021. "Methodological Problems of Using Arabic-Language Twitter as a Gauge for Arab Attitudes Toward Politics and Society," Contemporary Review of the Middle East, , vol. 8(1), pages 22-35, March.
    10. Ferrara, Federico M. & Masciandaro, Donato & Moschella, Manuela & Romelli, Davide, 2022. "Political voice on monetary policy: Evidence from the parliamentary hearings of the European Central Bank," European Journal of Political Economy, Elsevier, vol. 74(C).
    11. Max Nathan & Anna Rosso, 2017. "Innovative events," Development Working Papers 429, Centro Studi Luca d'Agliano, University of Milano, revised 08 Apr 2019.
    12. Dybowski, T.P. & Adämmer, P., 2018. "The economic effects of U.S. presidential tax communication: Evidence from a correlated topic model," European Journal of Political Economy, Elsevier, vol. 55(C), pages 511-525.
    13. Anselm Hager & Hanno Hilbig, 2020. "Does Public Opinion Affect Political Speech?," American Journal of Political Science, John Wiley & Sons, vol. 64(4), pages 921-937, October.
    14. Andreas Rehs, 2020. "A structural topic model approach to scientific reorientation of economics and chemistry after German reunification," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1229-1251, November.
    15. Dehler-Holland, Joris & Okoh, Marvin & Keles, Dogan, 2022. "Assessing technology legitimacy with topic models and sentiment analysis – The case of wind power in Germany," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    16. Carlos Mendez & Fernando Mendez & Vasiliki Triga & Juan Miguel Carrascosa, 2020. "EU Cohesion Policy under the Media Spotlight: Exploring Territorial and Temporal Patterns in News Coverage and Tone," Journal of Common Market Studies, Wiley Blackwell, vol. 58(4), pages 1034-1055, July.
    17. Bai, Xiwen & Zhang, Xiunian & Li, Kevin X. & Zhou, Yaoming & Yuen, Kum Fai, 2021. "Research topics and trends in the maritime transport: A structural topic model," Transport Policy, Elsevier, vol. 102(C), pages 11-24.
    18. Weiss, Daniel & Nemeczek, Fabian, 2021. "A text-based monitoring tool for the legitimacy and guidance of technological innovation systems," Technology in Society, Elsevier, vol. 66(C).
    19. Ulrich Fritsche & Johannes Puckelwald, 2018. "Deciphering Professional Forecasters’ Stories - Analyzing a Corpus of Textual Predictions for the German Economy," Macroeconomics and Finance Series 201804, University of Hamburg, Department of Socioeconomics.
    20. David Ardia & Keven Bluteau & Kris Boudt & Koen Inghelbrecht, 2020. "Climate change concerns and the performance of green versus brown stocks," Working Paper Research 395, National Bank of Belgium.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:126:y:2021:i:1:d:10.1007_s11192-020-03744-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.