IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i9p4560-d543278.html
   My bibliography  Save this article

Associations between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States

Author

Listed:
  • Mostafa Abbas

    (Department of Translational Data Science and Informatics, Geisinger, Danville, PA 17822, USA)

  • Thomas B. Morland

    (Department of General Internal Medicine, Geisinger, Danville, PA 17822, USA)

  • Eric S. Hall

    (Department of Translational Data Science and Informatics, Geisinger, Danville, PA 17822, USA)

  • Yasser EL-Manzalawy

    (Department of Translational Data Science and Informatics, Geisinger, Danville, PA 17822, USA)

Abstract

We utilize functional data analysis techniques to investigate patterns of COVID-19 positivity and mortality in the US and their associations with Google search trends for COVID-19-related symptoms. Specifically, we represent state-level time series data for COVID-19 and Google search trends for symptoms as smoothed functional curves. Given these functional data, we explore the modes of variation in the data using functional principal component analysis (FPCA). We also apply functional clustering analysis to identify patterns of COVID-19 confirmed case and death trajectories across the US. Moreover, we quantify the associations between Google COVID-19 search trends for symptoms and COVID-19 confirmed case and death trajectories using dynamic correlation. Finally, we examine the dynamics of correlations for the top nine Google search trends of symptoms commonly associated with COVID-19 confirmed case and death trajectories. Our results reveal and characterize distinct patterns for COVID-19 spread and mortality across the US. The dynamics of these correlations suggest the feasibility of using Google queries to forecast COVID-19 cases and mortality for up to three weeks in advance. Our results and analysis framework set the stage for the development of predictive models for forecasting COVID-19 confirmed cases and deaths using historical data and Google search trends for nine symptoms associated with both outcomes.

Suggested Citation

  • Mostafa Abbas & Thomas B. Morland & Eric S. Hall & Yasser EL-Manzalawy, 2021. "Associations between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States," IJERPH, MDPI, vol. 18(9), pages 1-24, April.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:9:p:4560-:d:543278
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/9/4560/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/9/4560/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Dubin, Joel A. & Muller, Hans-Georg, 2005. "Dynamical Correlation for Multivariate Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 872-881, September.
    2. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    3. Sasikiran Kandula & Jeffrey Shaman, 2019. "Reappraising the utility of Google Flu Trends," PLOS Computational Biology, Public Library of Science, vol. 15(8), pages 1-16, August.
    4. Heidi Ledford, 2020. "Why do COVID death rates seem to be falling?," Nature, Nature, vol. 587(7833), pages 190-192, November.
    5. Vanja Dukic & Hedibert F. Lopes & Nicholas G. Polson, 2012. "Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1410-1426, December.
    6. Aucejo, Esteban M. & French, Jacob & Ugalde Araya, Maria Paola & Zafar, Basit, 2020. "The impact of COVID-19 on student experiences and expectations: Evidence from a survey," Journal of Public Economics, Elsevier, vol. 191(C).
    7. Nicola Scafetta, 2020. "Distribution of the SARS-CoV-2 Pandemic and Its Monthly Forecast Based on Seasonal Climate Patterns," IJERPH, MDPI, vol. 17(10), pages 1-34, May.
    8. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Min Du & Chenyuan Qin & Wenxin Yan & Qiao Liu & Yaping Wang & Lin Zhu & Wannian Liang & Min Liu & Jue Liu, 2023. "Trends in Online Search Activity and the Correlation with Daily New Cases of Monkeypox among 102 Countries or Territories," IJERPH, MDPI, vol. 20(4), pages 1-13, February.
    2. Tobias Saegner & Donatas Austys, 2022. "Forecasting and Surveillance of COVID-19 Spread Using Google Trends: Literature Review," IJERPH, MDPI, vol. 19(19), pages 1-18, September.
    3. Maria da Penha de Andrade Abi Harb & Lena Veiga e Silva & Nandamudi Lankalapalli Vijaykumar & Marcelino Silva da Silva & Carlos Renato Lisboa Francês, 2022. "An Analysis of the Deleterious Impact of the Infodemic during the COVID-19 Pandemic in Brazil: A Case Study Considering Possible Correlations with Socioeconomic Aspects of Brazilian Demography," IJERPH, MDPI, vol. 19(6), pages 1-19, March.
    4. Michael Olumekor & Hossam Haddad & Nidal Mahmoud Al-Ramahi, 2023. "The Relationship between Search Engines and Entrepreneurship Development: A Granger-VECM Approach," Sustainability, MDPI, vol. 15(6), pages 1-16, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Klaus Ackermann & Simon D Angus & Paul A Raschky, 2020. "Estimating Sleep and Work Hours from Alternative Data by Segmented Functional Classification Analysis, SFCA," SoDa Laboratories Working Paper Series 2020-04, Monash University, SoDa Laboratories.
    2. M. Hubert & P. Rousseeuw & K. Vakili, 2014. "Shape bias of robust covariance estimators: an empirical study," Statistical Papers, Springer, vol. 55(1), pages 15-28, February.
    3. Klaus Ackermann & Simon D. Angus & Paul A. Raschky, 2020. "Estimating Sleep & Work Hours from Alternative Data by Segmented Functional Classification Analysis (SFCA)," Papers 2010.08102, arXiv.org.
    4. Taesik Lee & Hayong Shin, 2016. "Combining syndromic surveillance and ILI data using particle filter for epidemic state estimation," Flexible Services and Manufacturing Journal, Springer, vol. 28(1), pages 233-253, June.
    5. Jun, Seung-Pyo & Yoo, Hyoung Sun & Lee, Jae-Seong, 2021. "The impact of the pandemic declaration on public awareness and behavior: Focusing on COVID-19 google searches," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    6. Zhengming Xing & Bradley Nicholson & Monica Jimenez & Timothy Veldman & Lori Hudson & Joseph Lucas & David Dunson & Aimee K. Zaas & Christopher W. Woods & Geoffrey S. Ginsburg & Lawrence Carin, 2014. "Bayesian modeling of temporal properties of infectious disease in a college student population," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(6), pages 1358-1382, June.
    7. Katsikopoulos, Konstantinos V. & Şimşek, Özgür & Buckmann, Marcus & Gigerenzer, Gerd, 2022. "Transparent modeling of influenza incidence: Big data or a single data point from psychological theory?," International Journal of Forecasting, Elsevier, vol. 38(2), pages 613-619.
    8. Amir Hassan Zadeh & Hamed M. Zolbanin & Ramesh Sharda & Dursun Delen, 2019. "Social Media for Nowcasting Flu Activity: Spatio-Temporal Big Data Analysis," Information Systems Frontiers, Springer, vol. 21(4), pages 743-760, August.
    9. Lisa Singh & Carole Roan Gresenz, 2022. "Social Media Data for Firearms Research: Promise and Perils," The ANNALS of the American Academy of Political and Social Science, , vol. 704(1), pages 267-291, November.
    10. Baek, Changryong & Davis, Richard A. & Pipiras, Vladas, 2017. "Sparse seasonal and periodic vector autoregressive modeling," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 103-126.
    11. David H Chae & Sean Clouston & Mark L Hatzenbuehler & Michael R Kramer & Hannah L F Cooper & Sacoby M Wilson & Seth I Stephens-Davidowitz & Robert S Gold & Bruce G Link, 2015. "Association between an Internet-Based Measure of Area Racism and Black Mortality," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    12. Binelli, Chiara & Comi, Simona & Meschi, Elena & Pagani, Laura, 2024. "Every cloud has a silver lining: The role of study time and class recordings on university students’ performance during COVID-19," Journal of Economic Behavior & Organization, Elsevier, vol. 225(C), pages 305-328.
    13. Xiaoli Wang & Shuangsheng Wu & C Raina MacIntyre & Hongbin Zhang & Weixian Shi & Xiaomin Peng & Wei Duan & Peng Yang & Yi Zhang & Quanyi Wang, 2015. "Using an Adjusted Serfling Regression Model to Improve the Early Warning at the Arrival of Peak Timing of Influenza in Beijing," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-14, March.
    14. Nicoleta Serban & Huijing Jiang, 2012. "Multilevel Functional Clustering Analysis," Biometrics, The International Biometric Society, vol. 68(3), pages 805-814, September.
    15. Ishani Chaudhuri & Parthajit Kayal, 2022. "Predicting Power of Ticker Search Volume in Indian Stock Market," Working Papers 2022-214, Madras School of Economics,Chennai,India.
    16. Yang, Xin & Pan, Bing & Evans, James A. & Lv, Benfu, 2015. "Forecasting Chinese tourist volume with search engine data," Tourism Management, Elsevier, vol. 46(C), pages 386-397.
    17. Sana Malik & Melissa Bessaha & Kathleen Scarbrough & Jessica Younger & Wei Hou, 2023. "Self-Reported Depression and Anxiety among Graduate Students during the COVID-19 Pandemic: Examining Risk and Protective Factors," Sustainability, MDPI, vol. 15(8), pages 1-16, April.
    18. Kuchler, Theresa & Russel, Dominic & Stroebel, Johannes, 2022. "JUE Insight: The geographic spread of COVID-19 correlates with the structure of social networks as measured by Facebook," Journal of Urban Economics, Elsevier, vol. 127(C).
    19. Markowitz, Sara & Nesson, Erik & Robinson, Joshua J., 2019. "The effects of employment on influenza rates," Economics & Human Biology, Elsevier, vol. 34(C), pages 286-295.
    20. Christoph Zimmer & Reza Yaesoubi & Ted Cohen, 2017. "A Likelihood Approach for Real-Time Calibration of Stochastic Compartmental Epidemic Models," PLOS Computational Biology, Public Library of Science, vol. 13(1), pages 1-21, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:9:p:4560-:d:543278. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.