IDEAS home Printed from https://ideas.repec.org/a/spr/infotm/v17y2016i2d10.1007_s10799-015-0238-0.html
   My bibliography  Save this article

Chinese trending search terms popularity rank prediction

Author

Listed:
  • Soyeon Caren Han

    (University of Tasmania)

  • Yulu Liang

    (University of Tasmania)

  • Hyunsuk Chung

    (University of Tasmania)

  • Hyejin Kim

    (Sungshin Women’s University)

  • Byeong Ho Kang

    (University of Tasmania)

Abstract

Baidu, the most popular Chinese search engine, monitors what their users are currently searching and provides top 50 search terms, called trending search terms, in descending order of popularity ranking. The paper focused on predicting the popularity ranking trends of this top trending search terms in Baidu. Based on the data analysis, two issues were identified that could affect accuracy of using the ranking data for predicting the popularity of trending searched terms. Firstly, all trending terms are disappeared from the top 50 terms list when the popularity is getting lower. However, there are several trending terms that reappear to the top 50 terms list after they disappeared. New distinct search terms can be differentiated from reappearances of old terms so we proposed the term distinction model by using the related news articles of a trending search term provided by Baidu. Secondly, it is necessary to handle the missing value when the term is out of the trending term list. To achieve the goal of this paper, we collected top 50 trending search terms from Baidu engine and its related news articles hourly for 6 months (from 1st March 2013 to 31th August 2013). Based on the proposed model, we found that the optimal disappearing interval can be 9 h, and using rank 51 for the missing values was the most successful. We conducted evaluations by using 3 months data (from 1st September 2013 to 30th November 2013), and four machine learning techniques where compared to evaluate the most accurate for predicting the popularity rank of trending search terms. Feed Forward Neural Network was achieved 78.81 % the most highest prediction accuracy, and achieved 85.55 % accuracy in ±3 error range.

Suggested Citation

  • Soyeon Caren Han & Yulu Liang & Hyunsuk Chung & Hyejin Kim & Byeong Ho Kang, 2016. "Chinese trending search terms popularity rank prediction," Information Technology and Management, Springer, vol. 17(2), pages 133-139, June.
  • Handle: RePEc:spr:infotm:v:17:y:2016:i:2:d:10.1007_s10799-015-0238-0
    DOI: 10.1007/s10799-015-0238-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10799-015-0238-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10799-015-0238-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kesten Green & J. Scott Armstrong & Andreas Graefe, 2007. "Methods to Elicit Forecasts from Groups: Delphi and Prediction Markets Compared," Foresight: The International Journal of Applied Forecasting, International Institute of Forecasters, issue 8, pages 17-20, Fall.
    2. Nesreen Ahmed & Amir Atiya & Neamat El Gayar & Hisham El-Shishiny, 2010. "An Empirical Comparison of Machine Learning Models for Time Series Forecasting," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 594-621.
    3. Simeon Vosen & Torsten Schmidt, 2011. "Forecasting private consumption: survey‐based indicators vs. Google trends," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 30(6), pages 565-578, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Obschonka & Mingjie Zhou & Yixin Zhou & Jianxin Zhang & Rainer K. Silbereisen, 2019. "“Confucian” traits, entrepreneurial personality, and entrepreneurship in China: a regional analysis," Small Business Economics, Springer, vol. 53(4), pages 961-979, December.
    2. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    3. Jacques Bughin, 2015. "Google searches and twitter mood: nowcasting telecom sales performance," Netnomics, Springer, vol. 16(1), pages 87-105, August.
    4. Robert Lehmann, 2021. "Forecasting exports across Europe: What are the superior survey indicators?," Empirical Economics, Springer, vol. 60(5), pages 2429-2453, May.
    5. Vosen, Simeon & Schmidt, Torsten, 2012. "A monthly consumption indicator for Germany based on Internet search query data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 19(7), pages 683-687.
    6. Chien-jung Ting & Yi-Long Hsiao, 2022. "Nowcasting the GDP in Taiwan and the Real-Time Tourism Data," Advances in Management and Applied Economics, SCIENPRESS Ltd, vol. 12(3), pages 1-2.
    7. Szafranek, Karol, 2019. "Bagged neural networks for forecasting Polish (low) inflation," International Journal of Forecasting, Elsevier, vol. 35(3), pages 1042-1059.
    8. Nikolaos Askitas & Klaus F. Zimmermann, 2015. "The internet as a data source for advancement in social sciences," International Journal of Manpower, Emerald Group Publishing Limited, vol. 36(1), pages 2-12, April.
    9. Huber, Jakob & Stuckenschmidt, Heiner, 2020. "Daily retail demand forecasting using machine learning with emphasis on calendric special days," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1420-1438.
    10. Andrei Dubovik & Adam Elbourne & Bram Hendriks & Mark Kattenberg, 2022. "Forecasting World Trade Using Big Data and Machine Learning Techniques," CPB Discussion Paper 441, CPB Netherlands Bureau for Economic Policy Analysis.
    11. Kock, Anders Bredahl & Teräsvirta, Timo, 2014. "Forecasting performances of three automated modelling techniques during the economic crisis 2007–2009," International Journal of Forecasting, Elsevier, vol. 30(3), pages 616-631.
    12. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.
    13. Milan Daus & Katharina Koberger & Kaan Koca & Felix Beckers & Jorge Encinas Fernández & Barbara Weisbrod & Daniel Dietrich & Sabine Ulrike Gerbersdorf & Rüdiger Glaser & Stefan Haun & Hilmar Hofmann &, 2021. "Interdisciplinary Reservoir Management—A Tool for Sustainable Water Resources Management," Sustainability, MDPI, vol. 13(8), pages 1-21, April.
    14. Fondeur, Y. & Karamé, F., 2013. "Can Google data help predict French youth unemployment?," Economic Modelling, Elsevier, vol. 30(C), pages 117-125.
    15. Zeynalov, Ayaz, 2014. "Nowcasting Tourist Arrivals to Prague: Google Econometrics," MPRA Paper 60945, University Library of Munich, Germany.
    16. Koffi, Siméon, 2022. "Prévision de l’inflation en Côte D’ivoire : Analyse Comparée des Modèles Arima, Holt-Winters, et Lstm [Inflation Forecasting in Côte D'Ivoire: A Comparative Analysis of the Arima, Holt-Winters, and," MPRA Paper 113961, University Library of Munich, Germany.
    17. Kristina Gligorić & Arnaud Chiolero & Emre Kıcıman & Ryen W. White & Robert West, 2022. "Population-scale dietary interests during the COVID-19 pandemic," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    18. Stephen L. France & Yuying Shi, 2017. "Aggregating Google Trends: Multivariate Testing and Analysis," Papers 1712.03152, arXiv.org, revised Mar 2018.
    19. Caperna, Giulio & Colagrossi, Marco & Geraci, Andrea & Mazzarella, Gianluca, 2022. "A babel of web-searches: Googling unemployment during the pandemic," Labour Economics, Elsevier, vol. 74(C).
    20. Masha Krupenkin & David Rothschild & Shawndra Hill & Elad Yom-Tov, 2019. "President Trump Stress Disorder: Partisanship, Ethnicity, and Expressive Reporting of Mental Distress After the 2016 Election," SAGE Open, , vol. 9(1), pages 21582440198, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infotm:v:17:y:2016:i:2:d:10.1007_s10799-015-0238-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.