Mining the key predictors for event outbreaks in social networks

Author

Listed:
  • Yi, Chengqi
  • Bao, Yuanyuan
  • Xue, Yibo

Abstract

A method for predicting a so-called event outbreak would be beneficial. Existing works mainly focus on exploring effective methods for improving the accuracy of predictions, while ignoring the underlying causes: What makes an event go viral? Which factors significantly influence the prediction of an event outbreak in social networks? In this paper, we proposed a novel definition for an event outbreak, taking into account the structural changes to a network during the propagation of content. In addition, we investigated the features to which the prediction of an event outbreak is sensitive. In order to investigate the universality of these features at different stages of an event, we split the entire lifecycle of an event into 20 equal segments according to the proportion of the propagation time. We extracted 44 features, including features related to content, users, structure, and time, from each segment of the event. Based on these features, we proposed a prediction method using supervised classification algorithms to predict event outbreaks. Experimental results indicate that, as an event progresses, our method is highly accurate, with a precision rate ranging from 79% to 97% and a recall rate ranging from 74% to 97%. In addition, after applying a feature-selection algorithm, the top five selected features can considerably improve the accuracy of the prediction. Data-driven experimental results show that the entropy of the eigenvector centrality, the entropy of the PageRank, the standard deviation of the betweenness centrality, the proportion of re-shares without content, and the average path length are the key predictors for an event outbreak. Our findings are especially useful for further exploring the intrinsic characteristics of outbreak prediction.
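
To make two of the ideas in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It shows one way to assign a timestamp to one of 20 equal lifecycle segments and one way to compute an "entropy of the PageRank" feature for a small re-share graph. The abstract does not define the entropy, so the sketch assumes Shannon entropy (base 2) over scores normalized to sum to 1. The function names, the damping factor of 0.85, the edge direction (re-sharer to source), and the toy graph are all illustrative assumptions.

```python
import math


def segment_index(t, t_start, t_end, n_segments=20):
    """Return the 0-based lifecycle segment containing timestamp t.

    Assumption: segments are equal fractions of the total propagation time.
    """
    if t_end <= t_start:
        raise ValueError("t_end must be after t_start")
    frac = (t - t_start) / (t_end - t_start)
    # Clamp so that t == t_end falls in the last segment.
    return max(0, min(int(frac * n_segments), n_segments - 1))


def shannon_entropy(scores):
    """Shannon entropy (bits) of non-negative scores, normalized to sum to 1."""
    total = sum(scores)
    if total <= 0:
        return 0.0
    probs = [s / total for s in scores if s > 0]
    return -sum(p * math.log2(p) for p in probs)


def pagerank(edges, damping=0.85, iterations=100):
    """Plain power-iteration PageRank on a directed edge list.

    Rank held by nodes without out-links is spread uniformly over all nodes.
    """
    nodes = sorted({n for edge in edges for n in edge})
    out_links = {n: [] for n in nodes}
    for src, dst in edges:
        out_links[src].append(dst)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        new_rank = {node: (1 - damping) / n + damping * dangling / n
                    for node in nodes}
        for src in nodes:
            for dst in out_links[src]:
                new_rank[dst] += damping * rank[src] / len(out_links[src])
        rank = new_rank
    return rank


# Hypothetical toy cascade: users b, c and d each re-share a post by user a.
star_edges = [("b", "a"), ("c", "a"), ("d", "a")]
star_ranks = pagerank(star_edges)
star_entropy = shannon_entropy(list(star_ranks.values()))
print("segment of t=50 in [0, 100]:", segment_index(50, 0, 100))
print("PageRank entropy of the star cascade:", star_entropy)
```

Under these assumptions, a cascade dominated by a single hub yields a lower PageRank entropy than one in which influence is spread evenly, which gives some intuition for why such a feature might carry signal about how an event propagates.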

Suggested Citation

  • Yi, Chengqi & Bao, Yuanyuan & Xue, Yibo, 2016. "Mining the key predictors for event outbreaks in social networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 447(C), pages 247-260.
  • Handle: RePEc:eee:phsmap:v:447:y:2016:i:c:p:247-260
    DOI: 10.1016/j.physa.2015.12.019

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437115010468
    Download Restriction: Full text for ScienceDirect subscribers only. The journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.
