IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v6y2012i2p237-253.html
   My bibliography  Save this article

Adding community and dynamic to topic models

Author

Listed:
  • Li, Daifeng
  • Ding, Ying
  • Shuai, Xin
  • Bollen, Johan
  • Tang, Jie
  • Chen, Shanshan
  • Zhu, Jiayi
  • Rocha, Guilherme

Abstract

The detection of communities in large social networks is receiving increasing attention in a variety of research areas. Most existing community detection approaches focus on the topology of social connections (e.g., coauthor, citation, and social conversation) without considering their topic and dynamic features. In this paper, we propose two models to detect communities by considering both topic and dynamic features. First, the Community Topic Model (CTM) can identify communities sharing similar topics. Second, the Dynamic CTM (DCTM) can capture the dynamic features of communities and topics based on the Bernoulli distribution that leverages the temporal continuity between consecutive timestamps. Both models were tested on two datasets: ArnetMiner and Twitter. Experiments show that communities with similar topics can be detected and the co-evolution of communities and topics can be observed by these two models, which allow us to better understand the dynamic features of social networks and make improved personalized recommendations.

Suggested Citation

  • Li, Daifeng & Ding, Ying & Shuai, Xin & Bollen, Johan & Tang, Jie & Chen, Shanshan & Zhu, Jiayi & Rocha, Guilherme, 2012. "Adding community and dynamic to topic models," Journal of Informetrics, Elsevier, vol. 6(2), pages 237-253.
  • Handle: RePEc:eee:infome:v:6:y:2012:i:2:p:237-253
    DOI: 10.1016/j.joi.2011.11.004
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157711001039
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2011.11.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ding, Ying, 2011. "Community detection: Topological vs. topical," Journal of Informetrics, Elsevier, vol. 5(4), pages 498-514.
    2. Aaron Clauset & Cristopher Moore & M. E. J. Newman, 2008. "Hierarchical structure and the prediction of missing links in networks," Nature, Nature, vol. 453(7191), pages 98-101, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Peng Wang & Mengnan Zhang & Yike Wang & Xiqing Yuan, 2023. "Sustainable Career Development of Chinese Generation Z (Post-00s) Attending and Graduating from University: Dynamic Topic Model Analysis Based on Microblogging," Sustainability, MDPI, vol. 15(3), pages 1-17, January.
    2. Masood, Muhammad Ali & Abbasi, Rabeeh Ayaz, 2021. "Using graph embedding and machine learning to identify rebels on twitter," Journal of Informetrics, Elsevier, vol. 15(1).
    3. Small, Kenneth A. & Ng, Chen Feng, 2014. "Optimizing road capacity and type," Economics of Transportation, Elsevier, vol. 3(2), pages 145-157.
    4. Hall, Lisa M.H. & Buckley, Alastair R., 2016. "A review of energy systems models in the UK: Prevalent usage and categorisation," Applied Energy, Elsevier, vol. 169(C), pages 607-628.
    5. Qiang Gao & Xiao Huang & Ke Dong & Zhentao Liang & Jiang Wu, 2022. "Semantic-enhanced topic evolution analysis: a combination of the dynamic topic model and word2vec," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1543-1563, March.
    6. Schröder, Nadine & Falke, Andreas & Hruschka, Harald & Reutterer, Thomas, 2019. "Analyzing the Browsing Basket: A Latent Interests-Based Segmentation Tool," Journal of Interactive Marketing, Elsevier, vol. 47(C), pages 181-197.
    7. Erjia Yan, 2014. "Topic-based Pagerank: toward a topic-level scientific evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(2), pages 407-437, August.
    8. Qian-Jin Zong & Hong-Zhou Shen & Qin-Jian Yuan & Xiao-Wei Hu & Zhi-Ping Hou & Shun-Guo Deng, 2013. "Doctoral dissertations of Library and Information Science in China: A co-word analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(2), pages 781-799, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Curci, Ylenia & Mongeau Ospina, Christian A., 2016. "Investigating biofuels through network analysis," Energy Policy, Elsevier, vol. 97(C), pages 60-72.
    2. Ding, Ying, 2011. "Community detection: Topological vs. topical," Journal of Informetrics, Elsevier, vol. 5(4), pages 498-514.
    3. Gräbner, Claudius, 2016. "From realism to instrumentalism - and back? Methodological implications of changes in the epistemology of economics," MPRA Paper 71933, University Library of Munich, Germany.
    4. Tamás Nepusz & Tamás Vicsek, 2013. "Hierarchical Self-Organization of Non-Cooperating Individuals," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-9, December.
    5. Nora Connor & Albert Barberán & Aaron Clauset, 2017. "Using null models to infer microbial co-occurrence networks," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-23, May.
    6. Leto Peel & Tiago P. Peixoto & Manlio De Domenico, 2022. "Statistical inference links data and theory in network science," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    7. Xinyi Liu & Bin Liu & Zhimin Huang & Ting Shi & Yingyi Chen & Jian Zhang, 2012. "SPPS: A Sequence-Based Method for Predicting Probability of Protein-Protein Interaction Partners," PLOS ONE, Public Library of Science, vol. 7(1), pages 1-6, January.
    8. Amulyashree Sridhar & Sharvani GS & AH Manjunatha Reddy & Biplab Bhattacharjee & Kalyan Nagaraj, 2019. "The Eminence of Co-Expressed Ties in Schizophrenia Network Communities," Data, MDPI, vol. 4(4), pages 1-23, November.
    9. Li, Qing & Zhang, Huaige & Hong, Xianpei, 2020. "Knowledge structure of technology licensing based on co-keywords network: A review and future directions," International Review of Economics & Finance, Elsevier, vol. 66(C), pages 154-165.
    10. Wang, Zuxi & Wu, Yao & Li, Qingguang & Jin, Fengdong & Xiong, Wei, 2016. "Link prediction based on hyperbolic mapping with community structure for complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 450(C), pages 609-623.
    11. Lee, Yan-Li & Zhou, Tao, 2021. "Collaborative filtering approach to link prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 578(C).
    12. Thorben Funke & Till Becker, 2019. "Stochastic block models: A comparison of variants and inference methods," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-40, April.
    13. Kan, Kees-Jan & van der Maas, Han L.J. & Levine, Stephen Z., 2019. "Extending psychometric network analysis: Empirical evidence against g in favor of mutualism?," Intelligence, Elsevier, vol. 73(C), pages 52-62.
    14. Yao, Can-Zhong & Lin, Ji-Nan & Zheng, Xu-Zhou & Liu, Xiao-Feng, 2015. "The study of RMB exchange rate complex networks based on fluctuation mode," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 436(C), pages 359-376.
    15. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    16. Zhichao Ba & Yujie Cao & Jin Mao & Gang Li, 2019. "A hierarchical approach to analyzing knowledge integration between two fields—a case study on medical informatics and computer science," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1455-1486, June.
    17. Li, Wenyao & Cai, Meng & Zhong, Xiaoni & Liu, Yanbing & Lin, Tao & Wang, Wei, 2023. "Coevolution of epidemic and infodemic on higher-order networks," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
    18. Lü, Linyuan & Zhou, Tao, 2011. "Link prediction in complex networks: A survey," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(6), pages 1150-1170.
    19. Erjia Yan & Ying Ding & Elin K. Jacob, 2012. "Overlaying communities and topics: an analysis on publication networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 499-513, February.
    20. He, Bing & Ding, Ying & Tang, Jie & Reguramalingam, Vignesh & Bollen, Johan, 2013. "Mining diversity subgraph in multidisciplinary scientific collaboration networks: A meso perspective," Journal of Informetrics, Elsevier, vol. 7(1), pages 117-128.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:6:y:2012:i:2:p:237-253. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.