IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v446y2016icp22-34.html
   My bibliography  Save this article

The (un)supervised NMF methods for discovering overlapping communities as well as hubs and outliers in networks

Author

Listed:
  • Wang, Xiao
  • Cao, Xiaochun
  • Jin, Di
  • Cao, Yixin
  • He, Dongxiao

Abstract

For its crucial importance in the study of large-scale networks, many researchers devote to the detection of communities in various networks. It is now widely agreed that the communities usually overlap with each other. In some communities, there exist members that play a special role as hubs (also known as leaders), whose importance merits special attention. Moreover, it is also observed that some members of the network do not belong to any communities in a convincing way, and hence recognized as outliers. Failure to detect and exclude outliers will distort, sometimes significantly, the outcome of the detected communities. In short, it is preferable for a community detection method to detect all three structures altogether. This becomes even more interesting and also more challenging when we take the unsupervised assumption, that is, we do not assume the prior knowledge of the number K of communities. Our approach here is to define a novel generative model and formalize the detection of overlapping communities as well as hubs and outliers as an optimization problem on it. When K is given, we propose a normalized symmetric nonnegative matrix factorization algorithm based on Kullback–Leibler (KL) divergence to learn the parameters of the model. Otherwise, by combining KL divergence and prior model on parameters, we introduce another parameter learning method based on Bayesian symmetric nonnegative matrix factorization to learn the parameters of the model, while determining K. Therefore, we present a community detection method arguably in the most general sense, which detects all three structures altogether without prior knowledge of the number of communities. Finally, we test the proposed method on various real-world networks. The experimental results, in contrast to several state-of-art algorithms, indicate its superior performance over other ones in terms of both clustering accuracy and community quality.

Suggested Citation

  • Wang, Xiao & Cao, Xiaochun & Jin, Di & Cao, Yixin & He, Dongxiao, 2016. "The (un)supervised NMF methods for discovering overlapping communities as well as hubs and outliers in networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 446(C), pages 22-34.
  • Handle: RePEc:eee:phsmap:v:446:y:2016:i:c:p:22-34
    DOI: 10.1016/j.physa.2015.11.016
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437115009954
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2015.11.016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Daniel D. Lee & H. Sebastian Seung, 1999. "Learning the parts of objects by non-negative matrix factorization," Nature, Nature, vol. 401(6755), pages 788-791, October.
    2. Gergely Palla & Imre Derényi & Illés Farkas & Tamás Vicsek, 2005. "Uncovering the overlapping community structure of complex networks in nature and society," Nature, Nature, vol. 435(7043), pages 814-818, June.
    3. Pablo M. Gleiser & Leon Danon, 2003. "Community Structure In Jazz," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 6(04), pages 565-573.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chen, Xue & Jiao, Pengfei & Yu, Yandong & Li, Xiaoming & Tang, Minghu, 2019. "Toward link predictability of bipartite networks based on structural enhancement and structural perturbation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 527(C), pages 1-1.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiang-Sun Zhang & Zhenping Li & Rui-Sheng Wang & Yong Wang, 2012. "A combinatorial model and algorithm for globally searching community structure in complex networks," Journal of Combinatorial Optimization, Springer, vol. 23(4), pages 425-442, May.
    2. Nicolò Pecora & Pablo Rovira Kaltwasser & Alessandro Spelta, 2016. "Discovering SIFIs in Interbank Communities," PLOS ONE, Public Library of Science, vol. 11(12), pages 1-17, December.
    3. Cui, Yaozu & Wang, Xingyuan, 2016. "Detecting one-mode communities in bipartite networks by bipartite clustering triangular," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 457(C), pages 307-315.
    4. Šubelj, Lovro & Bajec, Marko, 2011. "Community structure of complex software systems: Analysis and applications," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(16), pages 2968-2975.
    5. Zhang, Hongli & Gao, Yang & Zhang, Yue, 2018. "Overlapping communities from dense disjoint and high total degree clusters," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 286-298.
    6. Abdolhosseini-Qomi, Amir Mahdi & Yazdani, Naser & Asadpour, Masoud, 2020. "Overlapping communities and the prediction of missing links in multiplex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 554(C).
    7. Gholami, Maryam & Sheikhahmadi, Amir & Khamforoosh, Keyhan & Jalili, Mahdi, 2022. "Overlapping community detection in networks based on Neutrosophic theory," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 598(C).
    8. Eustace, Justine & Wang, Xingyuan & Cui, Yaozu, 2015. "Overlapping community detection using neighborhood ratio matrix," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 421(C), pages 510-521.
    9. Ma, Xiaoke & Gao, Lin & Yong, Xuerong & Fu, Lidong, 2010. "Semi-supervised clustering algorithm for community structure detection in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(1), pages 187-197.
    10. Gui, Chun & Zhang, Ruisheng & Hu, Rongjing & Huang, Guoming & Wei, Jiaxuan, 2018. "Overlapping communities detection based on spectral analysis of line graphs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 498(C), pages 50-65.
    11. Zhou, Xu & Liu, Yanheng & Zhang, Jindong & Liu, Tuming & Zhang, Di, 2015. "An ant colony based algorithm for overlapping community detection in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 427(C), pages 289-301.
    12. Zhaoyang Li & Yuehan Yang, 2024. "A semi-orthogonal nonnegative matrix tri-factorization algorithm for overlapping community detection," Statistical Papers, Springer, vol. 65(6), pages 3601-3619, August.
    13. Zhao Wang & Qingguo Xu & Weimin Li, 2022. "Multi-Layer Feature Fusion-Based Community Evolution Prediction," Future Internet, MDPI, vol. 14(4), pages 1-20, April.
    14. Ma, Xiaoke & Wang, Bingbo & Yu, Liang, 2018. "Semi-supervised spectral algorithms for community detection in complex networks based on equivalence of clustering methods," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 490(C), pages 786-802.
    15. Gao, Yang & Zhang, Hongli & Zhang, Yue, 2019. "Overlapping community detection based on conductance optimization in large-scale networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 522(C), pages 69-79.
    16. Yan, Chao & Chang, Zhenhai, 2020. "Modularized convex nonnegative matrix factorization for community detection in signed and unsigned networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 539(C).
    17. Nicolò Pecora & Alessandro Spelta, 2016. "Discovering SIFIs in interbank communities," DISCE - Working Papers del Dipartimento di Economia e Finanza def037, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
    18. Gao, Yang & Zhang, Hongli & Zhang, Yue, 2019. "Overlapping communities from lines and triangles in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 455-466.
    19. Rafael Teixeira & Mário Antunes & Diogo Gomes & Rui L. Aguiar, 2024. "Comparison of Semantic Similarity Models on Constrained Scenarios," Information Systems Frontiers, Springer, vol. 26(4), pages 1307-1330, August.
    20. Del Corso, Gianna M. & Romani, Francesco, 2019. "Adaptive nonnegative matrix factorization and measure comparisons for recommender systems," Applied Mathematics and Computation, Elsevier, vol. 354(C), pages 164-179.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:446:y:2016:i:c:p:22-34. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.