IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v65y2024i6d10.1007_s00362-024-01537-1.html
   My bibliography  Save this article

A semi-orthogonal nonnegative matrix tri-factorization algorithm for overlapping community detection

Author

Listed:
  • Zhaoyang Li

    (Fudan University)

  • Yuehan Yang

    (Central University of Finance and Economics)

Abstract

In this paper, we focus on overlapping community detection and propose an efficient semi-orthogonal nonnegative matrix tri-factorization (semi-ONMTF) algorithm. This method factorizes a matrix X into an orthogonal matrix U, a nonnegative matrix B, and a transposed matrix $$U^\mathrm {\scriptscriptstyle T} $$ U T . We use the Cayley Transformation to maintain strict orthogonality of U that each iteration stays on the Stiefel Manifold. This algorithm is computationally efficient because the solutions of U and B are simplified into a matrix-wise update algorithm. Applying this method, we detect overlapping communities by the belonging coefficient vector and analyse associations between communities by the unweighted network of communities. We conduct simulations and applications to show that the proposed method has wide applicability. In a real data example, we apply the semi-ONMTF to a stock data set and construct a directed association network of companies. Based on the modularity for directed and overlapping communities, we obtain five overlapping communities, 17 overlapping nodes, and five outlier nodes in the network. We also discuss the associations between communities, providing insights into the overlapping community detection on the stock market network.

Suggested Citation

  • Zhaoyang Li & Yuehan Yang, 2024. "A semi-orthogonal nonnegative matrix tri-factorization algorithm for overlapping community detection," Statistical Papers, Springer, vol. 65(6), pages 3601-3619, August.
  • Handle: RePEc:spr:stpapr:v:65:y:2024:i:6:d:10.1007_s00362-024-01537-1
    DOI: 10.1007/s00362-024-01537-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-024-01537-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-024-01537-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Daniel D. Lee & H. Sebastian Seung, 1999. "Learning the parts of objects by non-negative matrix factorization," Nature, Nature, vol. 401(6755), pages 788-791, October.
    2. Gergely Palla & Imre Derényi & Illés Farkas & Tamás Vicsek, 2005. "Uncovering the overlapping community structure of complex networks in nature and society," Nature, Nature, vol. 435(7043), pages 814-818, June.
    3. Yutong Li & Ruoqing Zhu & Annie Qu & Han Ye & Zhankun Sun, 2021. "Topic Modeling on Triage Notes With Semiorthogonal Nonnegative Matrix Factorization," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 1609-1624, October.
    4. Brunetti, Celso & Harris, Jeffrey H. & Mankad, Shawn & Michailidis, George, 2019. "Interconnectedness in the interbank market," Journal of Financial Economics, Elsevier, vol. 133(2), pages 520-538.
    5. Rainone, Edoardo, 2020. "The network nature of over-the-counter interest rates," Journal of Financial Markets, Elsevier, vol. 47(C).
    6. Sourav Chatterjee, 2021. "A New Coefficient of Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 2009-2022, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolò Pecora & Pablo Rovira Kaltwasser & Alessandro Spelta, 2016. "Discovering SIFIs in Interbank Communities," PLOS ONE, Public Library of Science, vol. 11(12), pages 1-17, December.
    2. Zhaoyang Li & Yuehan Yang, 2024. "Directed association network analysis on the Standard and Poor’s 500 Index," Computational Economics, Springer;Society for Computational Economics, vol. 63(1), pages 111-127, January.
    3. Zhang, Hongli & Gao, Yang & Zhang, Yue, 2018. "Overlapping communities from dense disjoint and high total degree clusters," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 286-298.
    4. Abdolhosseini-Qomi, Amir Mahdi & Yazdani, Naser & Asadpour, Masoud, 2020. "Overlapping communities and the prediction of missing links in multiplex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 554(C).
    5. Eustace, Justine & Wang, Xingyuan & Cui, Yaozu, 2015. "Overlapping community detection using neighborhood ratio matrix," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 421(C), pages 510-521.
    6. Ma, Xiaoke & Wang, Bingbo & Yu, Liang, 2018. "Semi-supervised spectral algorithms for community detection in complex networks based on equivalence of clustering methods," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 490(C), pages 786-802.
    7. Gao, Yang & Zhang, Hongli & Zhang, Yue, 2019. "Overlapping community detection based on conductance optimization in large-scale networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 522(C), pages 69-79.
    8. Gao, Yang & Zhang, Hongli & Zhang, Yue, 2019. "Overlapping communities from lines and triangles in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 455-466.
    9. Ma, Xiaoke & Gao, Lin & Yong, Xuerong & Fu, Lidong, 2010. "Semi-supervised clustering algorithm for community structure detection in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(1), pages 187-197.
    10. Wang, Xiao & Cao, Xiaochun & Jin, Di & Cao, Yixin & He, Dongxiao, 2016. "The (un)supervised NMF methods for discovering overlapping communities as well as hubs and outliers in networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 446(C), pages 22-34.
    11. Zhao Wang & Qingguo Xu & Weimin Li, 2022. "Multi-Layer Feature Fusion-Based Community Evolution Prediction," Future Internet, MDPI, vol. 14(4), pages 1-20, April.
    12. Nicolò Pecora & Alessandro Spelta, 2016. "Discovering SIFIs in interbank communities," DISCE - Working Papers del Dipartimento di Economia e Finanza def037, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
    13. Rafael Teixeira & Mário Antunes & Diogo Gomes & Rui L. Aguiar, 2024. "Comparison of Semantic Similarity Models on Constrained Scenarios," Information Systems Frontiers, Springer, vol. 26(4), pages 1307-1330, August.
    14. Del Corso, Gianna M. & Romani, Francesco, 2019. "Adaptive nonnegative matrix factorization and measure comparisons for recommender systems," Applied Mathematics and Computation, Elsevier, vol. 354(C), pages 164-179.
    15. P Fogel & C Geissler & P Cotte & G Luta, 2022. "Applying separative non-negative matrix factorization to extra-financial data," Working Papers hal-03689774, HAL.
    16. Kuzubaş, Tolga Umut & Saltoğlu, Burak & Sever, Can, 2016. "Systemic risk and heterogeneous leverage in banking networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 462(C), pages 358-375.
    17. Kevin F. Kiernan & Vladimir Yankov & Filip Zikes, 2021. "Liquidity Provision and Co-insurance in Bank Syndicates," Finance and Economics Discussion Series 2021-060, Board of Governors of the Federal Reserve System (U.S.).
    18. Spelta, A. & Pecora, N. & Rovira Kaltwasser, P., 2019. "Identifying Systemically Important Banks: A temporal approach for macroprudential policies," Journal of Policy Modeling, Elsevier, vol. 41(1), pages 197-218.
    19. Jorge Peña & Yannick Rochat, 2012. "Bipartite Graphs as Models of Population Structures in Evolutionary Multiplayer Games," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-13, September.
    20. Paul Fogel & Yann Gaston-Mathé & Douglas Hawkins & Fajwel Fogel & George Luta & S. Stanley Young, 2016. "Applications of a Novel Clustering Approach Using Non-Negative Matrix Factorization to Environmental Research in Public Health," IJERPH, MDPI, vol. 13(5), pages 1-14, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:65:y:2024:i:6:d:10.1007_s00362-024-01537-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.