Research on Community Detection of Online Social Network Members Based on the Sparse Subspace Clustering Approach

Author

Listed:
  • Zihe Zhou

    (College of Science, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)

  • Bo Tian

    (School of Information Management & Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China)

Abstract

Text data on social network platforms take the form of short texts, and in bulk they are high-dimensional and sparse, so traditional clustering algorithms perform poorly on them. In this paper, a new community detection method based on the sparse subspace clustering (SSC) algorithm is proposed to deal with the sparsity and high dimensionality of short texts in online social networks. The main idea is as follows. First, structured data, including user attributes and user behavior, and unstructured data, such as user reviews, are used to construct the vector space for the network, and the similarity of feature words is calculated from their relative positions in the synonym word forest. Then, the dimensionality of the data is reduced by principal component analysis in order to improve clustering accuracy. Next, a new community detection method for social network members based on SSC is proposed. Finally, experiments on several data sets are performed and the results are compared with the K-means clustering algorithm. The experimental results show that proper dimension reduction for high-dimensional data can improve the clustering accuracy and efficiency of the SSC approach, and that the proposed method can achieve a suitable community partition on online social network data sets.
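
The pipeline described in the abstract (dimension reduction by principal component analysis, then SSC, with K-means as the baseline) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes NumPy and scikit-learn, uses the common Lasso-based form of SSC (each sample is written as a sparse combination of the other samples, and the coefficient magnitudes are used as a graph affinity for spectral clustering), and replaces the paper's feature construction from user data and the synonym word forest with synthetic points. The function names, `alpha`, `n_components`, and the synthetic data generator are all illustrative assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans, SpectralClustering
from sklearn.decomposition import PCA
from sklearn.linear_model import Lasso


def make_two_subspaces(n_per_group=20, ambient_dim=30, sub_dim=3, seed=0):
    """Hypothetical stand-in data: two groups, each in its own random subspace."""
    rng = np.random.default_rng(seed)
    groups = []
    for _ in range(2):
        basis = rng.standard_normal((ambient_dim, sub_dim))
        coeffs = rng.standard_normal((n_per_group, sub_dim))
        groups.append(coeffs @ basis.T)
    return np.vstack(groups)


def ssc_affinity(X, alpha=0.01):
    """Sparse self-representation: code each row of X using the other rows."""
    n = X.shape[0]
    C = np.zeros((n, n))
    for i in range(n):
        others = np.delete(np.arange(n), i)  # a sample may not represent itself
        lasso = Lasso(alpha=alpha, fit_intercept=False, max_iter=5000)
        lasso.fit(X[others].T, X[i])  # columns of the design matrix are samples
        C[i, others] = lasso.coef_
    return np.abs(C) + np.abs(C).T  # symmetric, non-negative affinity


def detect_communities(X, n_communities, n_components, alpha=0.01, seed=0):
    """PCA, then SSC affinity, then spectral clustering on that affinity."""
    X_red = PCA(n_components=n_components, random_state=seed).fit_transform(X)
    norms = np.linalg.norm(X_red, axis=1, keepdims=True)
    X_red = X_red / np.maximum(norms, 1e-12)  # unit-length rows for the Lasso
    W = ssc_affinity(X_red, alpha=alpha)
    model = SpectralClustering(
        n_clusters=n_communities, affinity="precomputed", random_state=seed
    )
    return model.fit_predict(W), W


if __name__ == "__main__":
    X = make_two_subspaces()
    ssc_labels, _ = detect_communities(X, n_communities=2, n_components=6)
    km_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
    print("SSC labels:    ", ssc_labels)
    print("K-means labels:", km_labels)
```

Running the script prints both label vectors for comparison. On real data the rows of `X` would be the user feature vectors, and `n_components` and `alpha` would need tuning, since the abstract reports that the amount of dimension reduction affects SSC accuracy and efficiency.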

Suggested Citation

  • Zihe Zhou & Bo Tian, 2019. "Research on Community Detection of Online Social Network Members Based on the Sparse Subspace Clustering Approach," Future Internet, MDPI, vol. 11(12), pages 1-16, December.
  • Handle: RePEc:gam:jftint:v:11:y:2019:i:12:p:254-:d:295909

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/11/12/254/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/11/12/254/
    Download Restriction: no