IDEAS home Printed from https://ideas.repec.org/a/spr/jclass/v36y2019i3d10.1007_s00357-019-09318-4.html
   My bibliography  Save this article

MCC: a Multiple Consensus Clustering Framework

Author

Listed:
  • Tao Li

    (Florida International University)

  • Yi Zhang
  • Dingding Wang

    (Florida Atlantic University)

  • Jian Xu

Abstract

Consensus clustering has emerged as an important extension of the classical clustering problem. Given a set of input clusterings of a given dataset, consensus clustering aims to find a single final clustering which is a better fit in some sense than the existing clusterings. There is a significant drawback in generating a single consensus clustering since different input clusterings could differ significantly. In this paper, we develop a new framework, called Multiple Consensus Clustering (MCC), to explore multiple clustering views of a given dataset from a set of input clusterings. Instead of generating a single consensus, we propose two sets of approaches to obtain multiple consensus. One employs the meta clustering method, and the other uses a hierarchical tree structure and further applies a dynamic programming algorithm to generate a flat partition from the hierarchical tree using the modularity measure. Multiple consensuses are finally obtained by applying consensus clustering algorithms to each cluster of the partition. Extensive experimental results on 11 real-world datasets and a case study on a Protein-Protein Interaction (PPI) dataset demonstrate the effectiveness of the MCC framework.

Suggested Citation

  • Tao Li & Yi Zhang & Dingding Wang & Jian Xu, 2019. "MCC: a Multiple Consensus Clustering Framework," Journal of Classification, Springer;The Classification Society, vol. 36(3), pages 414-434, October.
  • Handle: RePEc:spr:jclass:v:36:y:2019:i:3:d:10.1007_s00357-019-09318-4
    DOI: 10.1007/s00357-019-09318-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00357-019-09318-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00357-019-09318-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fallah Shafagh & Tritchler David & Beyene Joseph, 2008. "Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-25, August.
    2. Alexander Strehl & Joydeep Ghosh, 2003. "Relationship-Based Clustering and Visualization for High-Dimensional Data Mining," INFORMS Journal on Computing, INFORMS, vol. 15(2), pages 208-230, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sewell, Daniel K., 2018. "Visualizing data through curvilinear representations of matrices," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 255-270.
    2. Jiang, Jianhua & Hao, Dehao & Chen, Yujun & Parmar, Milan & Li, Keqin, 2018. "GDPC: Gravitation-based Density Peaks Clustering algorithm," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 502(C), pages 345-355.
    3. Cyrus Shahabi & Farnoush Banaei-Kashani, 2003. "Efficient and Anonymous Web-Usage Mining for Web Personalization," INFORMS Journal on Computing, INFORMS, vol. 15(2), pages 123-147, May.
    4. Kaiquan Xu & Stephen Shaoyi Liao & Raymond Y. K. Lau & J. Leon Zhao, 2014. "Effective Active Learning Strategies for the Use of Large-Margin Classifiers in Semantic Annotation: An Optimal Parameter Discovery Perspective," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 461-483, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jclass:v:36:y:2019:i:3:d:10.1007_s00357-019-09318-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.