IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i14p2211-d1435520.html
   My bibliography  Save this article

An Improved K-Means Algorithm Based on Contour Similarity

Author

Listed:
  • Jing Zhao

    (Key Laboratory of Industrial Automation and Machine Vision of Qiannan, School of Mathematics and Statistics, Qiannan Normal University for Nationalities, Duyun 558000, China)

  • Yanke Bao

    (College of Science, Liaoning Technical University, Fuxin 123000, China)

  • Dongsheng Li

    (Key Laboratory of Industrial Automation and Machine Vision of Qiannan, School of Mathematics and Statistics, Qiannan Normal University for Nationalities, Duyun 558000, China)

  • Xinguo Guan

    (Key Laboratory of Industrial Automation and Machine Vision of Qiannan, School of Mathematics and Statistics, Qiannan Normal University for Nationalities, Duyun 558000, China)

Abstract

The traditional k-means algorithm is widely used in large-scale data clustering because of its easy implementation and efficient process, but it also suffers from the disadvantages of local optimality and poor robustness. In this study, a Csk-means algorithm based on contour similarity is proposed to overcome the drawbacks of the traditional k-means algorithm. For the traditional k-means algorithm, which results in local optimality due to the influence of outliers or noisy data and random selection of the initial clustering centers, the Csk-means algorithm overcomes both drawbacks by combining data lattice transformation and dissimilar interpolation. In particular, the Csk-means algorithm employs Fisher optimal partitioning of the similarity vectors between samples for the process of determining the number of clusters. To improve the robustness of the k-means algorithm to the shape of the clusters, the Csk-means algorithm utilizes contour similarity to compute the similarity between samples during the clustering process. Experimental results show that the Csk-means algorithm provides better clustering results than the traditional k-means algorithm and other comparative algorithms.

Suggested Citation

  • Jing Zhao & Yanke Bao & Dongsheng Li & Xinguo Guan, 2024. "An Improved K-Means Algorithm Based on Contour Similarity," Mathematics, MDPI, vol. 12(14), pages 1-16, July.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:14:p:2211-:d:1435520
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/14/2211/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/14/2211/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Pablo Cristini Guedes & Fernanda Maria Müller & Marcelo Brutti Righi, 2023. "Risk measures-based cluster methods for finance," Risk Management, Palgrave Macmillan, vol. 25(1), pages 1-56, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Foguesatto, Cristian Rogério & Righi, Marcelo Brutti & Müller, Fernanda Maria, 2024. "Is there a dark side to financial inclusion? Understanding the relationship between financial inclusion and market risk," The North American Journal of Economics and Finance, Elsevier, vol. 72(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:14:p:2211-:d:1435520. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.