IDEAS home Printed from https://ideas.repec.org/a/wsi/ijitdm/v16y2017i05ns0219622017500262.html
   My bibliography  Save this article

A Combined Approach Based on K-Means and Modified Electromagnetism-Like Mechanism for Data Clustering

Author

Listed:
  • Esmaeil Mehdizadeh

    (Faculty of Industrial & Mechanical Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran)

  • Mohammad Teimouri

    (Faculty of Industrial & Mechanical Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran)

  • Arash Zaretalab

    (Faculty of Industrial & Mechanical Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran‡Department of Industrial Engineering, Amirkabir University of Technology, Tehran, Iran)

  • S. T. A. Niaki

    (#x2020;Department of Industrial Engineering, Sharif University of Technology, P.O. Box 11155-9414 Azadi Ave., Tehran 1458889694, Iran)

Abstract

Clustering is one of the useful methods in many scientific fields. It is a classification process to group data in specific clusters based on their similarities. Many heuristic and meta-heuristic algorithms have been successfully applied in the literature to solve clustering problems. Among them, the K-means is one of the best due to its simplicity and computational efficiency. However, it suffers from several drawbacks, the most significant of which is its dependency on the initial state that leads to trapping in local optima. In this paper, the K-means method is combined with a modified electromagnetism-like mechanism (MEM) algorithm to develop a new algorithm called K-MEM in order to avoid trapping in local optima. In addition, two modifications are made in this paper to improve the performance of the EM algorithm. First, a modified local search procedure is adopted to improve searching. Second, an elitism approach is imported to improve the moving procedure. The proposed algorithm is tested on four standard datasets chosen from the UCI Machine Learning repository and several artificial datasets, where its performance is compared with those of EM, MEM, K-means, combination of K-means and EM (K-EM), genetic algorithm (GA), simulated annealing (SA), ant colony optimization (ACO), honey bee mating optimization (HBMO) and particle swarm optimization (PSO). The results illustrate that the proposed K-MEM algorithm has a good performance to find desired results.

Suggested Citation

  • Esmaeil Mehdizadeh & Mohammad Teimouri & Arash Zaretalab & S. T. A. Niaki, 2017. "A Combined Approach Based on K-Means and Modified Electromagnetism-Like Mechanism for Data Clustering," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(05), pages 1279-1307, September.
  • Handle: RePEc:wsi:ijitdm:v:16:y:2017:i:05:n:s0219622017500262
    DOI: 10.1142/S0219622017500262
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219622017500262
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219622017500262?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gang Kou & Yanqun Lu & Yi Peng & Yong Shi, 2012. "Evaluation Of Classification Algorithms Using Mcdm And Rank Correlation," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 11(01), pages 197-225.
    2. Debels, Dieter & De Reyck, Bert & Leus, Roel & Vanhoucke, Mario, 2006. "A hybrid scatter search/electromagnetism meta-heuristic for project scheduling," European Journal of Operational Research, Elsevier, vol. 169(2), pages 638-653, March.
    3. Yi Peng & Gang Kou & Yong Shi & Zhengxin Chen, 2008. "A Descriptive Framework For The Field Of Data Mining And Knowledge Discovery," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 7(04), pages 639-682.
    4. Po, Rung-Wei & Guh, Yuh-Yuan & Yang, Miin-Shen, 2009. "A new clustering approach using data envelopment analysis," European Journal of Operational Research, Elsevier, vol. 199(1), pages 276-284, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chun-Hao Chen & Tzung-Pei Hong & Yeong-Chyi Lee & Vincent S. Tseng, 2015. "Finding Active Membership Functions for Genetic-Fuzzy Data Mining," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 14(06), pages 1215-1242, November.
    2. Ginger Saltos & Mihaela Cocea, 2017. "An Exploration of Crime Prediction Using Data Mining on Open Data," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(05), pages 1155-1181, September.
    3. Parishani, Maede & Rasti-Barzoki, Morteza, 2024. "CWBCM method to determine the importance of classification performance evaluation criteria in machine learning: Case studies of COVID-19, Diabetes, and Thyroid Disease," Omega, Elsevier, vol. 127(C).
    4. Rahime Ceylan & Hasan Koyuncu, 2016. "A New Breakpoint in Hybrid Particle Swarm-Neural Network Architecture: Individual Boundary Adjustment," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(06), pages 1313-1343, November.
    5. Jianfeng Xu & Yuanjian Zhang & Peng Zhang & Azhar Mahmood & Yu Li & Shaheen Khatoon, 2017. "Data Mining on ICU Mortality Prediction Using Early Temporal Data: A Survey," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(01), pages 117-159, January.
    6. Fenghua Wen & Xin Yang & Xu Gong & Kin Keung Lai, 2017. "Multi-Scale Volatility Feature Analysis and Prediction of Gold Price," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(01), pages 205-223, January.
    7. H. Tolga Kahraman & Seref Sagiroglu & Ilhami Colak, 2016. "Novel User Modeling Approaches for Personalized Learning Environments," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(03), pages 575-602, May.
    8. Yen-Hao Hsieh & Soe-Tsyr Yuan, 2016. "Can Customer Expectations be Measured in Real Time?," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(01), pages 119-149, January.
    9. Roman Vavrek, 2019. "Evaluation of the Impact of Selected Weighting Methods on the Results of the TOPSIS Technique," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(06), pages 1821-1843, November.
    10. P. D. Mahendhiran & S. Kannimuthu, 2018. "Deep Learning Techniques for Polarity Classification in Multimodal Sentiment Analysis," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(03), pages 883-910, May.
    11. Giyasettin Ozcan, 2018. "Unsupervised Learning from Multi-Dimensional Data: A Fast Clustering Algorithm Utilizing Canopies and Statistical Information," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(03), pages 841-856, May.
    12. Feyzan Arikan & Senay Citak, 2017. "Multiple Criteria Inventory Classification in an Electronics Firm," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(02), pages 315-331, March.
    13. O. H. Salman & A. A. Zaidan & B. B. Zaidan & Naserkalid & M. Hashim, 2017. "Novel Methodology for Triage and Prioritizing Using “Big Data” Patients with Chronic Heart Diseases Through Telemedicine Environmental," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(05), pages 1211-1245, September.
    14. Min-Yuan Cheng & Nhat-Duc Hoang, 2016. "A Self-Adaptive Fuzzy Inference Model Based on Least Squares SVM for Estimating Compressive Strength of Rubberized Concrete," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(03), pages 603-619, May.
    15. Si He & Nabil Belacel & Alan Chan & Habib Hamam & Yassine Bouslimani, 2016. "A Hybrid Artificial Fish Swarm Simulated Annealing Optimization Algorithm for Automatic Identification of Clusters," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(05), pages 949-974, September.
    16. Thierno M. L. Diallo & Sébastien Henry & Yacine Ouzrout & Abdelaziz Bouras, 2018. "Data-Based Fault Diagnosis Model Using a Bayesian Causal Analysis Framework," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(02), pages 583-620, March.
    17. Yi Peng, 2015. "Regional earthquake vulnerability assessment using a combination of MCDM methods," Annals of Operations Research, Springer, vol. 234(1), pages 95-110, November.
    18. Fábio T. F. Silva & Alexandre Szklo & Amanda Vinhoza & Ana Célia Nogueira & André F. P. Lucena & Antônio Marcos Mendonça & Camilla Marcolino & Felipe Nunes & Francielle M. Carvalho & Isabela Tagomori , 2022. "Inter-sectoral prioritization of climate technologies: insights from a Technology Needs Assessment for mitigation in Brazil," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 27(7), pages 1-39, October.
    19. Abiodun Ogunyemi & Kevin Johnston, 2017. "Is Server Virtualization Implementation in Business and Public Organizations a Worthwhile Investment?," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(03), pages 711-736, May.
    20. Wenyi Zeng & Deqing Li & Peizhuang Wang, 2016. "Variable Weight Decision Making and Balance Function Analysis Based on Factor Space," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(05), pages 999-1014, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijitdm:v:16:y:2017:i:05:n:s0219622017500262. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijitdm/ijitdm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.