IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v289y2021i2p456-469.html
   My bibliography  Save this article

A polynomial algorithm for balanced clustering via graph partitioning

Author

Listed:
  • Caraballo, Luis Evaristo
  • Díaz-Báñez, José-Miguel
  • Kroher, Nadine

Abstract

The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as detecting the inherent separations between groups of a given point set in a metric space governed by a similarity function. The pairwise similarities between all data objects form a weighted graph whose adjacency matrix contains all necessary information for the clustering process. Consequently, the clustering task can be formulated as a graph partitioning problem. In this context, we propose a new cluster quality measure which uses the ratio of intra- and inter-cluster variance and allows us to compute the optimal clustering under the min-max principle in polynomial time. Our algorithm can be applied to both partitional and hierarchical clustering.

Suggested Citation

  • Caraballo, Luis Evaristo & Díaz-Báñez, José-Miguel & Kroher, Nadine, 2021. "A polynomial algorithm for balanced clustering via graph partitioning," European Journal of Operational Research, Elsevier, vol. 289(2), pages 456-469.
  • Handle: RePEc:eee:ejores:v:289:y:2021:i:2:p:456-469
    DOI: 10.1016/j.ejor.2020.07.031
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221720306421
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2020.07.031?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Klincewicz, J. G., 1991. "Heuristics for the p-hub location problem," European Journal of Operational Research, Elsevier, vol. 53(1), pages 25-37, July.
    2. Jane, Chin-Chia & Laih, Yih-Wenn, 2005. "A clustering algorithm for item assignment in a synchronized zone order picking system," European Journal of Operational Research, Elsevier, vol. 166(2), pages 489-496, October.
    3. Eitan Sharon & Meirav Galun & Dahlia Sharon & Ronen Basri & Achi Brandt, 2006. "Hierarchy and adaptivity in segmenting visual scenes," Nature, Nature, vol. 442(7104), pages 810-813, August.
    4. Caraballo, L.E. & Díaz-Báñez, J.M. & Maza, I. & Ollero, A., 2017. "The block-information-sharing strategy for task allocation: A case study for structure assembly with aerial robots," European Journal of Operational Research, Elsevier, vol. 260(2), pages 725-738.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ah-Pine, Julien, 2022. "Learning doubly stochastic and nearly idempotent affinity matrix for graph-based clustering," European Journal of Operational Research, Elsevier, vol. 299(3), pages 1069-1078.
    2. Chen, Claire Y.T. & Sun, Edward W. & Miao, Wanyu & Lin, Yi-Bing, 2024. "Reconciling business analytics with graphically initialized subspace clustering for optimal nonlinear pricing," European Journal of Operational Research, Elsevier, vol. 312(3), pages 1086-1107.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marianov, Vladimir & Serra, Daniel & ReVelle, Charles, 1999. "Location of hubs in a competitive environment," European Journal of Operational Research, Elsevier, vol. 114(2), pages 363-371, April.
    2. Dhyani, Sneha & Jayaswal, Sachin & Sinha, Ankur & Vidyarthi, Navneet, 2019. "Alternate Second Order Conic Programming Reformulations for Hub Location with Capacity Selection under Demand," IIMA Working Papers WP 2018-12-04, Indian Institute of Management Ahmedabad, Research and Publication Department.
    3. Kovács, András, 2011. "Optimizing the storage assignment in a warehouse served by milkrun logistics," International Journal of Production Economics, Elsevier, vol. 133(1), pages 312-318, September.
    4. Ricardo Saraiva de Camargo & Gilberto de Miranda & Henrique Pacca L. Luna, 2009. "Benders Decomposition for Hub Location Problems with Economies of Scale," Transportation Science, INFORMS, vol. 43(1), pages 86-97, February.
    5. van Gils, Teun & Ramaekers, Katrien & Braekers, Kris & Depaire, Benoît & Caris, An, 2018. "Increasing order picking efficiency by integrating storage, batching, zone picking, and routing policy decisions," International Journal of Production Economics, Elsevier, vol. 197(C), pages 243-261.
    6. Rafael Diaz, 2016. "Using dynamic demand information and zoning for the storage of non-uniform density stock keeping units," International Journal of Production Research, Taylor & Francis Journals, vol. 54(8), pages 2487-2498, April.
    7. Grzegorz Tarczyński, 2023. "Linear programming models for optimal workload and batching in pick-and-pass warehousing systems," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 33(3), pages 141-158.
    8. Kijmanawat, Kerati & Ieda, Hitoshi, 2005. "Development and Application of CM-GATS Algorithms in Solving Large Multilevel Hierarchical Network Design Problems," Research in Transportation Economics, Elsevier, vol. 13(1), pages 121-142, January.
    9. Milorad Vidović & Slobodan Zečević & Milorad Kilibarda & Jelena Vlajić & Nenad Bjelić & Snežana Tadić, 2011. "The p-hub Model with Hub-catchment Areas, Existing Hubs, and Simulation: A Case Study of Serbian Intermodal Terminals," Networks and Spatial Economics, Springer, vol. 11(2), pages 295-314, June.
    10. Pan, Jason Chao-Hsien & Shih, Po-Hsun & Wu, Ming-Hung, 2015. "Order batching in a pick-and-pass warehousing system with group genetic algorithm," Omega, Elsevier, vol. 57(PB), pages 238-248.
    11. Yan T. Yang & Barak Fishbain & Dorit S. Hochbaum & Eric B. Norman & Erik Swanberg, 2014. "The Supervised Normalized Cut Method for Detecting, Classifying, and Identifying Special Nuclear Materials," INFORMS Journal on Computing, INFORMS, vol. 26(1), pages 45-58, February.
    12. Ruriko Yoshida & Kenji Fukumizu & Chrysafis Vogiatzis, 2019. "Multilocus phylogenetic analysis with gene tree clustering," Annals of Operations Research, Springer, vol. 276(1), pages 293-313, May.
    13. R de Koster & M Yu, 2008. "Minimizing makespan and throughput times at Aalsmeer flower auction," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 59(9), pages 1182-1190, September.
    14. Ishfaq, Rafay & Sox, Charles R., 2011. "Hub location-allocation in intermodal logistic networks," European Journal of Operational Research, Elsevier, vol. 210(2), pages 213-230, April.
    15. Mengyue Zhai & Zheng Wang, 2024. "Optimizing Rack Locations in the Mobile-Rack Picking System: A Method of Integrating Rack Heat and Relevance," Mathematics, MDPI, vol. 12(3), pages 1-20, January.
    16. Yu, M. & de Koster, M.B.M., 2007. "Performance Approximation and Design of Pick-and-Pass Order Picking Systems," ERIM Report Series Research in Management ERS-2007-082-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    17. Samir Elhedhli & Huyu Wu, 2010. "A Lagrangean Heuristic for Hub-and-Spoke System Design with Capacity Selection and Congestion," INFORMS Journal on Computing, INFORMS, vol. 22(2), pages 282-296, May.
    18. Kratica, Jozef & Stanimirovic, Zorica & Tosic, Dusan & Filipovic, Vladimir, 2007. "Two genetic algorithms for solving the uncapacitated single allocation p-hub median problem," European Journal of Operational Research, Elsevier, vol. 182(1), pages 15-28, October.
    19. Cunha, Claudio B. & Silva, Marcos Roberto, 2007. "A genetic algorithm for the problem of configuring a hub-and-spoke network for a LTL trucking company in Brazil," European Journal of Operational Research, Elsevier, vol. 179(3), pages 747-758, June.
    20. Li, Xiaowei & Hua, Guowei & Huang, Anqiang & Sheu, Jiuh-Biing & Cheng, T.C.E. & Huang, Fengquan, 2020. "Storage assignment policy with awareness of energy consumption in the Kiva mobile fulfilment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 144(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:289:y:2021:i:2:p:456-469. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.