IDEAS home Printed from https://ideas.repec.org/a/spr/jglopt/v49y2011i3p449-465.html
   My bibliography  Save this article

Evaluating a branch-and-bound RLT-based algorithm for minimum sum-of-squares clustering

Author

Listed:
  • Daniel Aloise
  • Pierre Hansen

Abstract

Minimum sum-of-squares clustering consists in partitioning a given set of n points into c clusters in order to minimize the sum of squared distances from the points to the centroid of their cluster. Recently, Sherali and Desai (JOGO, 2005) proposed a reformulation-linearization based branch-and-bound algorithm for this problem, claiming to solve instances with up to 1,000 points. In this paper, their algorithm is investigated in further detail, reproducing some of their computational experiments. However, our computational times turn out to be drastically larger. Indeed, for two data sets from the literature only instances with up to 20 points could be solved in less than 10 h of computer time. Possible reasons for this discrepancy are discussed. The effect of a symmetry breaking rule due to Plastria (EJOR, 2002) and of the introduction of valid inequalities of the convex hull of points in two dimensions which may belong to each cluster is also explored. Copyright Springer Science+Business Media, LLC. 2011

Suggested Citation

  • Daniel Aloise & Pierre Hansen, 2011. "Evaluating a branch-and-bound RLT-based algorithm for minimum sum-of-squares clustering," Journal of Global Optimization, Springer, vol. 49(3), pages 449-465, March.
  • Handle: RePEc:spr:jglopt:v:49:y:2011:i:3:p:449-465
    DOI: 10.1007/s10898-010-9571-3
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10898-010-9571-3
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10898-010-9571-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Pierre Hansen & Eric Ngai & Bernard K. Cheung & Nenad Mladenovic, 2005. "Analysis of Global k-Means, an Incremental Heuristic for Minimum Sum-of-Squares Clustering," Journal of Classification, Springer;The Classification Society, vol. 22(2), pages 287-310, September.
    2. Michael Brusco & Douglas Steinley, 2007. "A Comparison of Heuristic Procedures for Minimum Within-Cluster Sums of Squares Partitioning," Psychometrika, Springer;The Psychometric Society, vol. 72(4), pages 583-600, December.
    3. Plastria, Frank, 2002. "Formulating logical implications in combinatorial optimisation," European Journal of Operational Research, Elsevier, vol. 140(2), pages 338-353, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel Aloise & Nielsen Castelo Damasceno & Nenad Mladenović & Daniel Nobre Pinheiro, 2017. "On Strategies to Fix Degenerate k-means Solutions," Journal of Classification, Springer;The Classification Society, vol. 34(2), pages 165-190, July.
    2. Roe, R.A. & Smeelen, M. & Hoefeld, C., 2005. "Outsourcing and organizational change : an employee perspective," Research Memorandum 045, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    3. Federico Della Croce, 2016. "MP or not MP: that is the question," Journal of Scheduling, Springer, vol. 19(1), pages 33-42, February.
    4. J. Fernando Vera & Rodrigo Macías, 2021. "On the Behaviour of K-Means Clustering of a Dissimilarity Matrix by Means of Full Multidimensional Scaling," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 489-513, June.
    5. Michael Brusco & Douglas Steinley, 2015. "Affinity Propagation and Uncapacitated Facility Location Problems," Journal of Classification, Springer;The Classification Society, vol. 32(3), pages 443-480, October.
    6. Michael Brusco & Hans-Friedrich Köhn, 2009. "Exemplar-Based Clustering via Simulated Annealing," Psychometrika, Springer;The Psychometric Society, vol. 74(3), pages 457-475, September.
    7. Carrizosa, E. & Martin-Barragán, B. & Plastria, F. & Romero Morales, M.D., 2002. "A Dissimilarity-based approach for Classification," Research Memorandum 027, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    8. Amaldi, Edoardo & Coniglio, Stefano, 2013. "A distance-based point-reassignment heuristic for the k-hyperplane clustering problem," European Journal of Operational Research, Elsevier, vol. 227(1), pages 22-29.
    9. Adib Roshani & Mehdi Hamidi, 2022. "Groundwater Level Fluctuations in Coastal Aquifer: Using Artificial Neural Networks to Predict the Impacts of Climatical CMIP6 Scenarios," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(11), pages 3981-4001, September.
    10. Jorge A. Rivero, 2023. "Unobserved Grouped Heteroskedasticity and Fixed Effects," Papers 2310.14068, arXiv.org, revised Oct 2023.
    11. Pierre Hansen & Nenad Mladenović & José Moreno Pérez, 2010. "Variable neighbourhood search: methods and applications," Annals of Operations Research, Springer, vol. 175(1), pages 367-407, March.
    12. Christian Hennig, 2022. "An empirical comparison and characterisation of nine popular clustering methods," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(1), pages 201-229, March.
    13. Jolivet, Grégory & Postel-Vinay, Fabien, 2020. "A Structural Analysis of Mental Health and Labor Market Trajectories," IZA Discussion Papers 13518, Institute of Labor Economics (IZA).
    14. Scott P. Stevens & Susan W. Palocsay, 2017. "Teaching Use of Binary Variables in Integer Linear Programs: Formulating Logical Conditions," INFORMS Transactions on Education, INFORMS, vol. 18(1), pages 28-36, September.
    15. Michael Brusco & Renu Singh & Douglas Steinley, 2009. "Variable Neighborhood Search Heuristics for Selecting a Subset of Variables in Principal Component Analysis," Psychometrika, Springer;The Psychometric Society, vol. 74(4), pages 705-726, December.
    16. Rumen Andonov & Stefan Balev & Nicola Yanev, 2004. "Protein Threading: From Mathematical Models to Parallel Implementations," INFORMS Journal on Computing, INFORMS, vol. 16(4), pages 393-405, November.
    17. Lumsdaine, Robin L. & Okui, Ryo & Wang, Wendun, 2023. "Estimation of panel group structure models with structural breaks in group memberships and coefficients," Journal of Econometrics, Elsevier, vol. 233(1), pages 45-65.
    18. Susan Brudvig & Michael J. Brusco & J. Dennis Cradit, 2019. "Joint selection of variables and clusters: recovering the underlying structure of marketing data," Journal of Marketing Analytics, Palgrave Macmillan, vol. 7(1), pages 1-12, March.
    19. Plastria, F., 2012. "A note towards improved homeland defense," Omega, Elsevier, vol. 40(2), pages 244-248, April.
    20. Aurore Delaigle & Peter Hall & Tung Pham, 2019. "Clustering functional data into groups by using projections," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 271-304, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jglopt:v:49:y:2011:i:3:p:449-465. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.