IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v014i12.html
   My bibliography  Save this article

A CLUE for CLUster Ensembles

Author

Listed:
  • Hornik, Kurt

Abstract

Cluster ensembles are collections of individual solutions to a given clustering problem which are useful or necessary to consider in a wide range of applications. The R package clue provides an extensible computational environment for creating and analyzing cluster ensembles, with basic data structures for representing partitions and hierarchies, and facilities for computing on these, including methods for measuring proximity and obtaining consensus and "secondary" clusterings.

Suggested Citation

  • Hornik, Kurt, 2005. "A CLUE for CLUster Ensembles," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 14(i12).
  • Handle: RePEc:jss:jstsof:v:014:i12
    DOI: http://hdl.handle.net/10.18637/jss.v014.i12
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v014i12/v14i12.pdf
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v014.i12?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Struyf, Anja & Hubert, Mia & Rousseeuw, Peter, 1997. "Clustering in an Object-Oriented Environment," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 1(i04).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marcin Pełka, 2012. "Ensemble approach for clustering of interval-valued symbolic data," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 13(2), pages 335-342, June.
    2. Hahsler, Michael & Bolaños, Matthew & Forrest, John, 2017. "Introduction to stream: An Extensible Framework for Data Stream Clustering Research with R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 76(i14).
    3. Luis Lorenzo & Javier Arroyo, 2022. "Analysis of the cryptocurrency market using different prototype-based clustering techniques," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-46, December.
    4. repec:hum:wpaper:sfb649dp2006-006 is not listed on IDEAS
    5. Juan José Fernández-Durán & María Mercedes Gregorio-Domínguez, 2021. "Consumer Segmentation Based on Use Patterns," Journal of Classification, Springer;The Classification Society, vol. 38(1), pages 72-88, April.
    6. Pełka Marcin, 2018. "Analysis of Innovations in the European Union Via Ensemble Symbolic Density Clustering," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(3), pages 84-98, September.
    7. Boztuğ, Yasemin & Reutterer, Thomas, 2006. "A combined approach for segment-specific analysis of market basket data," SFB 649 Discussion Papers 2006-006, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    8. Wu, Han-Ming, 2011. "On biological validity indices for soft clustering algorithms for gene expression data," Computational Statistics & Data Analysis, Elsevier, vol. 55(5), pages 1969-1979, May.
    9. Fišar, Miloš & Greiner, Ben & Huber, Christoph & Katok, Elena & Ozkes, Ali & Management Science Reproducibility Collaboration, 2023. "Reproducibility in Management Science," Department for Strategy and Innovation Working Paper Series 03/2023, WU Vienna University of Economics and Business.
    10. Boztug, Yasemin & Reutterer, Thomas, 2008. "A combined approach for segment-specific market basket analysis," European Journal of Operational Research, Elsevier, vol. 187(1), pages 294-312, May.
    11. Hornik, Kurt & Grün, Bettina, 2014. "movMF: An R Package for Fitting Mixtures of von Mises-Fisher Distributions," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 58(i10).
    12. Linda Vidman & David Källberg & Patrik Rydén, 2019. "Cluster analysis on high dimensional RNA-seq data with applications to cancer research - An evaluation study," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-21, December.
    13. Thomas Reutterer & Kurt Hornik & Nicolas March & Kathrin Gruber, 2017. "A data mining framework for targeted category promotions," Journal of Business Economics, Springer, vol. 87(3), pages 337-358, April.
    14. Hornik, Kurt & Feinerer, Ingo & Kober, Martin & Buchta, Christian, 2012. "Spherical k-Means Clustering," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 50(i10).
    15. repec:jss:jstsof:25:i05 is not listed on IDEAS
    16. repec:jss:jstsof:25:i04 is not listed on IDEAS
    17. Fionn Murtagh, 2009. "The Remarkable Simplicity of Very High Dimensional Data: Application of Model-Based Clustering," Journal of Classification, Springer;The Classification Society, vol. 26(3), pages 249-277, December.
    18. Axel Strauß & François Guilhaumon & Roger Daniel Randrianiaina & Katharina C Wollenberg Valero & Miguel Vences & Julian Glos, 2016. "Opposing Patterns of Seasonal Change in Functional and Phylogenetic Diversity of Tadpole Assemblages," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-18, March.
    19. Patrick Oliver Schenk & Christoph Kern, 2024. "Connecting algorithmic fairness to quality dimensions in machine learning in official statistics and survey production," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 18(2), pages 131-184, June.
    20. Pełka Marcin, 2019. "Analysis of Happiness in EU Countries Using the Multi-Model Classification based on Models of Symbolic Data," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 23(3), pages 15-25, September.
    21. Pełka Marcin, 2019. "Assessment of the Development of the European Oecd Countries with the Application of Linear Ordering and Ensemble Clustering of Symbolic Data," Folia Oeconomica Stetinensia, Sciendo, vol. 19(2), pages 117-133, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kauffmann, Albrecht, 2012. "Delineation of City Regions Based on Commuting Interrelations: The Example of Large Cities in Germany," IWH Discussion Papers 4/2012, Halle Institute for Economic Research (IWH).
    2. Beata Gavurova & Ladislav Suhanyi & Martin Rigelský, 2020. "Tourist spending and productivity of economy in OECD countries – research on perspectives of sustainable tourism," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 8(1), pages 983-1000, September.
    3. Kim, Jaejik & Billard, L., 2011. "A polythetic clustering process and cluster validity indexes for histogram-valued objects," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2250-2262, July.
    4. Mohiuddin Ahmed, 2018. "Collective Anomaly Detection Techniques for Network Traffic Analysis," Annals of Data Science, Springer, vol. 5(4), pages 497-512, December.
    5. Jörg Weking & Andreas Hein & Markus Böhm & Helmut Krcmar, 2020. "A hierarchical taxonomy of business model patterns," Electronic Markets, Springer;IIM University of St. Gallen, vol. 30(3), pages 447-468, September.
    6. Tommaso Agasisti & Francesca Ieva & Anna Maria Paganoni, 2017. "Heterogeneity, school-effects and the North/South achievement gap in Italian secondary education: evidence from a three-level mixed model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(1), pages 157-180, March.
    7. Ma, Zhenjun & Yan, Rui & Nord, Natasa, 2017. "A variation focused cluster analysis strategy to identify typical daily heating load profiles of higher education buildings," Energy, Elsevier, vol. 134(C), pages 90-102.
    8. Renato Cordeiro Amorim & Vladimir Makarenkov & Boris Mirkin, 2020. "Core Clustering as a Tool for Tackling Noise in Cluster Labels," Journal of Classification, Springer;The Classification Society, vol. 37(1), pages 143-157, April.
    9. Wen, Xuanhao & Cao, Huajun & Li, Hongcheng & Zheng, Jie & Ge, Weiwei & Chen, Erheng & Gao, Xi & Hon, Bernard, 2022. "A dual energy benchmarking methodology for energy-efficient production planning and operation of discrete manufacturing systems using data mining techniques," Energy, Elsevier, vol. 255(C).
    10. Karpinska, Lilia & Śmiech, Sławomir, 2021. "Breaking the cycle of energy poverty. Will Poland make it?," Energy Economics, Elsevier, vol. 94(C).
    11. Jesus Gonzalez-Feliu & Joelle Morana & Josep-Maria Salanova Grau & Tai-Yu Ma, 2013. "Design And Scenario Assessment For Collaborative Logistics And Freight Transport Systems," Articles, International Journal of Transport Economics, vol. 40(2).
    12. Frederickson Entila & Xiaowei Han & Akira Mine & Paul Schulze-Lefert & Kenichi Tsuda, 2024. "Commensal lifestyle regulated by a negative feedback loop between Arabidopsis ROS and the bacterial T2SS," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    13. Jörg Weking & Michael Mandalenakis & Andreas Hein & Sebastian Hermes & Markus Böhm & Helmut Krcmar, 2020. "The impact of blockchain technology on business models – a taxonomy and archetypal patterns," Electronic Markets, Springer;IIM University of St. Gallen, vol. 30(2), pages 285-305, June.
    14. Alexander Platzer, 2013. "Visualization of SNPs with t-SNE," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-6, February.
    15. Albrecht Kauffmann, 2011. "Wirkung kommunaler Investitionen in die Tourismusinfrastruktur am Beispiel Sachsens," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 31(1), pages 57-73, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:014:i12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.