Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

A fast and accurate algorithm for unsupervised clustering around centroids

Articolo
Data di Pubblicazione:
2017
Abstract:
A centroid-based clustering algorithm is proposed that works in a totally unsupervised fashion and is significantly faster and more accurate than existing algorithms. The algorithm, named CLUBS++ (for CLustering Using Binary Splitting), achieves these results by combining features of hierarchical and partition-based algorithms. Thus, CLUBS++ consists of two major phases, i.e., a divisive phase and an agglomerative phase, each followed by a refinement phase. Each major phase consists of successive steps in which the samples are repartitioned using a criterion based on least quadratic distance. This criterion possesses unique analytical properties that are elucidated in the paper and exploited by the algorithm to achieve a very fast computation. The paper presents the results of the extensive experiments performed: these confirm that the new algorithm is fast, impervious to noise, and produces results of better quality than other algorithms, such as BOOL, BIRCH, and k-means++, even when the analyst can determine the correct number of clusters--a very difficult task from which users are spared by CLUBS++.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Clu; Experimental assessment
Elenco autori:
Masciari, Elio
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/327607
Pubblicato in:
INFORMATION SCIENCES
Journal
  • Dati Generali

Dati Generali

URL

https://doi.org/10.1016/j.ins.2017.03.002
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)