UNI-FIND

Performance of a K-Means Algorithm driven by careful seeding

Conference proceedings contribution
Publication date:
2023
Abstract:
This paper proposes a variation of the K-Means clustering algorithm, named Population-Based K-Means (PB-K-MEANS), which bases its behaviour on careful seeding. The new algorithm rests on a greedy version of the K-Means++ seeding procedure (g_kmeans++), which proves effective in the search for an accurate clustering solution. PB-K-MEANS first builds a population of candidate solutions through independent runs of K-Means seeded with g_kmeans++. The stored solutions in this reservoir are then recombined by Repeated K-Means to reach a final solution that minimizes the distortion index. PB-K-MEANS is currently implemented in Java using parallel streams and lambda expressions. The paper first recalls the basic concepts of clustering and of K-Means, together with the role of the seeding procedure, and then describes the basic design and implementation issues of PB-K-MEANS. Finally, simulation experiments on both synthetic and real-world datasets are reported, confirming good execution performance and accurate clustering.
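The seeding idea mentioned in the abstract can be illustrated with a minimal Java sketch. It is not the authors' code: the class name GreedySeedingSketch, the method names, and the trials parameter are all assumptions made for illustration. It follows the commonly described greedy K-Means++ scheme, in which each new centroid is the best of several candidates sampled with probability proportional to the squared distance from the nearest centroid already chosen; the paper's g_kmeans++ may differ in its details. A parallel stream evaluates the cost of each candidate.

```java
import java.util.Arrays;
import java.util.Random;
import java.util.stream.IntStream;

/** Illustrative sketch only: all names and parameters are assumptions, not the paper's code. */
final class GreedySeedingSketch {

    /** Squared Euclidean distance between two points of equal dimension. */
    static double sqDist(double[] a, double[] b) {
        double s = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            s += d * d;
        }
        return s;
    }

    /** Samples an index with probability proportional to weights[i]. */
    static int sampleProportional(double[] weights, double total, Random rnd) {
        double r = rnd.nextDouble() * total;
        double acc = 0.0;
        int last = 0;
        for (int i = 0; i < weights.length; i++) {
            if (weights[i] <= 0.0) continue; // zero-weight points are never selected
            acc += weights[i];
            last = i;
            if (r < acc) return i;
        }
        return last; // fallback against floating-point round-off
    }

    /**
     * Greedy K-Means++ seeding: returns the indices of k points used as initial centroids.
     * 'trials' (hypothetical parameter) is the number of candidates examined per step.
     */
    static int[] seed(double[][] points, int k, int trials, Random rnd) {
        int n = points.length;
        int[] chosen = new int[k];
        chosen[0] = rnd.nextInt(n); // first centroid: uniform random choice
        double[] minDist = new double[n];
        for (int i = 0; i < n; i++) {
            minDist[i] = sqDist(points[i], points[chosen[0]]);
        }
        for (int c = 1; c < k; c++) {
            double total = Arrays.stream(minDist).sum();
            int best = -1;
            double bestCost = Double.POSITIVE_INFINITY;
            for (int t = 0; t < trials; t++) {
                int cand = sampleProportional(minDist, total, rnd);
                double[] p = points[cand];
                // Cost if 'cand' were added: sum of squared distances to the nearest centroid.
                double cost = IntStream.range(0, n).parallel()
                        .mapToDouble(i -> Math.min(minDist[i], sqDist(points[i], p)))
                        .sum();
                if (cost < bestCost) {
                    bestCost = cost;
                    best = cand;
                }
            }
            chosen[c] = best;
            // Update each point's distance to its nearest chosen centroid.
            for (int i = 0; i < n; i++) {
                minDist[i] = Math.min(minDist[i], sqDist(points[i], points[best]));
            }
        }
        return chosen;
    }

    public static void main(String[] args) {
        double[][] pts = {{0, 0}, {0, 1}, {1, 0}, {100, 100}, {100, 101}, {101, 100}};
        System.out.println(Arrays.toString(seed(pts, 2, 3, new Random(42))));
    }
}
```

In a population-based scheme such as the one the abstract describes, a routine like seed would presumably be called once per independent K-Means run, each with its own random source, before the resulting solutions are stored and recombined.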
CRIS type:
04.01 Conference proceedings contribution
Keywords:
K-Means Clustering; Seeding Procedure; Greedy K-Means++; Clustering Accuracy Indexes; Java Parallel Streams; Benchmark and Real-World Datasets; Execution Performance.
Author list:
Cicirelli, Franco Domenico
Institutional authors:
Cicirelli, Franco Domenico
Link to the full record:
https://iris.cnr.it/handle/20.500.14243/461867