Hierarchical Big Data Clustering

Contributo in Atti di convegno

Data di Pubblicazione:

2015

Abstract:

The urgent need for new techniques for Big data analysis calls for a great deal of attention by both research and industry communities. Among the techniques that must be redesigned for big data analysis purposes, clustering plays a crucial role. Dealing with Big Data implies that information to be analyzed have size ranging from terabytes to petabytes of data, making the use of clustering algorithms quite challenging, due to their (relatively) high computational costs. In this paper we discuss how to tackle this problem and how to implement a clustering strategy suitable for big data and having a reasonable execution time. We focus our attention on hierarchical clustering, as this class of algorithms easily meet some constraints set by big data features, while allowing the use of the most efficient solution for data access in distributed environments.

Tipologia CRIS:

04.01 Contributo in Atti di convegno

Keywords:

Big Data Clustering

Elenco autori:

Masciari, Elio

Link alla scheda completa:

https://iris.cnr.it/handle/20.500.14243/300333