Data di Pubblicazione:
2015
Abstract:
The urgent need for new techniques for Big data analysis calls for
a great deal of attention by both research and industry communities. Among
the techniques that must be redesigned for big data analysis purposes, clustering
plays a crucial role. Dealing with Big Data implies that information to be
analyzed have size ranging from terabytes to petabytes of data, making the use
of clustering algorithms quite challenging, due to their (relatively) high computational
costs. In this paper we discuss how to tackle this problem and how to
implement a clustering strategy suitable for big data and having a reasonable execution
time. We focus our attention on hierarchical clustering, as this class of
algorithms easily meet some constraints set by big data features, while allowing
the use of the most efficient solution for data access in distributed environments.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Big Data Clustering
Elenco autori:
Masciari, Elio
Link alla scheda completa: