Data di Pubblicazione:
2013
Abstract:
The availability of cost-effective data collections
and storage hardware has allowed organizations to accumulate
very large data sets, which are a potential source of previously
unknown valuable information. The process of discovering
interesting patterns in such large data sets is referred to as
data mining. Outlier detection is a data mining task consisting
in the discovery of observations which deviate substantially
from the rest of the data, and has many important practical
applications. Outlier detection in very large data sets is however
computationally very demanding and currently requires highperformance
computing facilities. We propose a family of parallel
algorithms for Graphic Processing Units (GPU), derived from two
distance-based outlier detection algorithms: the BruteForce and
the SolvingSet. We analyze their performance with an extensive
set of experiments, comparing the GPU implementations with
the base CPU versions and obtaining significant speedups.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Data mining exploiting GPUs; outlier detection
Elenco autori:
Basta, Stefano
Link alla scheda completa: