Accelerating outlier detection with intra- and inter-node parallelism
Contributo in Atti di convegno
Data di Pubblicazione:
2014
Abstract:
Outlier detection is a data mining task consisting
in the discovery of observations which deviate substantially
from the rest of the data, and has many important practical
applications. Outlier detection in very large data sets is however
computationally very demanding and the size limit of the data
that can be elaborated is considerably pushed forward by mixing
three ingredients: efficient algorithms, intra-cpu parallelism of
high-performance architectures, network level parallelism. In
this paper we propose an outlier detection algorithm able to
exploit the internal parallelism of a GPU and the external
parallelism of a cluster of GPU. The algorithm is the evolution of
our previous solutions which considered either GPU or network
level parallelism. We discuss a set of large scale experiments
executed in a supercomputing facility and show the speedup
obtained with varying number of nodes.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Distance-based outliers; high performance computing; GPU; parallel algorithms.
Elenco autori:
Basta, Stefano
Link alla scheda completa: