A hierarchical model-based approach to co-clustering high-dimensional data
Contributo in Atti di convegno
Data di Pubblicazione:
2008
Abstract:
We propose a hierarchical, model-based co-clustering framework for handling high-dimensional datasets. The technique views the dataset as a joint probability distribution over row and column variables. Our approach starts by clustering tuples in a dataset, where each cluster is characterized by a different probability distribution. Subsequently, the conditional distribution of attributes over tuples is exploited to discover natural co-clusters in the data. An intensive empirical evaluation highlights the effectiveness of our approach.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Elenco autori:
Manco, Giuseppe; Ortale, Riccardo; Costa, Giovanni
Link alla scheda completa: