DCI Closed: a fast and memory efficient algorithm to mine frequent closed itemsets
Contributo in Atti di convegno
Data di Pubblicazione:
2004
Abstract:
One of the main problems raising up in the frequent closed itemsets mining problem is the duplicate detection. In this paper we propose a general technique for promptly detecting and discarding duplicate closed itemsets, without the need of keeping in the main memory the whole set of closed patterns. Our approach can be exploited with substantial performance benefits by any algorithm that adopts a vertical representation of the dataset. We implemented our technique within a new depth-first closed itemsets mining algorithm. The experimental evaluation demonstrates that our algorithm outperforms other state of the art algorithms like CLOSET+ and FPCLOSE.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Frequet Closed Itemsets Mining
Elenco autori:
Orlando, Salvatore; Lucchese, Claudio; Perego, Raffaele
Link alla scheda completa: