Interpretable data partitioning through tree-based clustering methods
Contribution in conference proceedings
Publication date:
2023
Abstract:
Riccardo Guidotti, Cristiano Landi, Andrea Beretta, Daniele Fadda & Mirco Nanni
First Online: 08 October 2023
Part of the Lecture Notes in Computer Science book series (LNAI, volume 14276)
The growing field of interpretable machine learning research focuses mainly on explaining supervised approaches. However, unsupervised approaches might also benefit from considering interpretability. While existing clustering methods only assign records to clusters without justifying the partitioning, we propose tree-based clustering methods that offer interpretable data partitioning through a shallow decision tree. These decision trees explain cluster assignments through short, easy-to-understand split conditions. The proposed methods are evaluated through experiments on synthetic and real datasets and are shown to be more effective than both traditional and interpretable clustering approaches in terms of standard evaluation measures and runtime. Finally, a case study involving human participants demonstrates the effectiveness of the interpretable clustering trees returned by the proposed method.
CRIS type:
04.01 Contribution in conference proceedings
Keywords:
Interpretable clustering; Tree-based clustering; Interpretable data partitioning; Explainable unsupervised learning
Author list:
Guidotti, Riccardo; Beretta, Andrea; Nanni, Mirco; Fadda, Daniele
Link to the full record:
Book title:
Discovery Science