A Tree-based Approach to Clustering XML Documents by Structure

Conference proceedings paper
Publication date:
2004
Abstract:
We propose a novel methodology for clustering XML documents on the basis of their structural similarities. The basic idea is to equip each cluster with an XML cluster representative, i.e. an XML document subsuming the most typical structural specifics of a set of XML documents. Clustering is essentially accomplished by comparing cluster representatives, and updating the representatives as soon as new clusters are detected. We propose an algorithm for computing an XML representative through three phases. Suitable techniques for identifying significant node matchings and for reliably merging and pruning XML trees are investigated. Also, experimental evaluation performed on both synthetic and real data shows the effectiveness of our approach.
CRIS type:
04.01 Conference proceedings paper
Author list:
Manco, Giuseppe; Ortale, Riccardo; Costa, Giovanni
Institutional authors:
COSTA GIOVANNI
MANCO GIUSEPPE
ORTALE RICCARDO
Link to the full record:
https://iris.cnr.it/handle/20.500.14243/14609
General data

URL

http://www.springerlink.com/content/eb7k2a8b9ye6fhqq/