Data di Pubblicazione:
2008
Abstract:
A novel approach for reconciling tuples stored as free text into an existing attribute schema is proposed. The basic idea is to subject the available text to progressive classification, i.e., a multi-stage classification scheme where, at each intermediate stage, a classifier is learnt that analyzes the textual fragments not reconciled at the end of the previous steps. Classifica- tion is accomplished by an ad hoc exploitation of traditional association mining algorithms, and is supported by a data transformation scheme which takes advantage of domain-specific dictionaries/ontologies. A key feature is the capability of progressively enriching the avail- able ontology with the results of the previous stages of classification, thus significantly improving the overall classification accuracy. An extensive experimental evaluation shows the effectiveness of our approach.
Tipologia CRIS:
01.01 Articolo in rivista
Elenco autori:
Manco, Giuseppe; Cesario, Eugenio; Ortale, Riccardo; Folino, FRANCESCO PAOLO
Link alla scheda completa:
Pubblicato in: