A latent semantic approach to XML clustering by content and structure based on non-negative matrix factorization

Conference proceedings contribution
Publication date:
2013
Abstract:
Non-negative matrix factorization is widely used in text clustering. We investigate its use in the XML domain for clustering XML documents by structure and content into topically homogeneous groups. Non-negative matrix factorization is performed through an alternating least squares method, which incorporates expedients to reduce the cost of large-scale factorizations. This is especially relevant when massive text-centric XML corpora are processed. Empirical evidence from a comparative evaluation on real-world XML corpora shows that our approach outperforms several state-of-the-art competitors in effectiveness. © 2013 IEEE.
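
As a rough illustration of the technique named in the abstract, the following is a minimal sketch of non-negative matrix factorization by projected alternating least squares, followed by a cluster assignment. It is not the authors' implementation: the abstract does not describe how XML structure and content are encoded as features, nor the expedients used for large-scale factorizations. The function names `nmf_als` and `assign_clusters`, the documents-by-features layout of `V`, and the clipping of negative entries to zero are all assumptions made for this example.

```python
import numpy as np


def nmf_als(V, rank, n_iter=100, seed=0):
    """Approximate V (documents x features) as W @ H with W, H >= 0.

    Plain projected ALS (an assumption, not the paper's variant): each
    factor is solved by unconstrained least squares, then negative
    entries are clipped to zero.
    """
    rng = np.random.default_rng(seed)
    n_docs, _ = V.shape
    W = rng.random((n_docs, rank))
    for _ in range(n_iter):
        # Fix W, solve W @ H ~= V for H, then project onto H >= 0.
        H = np.linalg.lstsq(W, V, rcond=None)[0]
        H = np.maximum(H, 0.0)
        # Fix H, solve H.T @ W.T ~= V.T for W, then project onto W >= 0.
        W = np.linalg.lstsq(H.T, V.T, rcond=None)[0].T
        W = np.maximum(W, 0.0)
    return W, H


def assign_clusters(W):
    """Assign each document to the latent factor with the largest weight."""
    return np.argmax(W, axis=1)
```

With a non-negative matrix `V` whose rows are documents, `W, H = nmf_als(V, rank=k)` yields `k` latent factors, and `assign_clusters(W)` returns one cluster index per document. Projected ALS of this kind is simple but carries no convergence guarantee, so results may vary with the random seed.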
CRIS type:
04.01 Conference proceedings contribution
Author list:
Ortale, Riccardo; Costa, Giovanni
Institutional authors:
COSTA GIOVANNI
ORTALE RICCARDO
Link to full record:
https://iris.cnr.it/handle/20.500.14243/268336

General Data

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-84899452286&origin=inward