Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Learning to weight for text classification

Articolo
Data di Pubblicazione:
2018
Abstract:
In information retrieval (IR) and related tasks, term weighting approaches typically consider the frequency of the term in the document and in the collection in order to compute a score reflecting the importance of the term for the document. In tasks characterized by the presence of training data (such as text classification) it seems logical to design a term weighting function that leverages the distribution (as estimated from training data) of the term across the classes of interest. Although "supervised term weighting" approaches that use this intuition have been described before, they have failed to show consistent improvements. In this article we analyse the possible reasons for this failure, and call consolidated assumptions into question. Following this criticism, we propose a novel supervised term weighting approach that, instead of relying on any predefined formula, learns a term weighting function optimised on the training set of interest; we dub this approach Learning to Weight (LTW). The experiments that we have run on several well-known benchmarks, and using different learning methods, show that our method outperforms previous term weighting approaches in text classification.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Term weighting; Supervised term weighting; Text classification; Neural networks; Deep learning
Elenco autori:
Esuli, Andrea; MOREO FERNANDEZ, ALEJANDRO DAVID; Sebastiani, Fabrizio
Autori di Ateneo:
ESULI ANDREA
MOREO FERNANDEZ ALEJANDRO DAVID
SEBASTIANI FABRIZIO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/359350
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/359350/20751/prod_401311-doc_139450.pdf
Pubblicato in:
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (ONLINE)
Journal
  • Dati Generali

Dati Generali

URL

https://ieeexplore.ieee.org/document/8550687
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)