Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Word-class embeddings for multiclass text classification

Articolo
Data di Pubblicazione:
2021
Abstract:
Pre-trained word embeddings encode general word semantics and lexical regularities of natural language, and have proven useful across many NLP tasks, including word sense disambiguation, machine translation, and sentiment analysis, to name a few. In supervised tasks such as multiclass text classification (the focus of this article) it seems appealing to enhance word representations with ad-hoc embeddings that encode task-specific information. We propose (supervised) word-class embeddings (WCEs), and show that, when concatenated to (unsupervised) pre-trained word embeddings, they substantially facilitate the training of deep-learning models in multiclass classification by topic. We show empirical evidence that WCEs yield a consistent improvement in multiclass classification accuracy, using six popular neural architectures and six widely used and publicly available datasets for multiclass text classification. One further advantage of this method is that it is conceptually simple and straightforward to implement. Our code that implements WCEs is publicly available at https://github.com/AlexMoreo/word-class-embeddings.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Machine learning; Text classification; Language models; Neural networks; Deep learning
Elenco autori:
Esuli, Andrea; MOREO FERNANDEZ, ALEJANDRO DAVID; Sebastiani, Fabrizio
Autori di Ateneo:
ESULI ANDREA
MOREO FERNANDEZ ALEJANDRO DAVID
SEBASTIANI FABRIZIO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/397836
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/397836/102349/prod_454276-doc_175039.pdf
https://iris.cnr.it//retrieve/handle/20.500.14243/397836/102353/prod_454276-doc_175070.pdf
Pubblicato in:
DATA MINING AND KNOWLEDGE DISCOVERY
Journal
  • Dati Generali

Dati Generali

URL

https://link.springer.com/article/10.1007/s10618-020-00735-3
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)