Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Heterogeneous document embeddings for cross-lingual text classification

Contributo in Atti di convegno
Data di Pubblicazione:
2021
Abstract:
Funnelling (Fun) is a method for cross-lingual text classification (CLC) based on a two-tier ensemble for heterogeneous transfer learning. In Fun, 1st-tier classifiers, each working on a different, language-dependent feature space, return a vector of calibrated posterior probabilities (with one dimension for each class) for each document, and the final classification decision is taken by a meta-classifier that uses this vector as its input. The metaclassifier can thus exploit class-class correlations, and this (among other things) gives Fun an edge over CLC systems where these correlations cannot be leveraged. We here describe Generalized Funnelling (gFun), a learning ensemble where the metaclassifier receives as input the above vector of calibrated posterior probabilities, concatenated with document embeddings (aligned across languages) that embody other types of correlations, such as word-class correlations (as encoded by Word-Class Embeddings) and word-word correlations (as encoded by Multilingual Unsupervised or Supervised Embeddings). We show that gFun improves on Fun by describing experiments on two large, standard multilingual datasets for multi-label text classification.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Heterogeneous transfer learning; Transfer learning; Text classification; Ensemble learning; Word embeddings
Elenco autori:
Pedrotti, Andrea; MOREO FERNANDEZ, ALEJANDRO DAVID; Sebastiani, Fabrizio
Autori di Ateneo:
MOREO FERNANDEZ ALEJANDRO DAVID
SEBASTIANI FABRIZIO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/399578
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/399578/124348/prod_456428-doc_176637.pdf
Titolo del libro:
SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied Computing
  • Dati Generali

Dati Generali

URL

https://dl.acm.org/doi/10.1145/3412841.3442093
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)