Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Generalized funnelling: ensemble learning and heterogeneous document embeddings for cross-lingual text classification

Conference Paper
Publication Date:
2021
abstract:
Funnelling (Fun) is a method for cross-lingual text classification (CLTC) based on a two-tier learning ensemble for heterogeneous transfer learning (HTL). In this ensemble method, 1st-tier classifiers, each working on a different and language-dependent feature space, return a vector of calibrated posterior probabilities (with one dimension for each class) for each document, and the final classification decision is taken by a metaclassifier that uses this vector as its input. In this paper we describe Generalized Funnelling (gFun), a generalization of Fun consisting of a HTL architecture in which 1st-tier components can be arbitrary view-generating functions, i.e., language-dependent functions that each produce a language-independent representation ("view") of the document. We describe an instance of gFun in which the metaclassifier receives as input a vector of calibrated posterior probabilities (as in Fun) aggregated to other embedded representations that embody other types of correlations. We describe preliminary results that we have obtained on a large standard dataset for multilingual multilabel text classification.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Transfer learning; Cross-lingual text classification; Ensemble learning; Word embeddings
List of contributors:
Pedrotti, Andrea; MOREO FERNANDEZ, ALEJANDRO DAVID; Sebastiani, Fabrizio
Authors of the University:
MOREO FERNANDEZ ALEJANDRO DAVID
SEBASTIANI FABRIZIO
Handle:
https://iris.cnr.it/handle/20.500.14243/398921
Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/398921/115698/prod_457947-doc_177825.pdf
Book title:
IIR 2021 - 11th Italian Information Retrieval Workshop
Published in:
CEUR WORKSHOP PROCEEDINGS
Series
  • Overview

Overview

URL

http://ceur-ws.org/Vol-2947/
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)