Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

An ELECTRA-Based Model for Neural Coreference Resolution

Articolo
Data di Pubblicazione:
2022
Abstract:
In last years, coreference resolution has received a sensibly performance boost exploiting different pre-trained Neural Language Models, from BERT to SpanBERT until Longformer. This work is aimed at assessing, for the first time, the impact of ELECTRA model on this task, moved by the experimental evidence of an improved contextual representation and better performance on different downstream tasks. In particular, ELECTRA has been employed as representation layer in an assessed neural coreference architecture able to determine entity mentions among spans of text and to best cluster them. The architecture itself has been optimized: i) by simplifying the modality of representation of spans of text but still considering both the context they appear and their entire content, ii) by maximizing both the number and length of input textual segments to exploit better the improved contextual representation power of ELECTRA, iii) by maximizing the number of spans of text to be processed, since potentially representing mentions, preserving computational efficiency. Experimental results on the OntoNotes dataset have shown the effectiveness of this solution from both a quantitative and qualitative perspective, and also with respect to other state-of-the-art models, thanks to a more proficient token and span representation. The results also hint at the possible use of this solution also for low-resource languages, simply requiring a pre-trained version of ELECTRA instead of language-specific models trained to handle either spans of text or long documents.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Coreference resolution; ELECTRA; neural language model; OntoNotes; natural language processing
Elenco autori:
Damiano, Emanuele; DE PIETRO, Giuseppe; Esposito, Massimo; Minutolo, Aniello; Gargiulo, Francesco; Guarasci, Raffaele
Autori di Ateneo:
DAMIANO EMANUELE
ESPOSITO MASSIMO
GARGIULO FRANCESCO
GUARASCI RAFFAELE
MINUTOLO ANIELLO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/417073
Pubblicato in:
IEEE ACCESS
Journal
  • Dati Generali

Dati Generali

URL

https://ieeexplore.ieee.org/abstract/document/9826714
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)