UNI-FIND

Assessing BERT's ability to learn Italian syntax: a study on null-subject and agreement phenomena

Article
Publication date:
2021
Abstract:
This paper investigates whether a BERT neural language model pretrained on Italian embeds syntactic dependency relations in its layers, by approximating a dependency parse tree. To this end, a structural probe, namely a supervised model able to extract linguistic structures from a language model, was trained on the contextual embeddings from the layers of BERT. The experimental assessment used an Italian version of the BERT-base model and a set of Italian datasets annotated with the Universal Dependencies formalism. The results, measured with standard dependency-parsing metrics, show that knowledge of Italian syntax is embedded in the central-upper layers of the BERT model, consistent with what has been observed in the literature for English. In addition, the probe was used to experimentally evaluate the behaviour of the BERT model on two specific syntactic phenomena in Italian, namely null subjects and subject-verb agreement, where it performed better than a state-of-the-art Italian parser. These findings can open a path towards new hybrid approaches that exploit the probe to address limits or weaknesses in analysing articulated constructions of Italian syntax, which are traditionally difficult to parse.
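As a rough illustration of what a structural probe computes, the sketch below follows the distance-based formulation described in the literature for English (Hewitt and Manning, 2019): a linear map B is applied to the contextual embeddings, squared distances between mapped word vectors stand in for tree distances, and a minimum spanning tree over those distances gives an undirected parse. The abstract does not state which formulation, rank, or metric the authors used, so the function names, the probe rank of 64, the random stand-in embeddings, and the undirected attachment score are all assumptions made for illustration; only the hidden size of 768 is a known property of BERT-base.

```python
import numpy as np

def probe_distances(H, B):
    """Squared probe distances ||B(h_i - h_j)||^2 for every word pair.

    H: (n_words, hidden) contextual embeddings from one layer.
    B: (rank, hidden) probe matrix (learned in practice, random here).
    """
    T = H @ B.T                               # project each word: (n_words, rank)
    diff = T[:, None, :] - T[None, :, :]      # pairwise differences
    return (diff ** 2).sum(axis=-1)           # (n_words, n_words)

def minimum_spanning_tree(D):
    """Prim's algorithm on a dense distance matrix.

    Returns a set of undirected edges (i, j) with i < j.
    """
    n = D.shape[0]
    in_tree = [0]
    edges = set()
    while len(in_tree) < n:
        best = None
        for i in in_tree:
            for j in range(n):
                if j not in in_tree and (best is None or D[i, j] < best[0]):
                    best = (D[i, j], i, j)
        _, i, j = best
        edges.add((min(i, j), max(i, j)))
        in_tree.append(j)
    return edges

def undirected_attachment_score(predicted, gold):
    """Fraction of gold undirected edges recovered by the predicted tree."""
    return len(predicted & gold) / len(gold)

# Hypothetical stand-ins: a 5-word sentence and an untrained probe.
rng = np.random.default_rng(0)
H = rng.normal(size=(5, 768))    # 768 = BERT-base hidden size
B = rng.normal(size=(64, 768))   # probe rank 64 is an arbitrary choice
D = probe_distances(H, B)
tree = minimum_spanning_tree(D)
```

In an actual experiment, B would be trained so that the probe distances match gold tree distances from a Universal Dependencies treebank, and the comparison across layers would be made by repeating the procedure with each layer's embeddings.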
CRIS type:
01.01 Journal article
Keywords:
Neural language model; Syntactic phenomena; Dependency parse tree; Structural probe
Author list:
Silvestri, Stefano; DE PIETRO, Giuseppe; Esposito, Massimo; Guarasci, Raffaele
Institution-affiliated authors:
ESPOSITO MASSIMO
GUARASCI RAFFAELE
SILVESTRI STEFANO
Link to the full record:
https://iris.cnr.it/handle/20.500.14243/398750
Published in:
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING
Journal
General Data

URL

https://link.springer.com/epdf/10.1007/s12652-021-03297-4?sharing_token=aJn18omkSd895dkniOaROfe4RwlQNchNByi7wbcMAY4sLtZvFHt9KeGs_VdwyeG6X_e4vDj700qMBRis9yLnzuo95fXJBzupnObB79_iKDWw52rVMAMfxgIb7H1UjKyInL95t-Up7lztMutQu263Dd-VKWOhagAi9fPapeTKbY4%3D