Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Why is this language complex? Cherry-pick the optimal set of features in multilingual treebanks

Articolo
Data di Pubblicazione:
2022
Abstract:
This paper investigates linguistic complexity across natural languages from a corpus-based perspective and relies on the assumptions of linguistic profiling as a methodological framework. We focus in particular on the domain of syntactic complexity and analyze the distribution of a set of features taken as proxies of complexity phenomena at the sentence level, which were extracted from 63 treebanks annotated according to the Universal Dependencies formalism. This dataset guarantees that the features considered are modeling the same linguistic phenomena in different treebanks, allowing reliable comparison among languages. We show that our approach is able to identify tendencies of structural proximity between languages not necessarily in line with typologically-supported classification, thus shedding light on new corpus-based findings.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Linguistic Complexity; Linguistic Profiling; Universal Dependencies; Syntactic Domain
Elenco autori:
Venturi, Giulia; Brunato, DOMINIQUE PIERINA
Autori di Ateneo:
BRUNATO DOMINIQUE PIERINA
VENTURI GIULIA
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/420475
Pubblicato in:
LINGUISTICS VANGUARD
Journal
  • Dati Generali

Dati Generali

URL

https://www.degruyter.com/document/doi/10.1515/lingvan-2021-0017/html
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)