Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Why is this language complex? Cherry-pick the optimal set of features in multilingual treebanks

Academic Article
Publication Date:
2022
abstract:
This paper investigates linguistic complexity across natural languages from a corpus-based perspective and relies on the assumptions of linguistic profiling as a methodological framework. We focus in particular on the domain of syntactic complexity and analyze the distribution of a set of features taken as proxies of complexity phenomena at the sentence level, which were extracted from 63 treebanks annotated according to the Universal Dependencies formalism. This dataset guarantees that the features considered are modeling the same linguistic phenomena in different treebanks, allowing reliable comparison among languages. We show that our approach is able to identify tendencies of structural proximity between languages not necessarily in line with typologically-supported classification, thus shedding light on new corpus-based findings.
Iris type:
01.01 Articolo in rivista
Keywords:
Linguistic Complexity; Linguistic Profiling; Universal Dependencies; Syntactic Domain
List of contributors:
Venturi, Giulia; Brunato, DOMINIQUE PIERINA
Authors of the University:
BRUNATO DOMINIQUE PIERINA
VENTURI GIULIA
Handle:
https://iris.cnr.it/handle/20.500.14243/420475
Published in:
LINGUISTICS VANGUARD
Journal
  • Overview

Overview

URL

https://www.degruyter.com/document/doi/10.1515/lingvan-2021-0017/html
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)