That Looks Hard: Characterizing Linguistic Complexity in Humans and Language Models

Conference Paper
Publication Date:
2021
Abstract:
This paper investigates the relationship between two complementary perspectives in the human assessment of sentence complexity and how they are modeled in a neural language model (NLM). The first perspective takes into account multiple online behavioral metrics obtained from eye-tracking recordings. The second one concerns the offline perception of complexity measured by explicit human judgments. Using a broad spectrum of linguistic features modeling lexical, morpho-syntactic, and syntactic properties of sentences, we perform a comprehensive analysis of linguistic phenomena associated with the two complexity viewpoints and report similarities and differences. We then show the effectiveness of linguistic features when explicitly leveraged by a regression model for predicting sentence complexity and compare its results with the ones obtained by a fine-tuned neural language model. We finally probe the NLM's linguistic competence before and after fine-tuning, highlighting how linguistic information encoded in representations changes when the model learns to predict complexity.
Iris type:
04.01 Contribution in conference proceedings
Keywords:
linguistic complexity; eyetracking; human evaluation
List of contributors:
Dell'Orletta, Felice; Brunato, Dominique Pierina
Authors of the University:
Brunato, Dominique Pierina
Dell'Orletta, Felice
Handle:
https://iris.cnr.it/handle/20.500.14243/440173
URL:
https://aclanthology.org/2021.cmcl-1.5