Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

PaCCSS-IT: A Parallel Corpus of Complex-Simple Sentences for Automatic Text Simplification

Conference Paper
Publication Date:
2016
abstract:
In this paper we present PaCCSS-IT, a Parallel Corpus of Complex-Simple Sentences for ITalian. To build the resource we develop a new method for automatically acquiring a corpus of complex-simple paired sentences able to intercept structural transformations and particularly suitable for text simplification. The method requires a wide amount of texts that can be easily extracted from the web making it suitable also for less-resourced languages. We test it on the Italian language making available the biggest Italian corpus for automatic text simplification.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Automatic Text Simplification; Sentence alignment; Italian corpus
List of contributors:
Venturi, Giulia; Cimino, Andrea; Brunato, DOMINIQUE PIERINA; Dell'Orletta, Felice
Authors of the University:
BRUNATO DOMINIQUE PIERINA
DELL'ORLETTA FELICE
VENTURI GIULIA
Handle:
https://iris.cnr.it/handle/20.500.14243/333951
  • Overview

Overview

URL

https://www.aclweb.org/anthology/D/D16/D16-1034.pdf
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)