Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Home

Flexible Acquisition of Subcategorization Frames in Italian

Contributo in Atti di convegno
Data di Pubblicazione:
2012
Abstract:
Lexica of predicate-argument structures constitute a useful tool for several tasks in NLP. This paper describes a web-service system for automatic acquisition of verb subcategorization frames (SCFs) from parsed data in Italian. The system acquires SCFs in an unsupervised manner. We created two gold standards for the evaluation of the system, the first by mixing together information from two lexica (one manually created and the second automatically acquired) and manual exploration of corpus data and the other annotating data extracted from a specialized corpus (environmental domain). Data filtering is accomplished by means of the maximum likelihood estimate (MLE). The evaluation phase has allowed us to identify the best empirical MLE threshold for the creation of a lexicon (P=0.653, R=0.557, F1=0.601). In addition to this, we assigned to the extracted entries of the lexicon a confidence score based on the relative frequency and evaluated the extractor on domain specific data. The confidence score will allow the final user to easily select the entries of the lexicon in terms of their reliability: one of the most interesting feature of this work is the possibility the final users have to customize the results of the SCF extractor, obtaining different SCF lexica in terms of size and accuracy.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
lexicon; automatic acquisition; subcategorisation frames
Elenco autori:
Caselli, Tommaso; Russo, Irene; Frontini, Francesca; Rubino, Francesco; Quochi, Valeria
Autori di Ateneo:
FRONTINI FRANCESCA
QUOCHI VALERIA
RUSSO IRENE
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/222834
Titolo del libro:
Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)
  • Dati Generali

Dati Generali

URL

http://www.lrec-conf.org/proceedings/lrec2012/summaries/390.html
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)