Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Automatic Incremental Term Acquisition from Domain Corpora

Contributo in Atti di convegno
Data di Pubblicazione:
2005
Abstract:
We describe a technique for the acquisition of terms from Italian domain text corpora, which relies both on sophisticated linguistic analysis and on statistical measures applied to linguistically processed text rather than to raw text as it is usually the case. The main advantage of this technique is that minimal a priori knowledge of term structure is required, thus allowing to explore and discover terms in a given domain without imposing a strict pattern matching structure on them, and also to easily extend it to different domains. The approach we present in this paper is incremental as it may be iterated to discover terms of increasing complexity built on top of terms discovered in the previous iteration. The reason why it is convenient to adopt such an incremental approach is that it allows to "clean" data from noise in the first step, elicitating the constituent terms, and then to refine term acquisition on "skimmed" term data.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Elenco autori:
Pirrelli, Vito; Montemagni, Simonetta; Bartolini, Roberto
Autori di Ateneo:
BARTOLINI ROBERTO
MONTEMAGNI SIMONETTA
PIRRELLI VITO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/431279
Titolo del libro:
Proceedings of TKE 2005 - 7th International Conference on Terminology and Knowledge Engineering
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)