Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Morpheme-based recognition and translation of medical terms

Capitolo di libro
Data di Pubblicazione:
2016
Abstract:
In this paper we use Nooj to solve a recognition and translation task on medical terms with a morphosemantic approach. The Medical domain is characterized by a huge number of different terms that appear in corpora with very low frequencies. For this reason, machine learning or statistical approaches do not achieve good results on this domain. In our work we apply a morpho-semantic approach that take advantage from a number of Italian and English word-formation strategies for the automatic analysis of Italian words and for the generation of Italian/English bilingual lexicons in the medical sub-code. Using Nooj we built a series of Italian and bilingual dictionaries of morphemes, a set of morphological grammars that specify how morphemes combine with each other, a syntactic grammar for the recognition of compound terms and a Finite State Transducer (FST) for the translation of medical terms based on morphemes. This approach produces as output: a categorized Italian electronic dictionary of medical simple words, provided with labels specifying the meaning of each term; a Thesaurus of simple and compound medical terms, organized in 22 medical subcategories; A an Italian/English translation of medical terms.
Tipologia CRIS:
02.01 Contributo in volume (Capitolo o Saggio)
Keywords:
Medica Domain; Morpho-Semantics; Finite-State Automata; Automatic Processing of Natural-Language Electronic Texts with NooJ
Elenco autori:
Guarasci, Raffaele
Autori di Ateneo:
GUARASCI RAFFAELE
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/420076
Titolo del libro:
Automatic Processing of Natural-Language Electronic Texts with NooJ
Pubblicato in:
COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE (PRINT)
Series
  • Dati Generali

Dati Generali

URL

http://dx.doi.org/10.1007/978-3-319-42471-2_15
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)