Data di Pubblicazione:
2020
Abstract:
The problem of creating a fully automated specific-domain thesaurus is very topical. The paper presents a novel method to address this problem in the Italian language. The main feature of this approach is the integration of different methods: machine learning classification methods working on the semantic representation of candidate terms, word embeddings models, able to capture the semantics of words, and a computation of the degree of specialization of a term. The work is in progress and results obtained so far are promising.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Classification Methods; Word Em; Probability; Food; Italian Language
Elenco autori:
Gagliardi, Isabella; Artese, MARIA TERESA
Link alla scheda completa:
Pubblicato in: