Data di Pubblicazione:
2017
Abstract:
The need of smart information retrieval systems is in contrast with the difficulties to deal with huge amount of data. In this paper we present a Big Data Analytics architecture used to implement a semantic similarity search tool for natural language texts in biomedical domain. The implemented methodology is based on Word Embeddings (WEs) models obtained using the word2vec algorithm. The system has been assessed with documents extracted from the whole PubMed library. It will be also presented a user friendly web front-end able to assess the methodology on a real context.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Big Data Analytics; Natural Language Processing; Word Embeddings; SPARK; PubMed; Semantic Similarity Search; Bio-Medical Literature; Semantic Similarity Search; Semantics
Elenco autori:
Silvestri, Stefano; Ciampi, Mario; Gargiulo, Francesco
Link alla scheda completa: