An unsupervised data-driven cross-lingual method for building high precision sentiment lexicons
Conference Paper
Publication Date:
2013
abstract:
In this paper we present a completely unsupervised approach for creating a sentiment lexicon. The approach has been realized by designing a pipeline which implements an unsupervised system that covers different aspects: the automatic extraction of user reviews, the pre-processing of text, the use of a scoring measure which combines: entropy, term frequency, inverse document frequency, and finally a cross lingual intersection. We have validated the approach though the analysis of app reviews present in the Google Play market. The results show the effectiveness of the approach given by satisfactory values of precision for the obtained lexicon.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Sentiment Analysis; Sentiment Lexicon; Machine Learning
List of contributors: