Question Classification by Convolutional Neural Networks Embodying Subword Information
Contributo in Atti di convegno
Data di Pubblicazione:
2018
Abstract:
Question Classification is a core module of Question Answering paradigm. Development of classification models based on neural networks showed that convolutional architectures allow obtaining uppermost results for this task. In particular, this type of approach avoids extracting features from questions, by treating text as a sequence of words, and transforming each word in a dense vector, named word embedding. Among different techniques to learn word embeddings, a recent approach takes into account also subword information, which could be very useful for morphologically rich languages. In this paper, a Question Classification approach based on word embedding using subword information and Convolutional Neural Networks is proposed, in order to improve classification accuracy. In particular, questions taken from a TRC dataset are considered, and a comparison between English and Italian languages is reported, by highlighting eventual improvements obtained by initializing word embeddings with advanced vectors learned in an unsupervised manner using skip- gram model and comprising character-based information.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
feature extraction; feedforward neural nets; learning (artificial intelligence); natural language processing; pattern classification; query processing text analysis
Elenco autori:
Esposito, Massimo; Pota, Marco
Link alla scheda completa: