Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema

Articolo
Data di Pubblicazione:
2012
Abstract:
In this paper, a psychologically-inspired binary cascade classification schema is proposed for speech emotion recognition. Performance is enhanced because commonly confused pairs of emotions are distinguishable from one another. Extracted features are related to statistics of pitch, formants, and energy contours, as well as spectrum, cepstrum, perceptual and temporal features, autocorrelation, MPEG-7 descriptors, Fujisaki's model parameters, voice quality, jitter, and shimmer. Selected features are fed as input to K nearest neighborhood classifier and to support vector machines. Two kernels are tested for the latter: linear and Gaussian radial basis function. The recently proposed speaker-independent experimental protocol is tested on the Berlin emotional speech database for each gender separately. The best emotion recognition accuracy, achieved by support vector machines with linear kernel, equals 87.7%, outperforming state-of-the-art approaches. Statistical analysis is first carried out with respect to the classifiers' error rates and then to evaluate the information expressed by the classifiers' confusion matrices.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Emotion recognition; Large-scale feature extraction; Binary classification schema
Elenco autori:
Kotti, Margarita; Paterno', Fabio
Autori di Ateneo:
PATERNO' FABIO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/262437
Pubblicato in:
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY
Journal
  • Dati Generali

Dati Generali

URL

http://link.springer.com/article/10.1007/s10772-012-9127-7
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)