Connected Digit Recognition Experiments with the OGI Toolkit's Neural Network and HMM-Based Recognizers
Contributo in Atti di convegno
Data di Pubblicazione:
1998
Abstract:
This paper describes a series of experiments that compare different approaches to training a speaker-independent continuous-speech digit recognizer using the CSLU Toolkit. Comparisons are made between the Hidden Markov Model (HMM) and Neural Network (NN) approaches. In addition, a description of the CSLU Toolkit research environment is given.
The CSLU Toolkit is a research and development software environment that provides a powerful and flexible tool for creating and using spoken language systems for telephone and PC applications. In particular, the CSLU-HMM, the CSLU-NN, and the CSLU-FBNN development environments, with which our experiments were implemented, will be described in detail and recognition results will be compared.
Our speech corpus is OGI 30K-Numbers, which is a collection of spontaneous ordinal and cardinal numbers, continuous digit strings and isolated digit strings. The utterances were recorded by having a large number of people recite their ZIP code, street address, or other numeric information over the telephone. This corpus represents a very noisy and difficult recognition task.
Our best results (98% word recognition, 92% sentence recognition), obtained with the FBNN architecture, suggest the effectiveness of the CSLU Toolkit in building real-life speech recognition systems.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Connected Digit Recognition; OGI Toolkit; Neural Network; HMM
Elenco autori:
Cosi, Piero
Link alla scheda completa:
Titolo del libro:
Proceedings of 4th IEEE Workshop on Interactive Voice Technology for Telecommunications Applications