RSS

SPRACH: Speech Recognition Algorithms for Connectionist Hybrids
Principal Investigator: Steve Renals

Funded by EC RTD in IT Basic Research Project 20077 from 15 Dec 1995
Partners: FPMs (Mons, BE), Cambridge University (UK), INESC (Lisbon, PT), Sheffield University (UK)
Industrial Advisors: BBC (UK), CSELT (IT), Daimler-Benz (DE), Thomson-CSF (FR)

Keywords: speech recognition, hidden Markov models (HMM), artificial neural networks (ANN), statistical inference in ANNs, hybrid HMM/ANN technology, language models, application domain adaptation, neural network hardware and software, speech recognition applications.

The goal of SPRACH is to further improve the current state-of-the-art in continuous speech recognition using Artificial Neural Network (ANN) and Hidden Markov Model (HMM) approaches. Pursuing the theoretical and development work successfully carried out under the WERNICKE project (ESPRIT Basic Research Project 6487, October 1992-October 1995), SPRACH will extend the research to robust and flexible speech recognition systems that can easily be adapted to new languages and new domains with new lexica and new syntaxes.

In WERNICKE, on top of substantial theoretical results, it was demonstrated, using standard international reference databases (such as the unlimited vocabulary ARPA North American Business News database, and the EU funded SQALE project that the hybrid HMM/ANN approaches lead to competitive state-of-the-art speech recognizers. Furthermore, the investigated hybrid approach was shown to have additional advantages in terms of CPU utilization and memory bandwidth. These conclusions have been confirmed by many different independent sources.

While building on the WERNICKE large vocabulary continuous speech recognition system, SPRACH will also investigate the development of flexible systems for smaller, task independent applications, in different languages (UK English, French and Portuguese).

The industrial relevance of this project is high, and many useful results are expected. Firstly, it is clear that speech processing, and speech recognition in particular, will play a major role in the future multimedia and telematics applications. In this respect, while SPRACH is fully exploiting the promising HMM/ANN technology, it also addresses most of the relevant issues of speech recognition in general, such as language and lexicon modeling, application domain adaptation, and prototype development. Secondly, on top of its obvious relevance to the speech recognition technology, it is also important to note that, motivated by the results achieved in Wernicke, these hybrid systems have already been adopted by several industries and laboratories in many different areas.

To reinforce the industrial relevance of this project and its possible industrial impact, a SPRACH Industrial Advisory Board has been set up.