Authors
John Dines, Mathew Magimai Doss
Publication date
2007/6/28
Book
International Workshop on Machine Learning for Multimodal Interaction
Pages
215-226
Publisher
Springer Berlin Heidelberg
Description
In this paper we present a study of automatic speech recognition systems using context-dependent phonemes and graphemes as sub-word units based on the conventional HMM/GMM system as well as tandem system. Experimental studies conducted on three different continuous speech recognition tasks show that systems using only context-dependent graphemes can yield competitive performance on small to medium vocabulary tasks when compared to a context-dependent phoneme-based automatic speech recognition system. In particular, we demonstrate the utility of tandem features that use an MLP trained to estimate phoneme posterior probabilities in improving grapheme based recognition system performance by implicitly incorporating phonemic knowledge into the system without having to define a phonetically transcribed lexicon.
Total citations
20082009201020112012201320142015201620172018201920202021202220232242562231111
Scholar articles
J Dines, M Magimai Doss - International Workshop on Machine Learning for …, 2007