Authors
Darren Moore, John Dines, Mathew Magimai Doss, Jithendra Vepa, Octavian Cheng, Thomas Hain
Publication date
2006
Description
A major component in the development of any speech recognition system is the decoder. As task complexities and, consequently, system complexities have continued to increase, the decoding problem has become an increasingly significant component in the overall speech recognition system development effort, with efficient decoder design contributing significantly to improving the trade-off between decoding time and search errors. In this paper we present the “Juicer” (from transducer) large vocabulary continuous speech recognition (LVCSR) decoder based on the weighted finite-state transducer (WFST). We begin with a discussion of the need for open source, state-of-the-art decoding software in LVCSR research and how this led to the development of Juicer, followed by a brief overview of decoding techniques and major issues in decoder design. We present Juicer and its major features, emphasising its …
Total citations
[Chart: citations per year, 2006 to 2022]
Scholar articles
D Moore, J Dines, MM Doss, J Vepa, O Cheng, T Hain - Machine Learning for Multimodal Interaction: Third …, 2006