Authors
Roy Wallace, Robbie Vogt, Sridha Sridharan
Publication date
2009/4/19
Conference
2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Pages
4881-4884
Publisher
IEEE
Description
While spoken term detection (STD) systems based on word indices provide good accuracy, there are several practical applications where it is infeasible or too costly to employ an LVCSR engine. An STD system is presented, which is designed to incorporate a fast phonetic decoding front-end and be robust to decoding errors whilst still allowing for rapid search speeds. This goal is achieved through monophone open-loop decoding coupled with fast hierarchical phone lattice search. Results demonstrate that an STD system that is designed with the constraint of a fast and simple phonetic decoding front-end requires a compromise to be made between search speed and search accuracy.
Total citations
[Chart: citations per year, 2009-2022]
Scholar articles
R Wallace, R Vogt, S Sridharan - 2009 IEEE International Conference on Acoustics …, 2009