View article

[PDF] from qut.edu.au

Discriminative optimisation of the Figure of Merit for phonetic spoken term detection

Authors

Roy Wallace, Brendan Baker, Robbie Vogt, Sridha Sridharan

Publication date

2010

Journal

Audio, Speech, and Language Processing, IEEE Transactions on

Volume

Issue

Pages

1677-1687

Publisher

IEEE

Description

This paper proposes to improve spoken term detection (STD) accuracy by optimizing the figure of merit (FOM). In this paper, the index takes the form of a phonetic posterior-feature matrix. Accuracy is improved by formulating STD as a discriminative training problem and directly optimizing the FOM, through its use as an objective function to train a transformation of the index. The outcome of indexing is then a matrix of enhanced posterior-features that are directly tailored for the STD task. The technique is shown to improve the FOM by up to 13% on held-out data. Additional analysis explores the effect of the technique on phone recognition accuracy, examines the actual values of the learned transform, and demonstrates that using an extended training data set results in further improvement in the FOM.

Total citations

Cited by 15

2012201320142015201620172 5 3 2 1 1

Scholar articles

Discriminative optimization of the figure of merit for phonetic spoken term detection

R Wallace, B Baker, R Vogt, S Sridharan - IEEE transactions on audio, speech, and language …, 2010