Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection

Authors

Marijn Huijbregts, Mitchell McLaren, David Van Leeuwen

Publication date

2011/5/22

Conference

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pages

4436-4439

Publisher

IEEE

Description

In this paper we present a method for automatically generating acoustic sub-word units that can substitute conventional phone models in a query-by-example spoken term detection system. We generate the sub-word units with a modified version of our speaker diarization system. Given a speech recording, the original diarization system generates a set of speaker models in an unsupervised manner without the need for training or development data. Modifying the diarization system to process the speech of a single speaker and decreasing the minimum segment duration constraint allows us to detect speaker-dependent sub-word units. For the task of query-by-example spoken term detection, we show that the proposed system performs well on both broadcast and non-broadcast recordings, unlike a conventional phone-based system trained solely on broadcast data. A mean average precision of 0.28 and 0.38 was …

Total citations

Cited by 77

Citations per year: 2012: 8, 2013: 8, 2014: 10, 2015: 11, 2016: 3, 2017: 9, 2018: 5, 2019: 2, 2020: 4, 2021: 6, 2022: 7, 2023: 2, 2024: 1
