Authors
Toni Heittola, Annamaria Mesaros, Tuomas Virtanen, Antti Eronen
Publication date
2011
Conference
Machine Listening in Multisource Environments
Description
This paper proposes a sound event detection system for natural multisource environments, using a sound source separation front-end. The recognizer aims at detecting sound events from various everyday contexts. The audio is preprocessed using non-negative matrix factorization and separated into four individual signals. Each sound event class is represented by a Hidden Markov Model trained using mel frequency cepstral coefficients extracted from the audio. Each separated signal is used individually for feature extraction and then segmentation and classification of sound events using the Viterbi algorithm. The separation allows detection of a maximum of four overlapping events. The proposed system shows a significant increase in event detection accuracy compared to a system able to output a single sequence of events.
Total citations
2012201320142015201620172018201920202021202220232024513159142122181510766
Scholar articles
T Heittola, A Mesaros, T Virtanen, A Eronen - Machine Listening in Multisource Environments, 2011