A Systematic Evaluation of the Bag-of-frames Representation for Music Information Retrieval

Authors

Li Su, Chin-Chia Michael Yeh, Jen-Yu Liu, Ju-Chiang Wang, Yi-Hsuan Yang

Publication date

2014/3/11

Journal

IEEE Transactions on Multimedia

Pages

1188-1200

Publisher

IEEE

Description

There has been increasing attention to learning feature representations from the complex, high-dimensional audio data applied in various music information retrieval (MIR) problems. Unsupervised feature learning techniques, such as sparse coding and deep belief networks, have been utilized to represent music information as a term-document structure comprising elementary audio codewords. Despite the widespread use of such a bag-of-frames (BoF) model, few attempts have been made to systematically compare different component settings. Moreover, whether techniques developed in the text retrieval community are applicable to audio codewords is poorly understood. To further our understanding of the BoF model, we present in this paper a comprehensive evaluation that compares a large number of BoF variants on three different MIR tasks, by considering different ways of low-level feature …

Total citations

Cited by 67

Citations per year: 2013: 2, 2014: 10, 2015: 18, 2016: 7, 2017: 10, 2018: 6, 2019: 1, 2020: 3, 2021: 2, 2022: 4, 2023: 2, 2024: 1

Scholar articles

A systematic evaluation of the bag-of-frames representation for music information retrieval

L Su, CCM Yeh, JY Liu, JC Wang, YH Yang - IEEE Transactions on Multimedia, 2014