Authors
Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei, Hong-Jiang Zhang
Publication date
2007/9/29
Book
Proceedings of the 15th ACM international conference on Multimedia
Pages
17-26
Description
Automatically annotating concepts for video is a key to semantic-level video browsing, search and navigation. The research on this topic evolved through two paradigms. The first paradigm used binary classification to detect each individual concept in a concept set. It achieved only limited success, as it did not model the inherent correlation between concepts, e.g., urban and building. The second paradigm added a second step on top of the individual concept detectors to fuse multiple concepts. However, its performance varies because the errors incurred in the first detection step can propagate to the second fusion step and therefore degrade the overall performance. To address the above issues, we propose a third paradigm which simultaneously classifies concepts and models correlations between them in a single step by using a novel Correlative Multi-Label (CML) framework. We compare the performance …
Total citations
2007200820092010201120122013201420152016201720182019202020212022202320247304164554562505449412326412517146
Scholar articles
GJ Qi, XS Hua, Y Rui, J Tang, T Mei, HJ Zhang - Proceedings of the 15th ACM international conference …, 2007
GJ Qi, XS Hua, Y Rui, J Tang, T Mei, M Wang… - ACM Transactions on Multimedia Computing …, 2008