Authors
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, Yantao Zheng
Publication date
2009/7/8
Book
Proceedings of the ACM international conference on image and video retrieval
Pages
1-9
Description
This paper introduces a web image dataset created by NUS's Lab for Media Search. The dataset includes: (1) 269,648 images and the associated tags from Flickr, with a total of 5,018 unique tags; (2) six types of low-level features extracted from these images, including 64-D color histogram, 144-D color correlogram, 73-D edge direction histogram, 128-D wavelet texture, 225-D block-wise color moments extracted over 5x5 fixed grid partitions, and 500-D bag of words based on SIFT descriptions; and (3) ground-truth for 81 concepts that can be used for evaluation. Based on this dataset, we highlight characteristics of Web image collections and identify four research issues on web image annotation and retrieval. We also provide the baseline results for web image annotation by learning from the tags using the traditional k-NN algorithm. The benchmark results indicate that it is possible to learn effective models from …
Total citations
2009201020112012201320142015201620172018201920202021202220232024275485102128149216203216250327319314349357236
Scholar articles
TS Chua, J Tang, R Hong, H Li, Z Luo, Y Zheng - Proceedings of the ACM international conference on …, 2009