Authors
Bryan C Russell, Antonio Torralba, Kevin P Murphy, William T Freeman
Publication date
2008/5
Journal
International journal of computer vision
Volume
77
Pages
157-173
Publisher
Springer US
Description
We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. Such data is useful for supervised learning and quantitative evaluation. To achieve this, we developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare against existing state of the art datasets used for object recognition and detection. Also, we show how to extend the dataset to automatically enhance object labels with WordNet, discover object parts, recover a depth ordering of objects in a scene, and increase the number of labels using minimal user supervision and images from the web.
Total citations
2007200820092010201120122013201420152016201720182019202020212022202320245293178215225221274244245281254229273298324401395269
Scholar articles
BC Russell, A Torralba, KP Murphy, WT Freeman - International journal of computer vision, 2008