Authors
Tamara L Berg, Alexander C Berg, Jonathan Shih
Publication date
2010
Conference
Computer Vision – ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece, September 5-11, 2010, Proceedings, Part I
Pages
663-676
Publisher
Springer Berlin Heidelberg
Description
It is common to use domain-specific terminology – attributes – to describe the visual appearance of objects. In order to scale the use of these describable visual attributes to a large number of categories, especially those not well studied by psychologists or linguists, it will be necessary to find alternative techniques for identifying attribute vocabularies and for learning to recognize attributes without hand-labeled training data. We demonstrate that it is possible to accomplish both of these tasks automatically by mining text and image data sampled from the Internet. The proposed approach also characterizes attributes according to their visual representation (global or local) and type (color, texture, or shape). This work focuses on discovering attributes and their visual appearance, and is as agnostic as possible about the textual description.
Total citations
Citations per year: 2011: 19, 2012: 35, 2013: 60, 2014: 72, 2015: 79, 2016: 73, 2017: 53, 2018: 39, 2019: 21, 2020: 21, 2021: 26, 2022: 18, 2023: 19, 2024: 19