Authors
Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, Antonio Torralba
Publication date
2017/7/4
Journal
IEEE transactions on pattern analysis and machine intelligence
Volume
40
Issue
6
Pages
1452-1464
Publisher
IEEE
Description
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world. Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate representation of scene classification. With its high-coverage and high-diversity of exemplars, the Places Database along with the Places-CNNs offer a novel resource to guide future progress on scene …
Total citations
2017201820192020202120222023202435231440547768897938509
Scholar articles
B Zhou, A Lapedriza, A Khosla, A Oliva, A Torralba - IEEE transactions on pattern analysis and machine …, 2017