View article

[PDF] from thecvf.com

Learning deep features for discriminative localization

Authors

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba

Publication date

2016

Conference

Proceedings of the IEEE conference on computer vision and pattern recognition

Pages

2921-2929

Description

In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network (CNN) to have remarkable localization ability despite being trained on image-level labels. While this technique was previously proposed as a means for regularizing training, we find that it actually builds a generic localizable deep representation that exposes the implicit attention of CNNs on image. Despite the apparent simplicity of global average pooling, we are able to achieve 37.1% top-5 error for object localization on ILSVRC 2014 without training on any bounding box annotation. We demonstrate that our network is able to localize the discriminative image regions on a variety of tasks despite not being trained for them.

Total citations

Cited by 11848

20162017201820192020202120222023202443 245 570 1004 1525 2057 2385 2476 1444

Scholar articles

Learning deep features for discriminative localization

B Zhou, A Khosla, A Lapedriza, A Oliva, A Torralba - Proceedings of the IEEE conference on computer …, 2016

Cited by 11515 Related articles All 22 versions

Lapedriza*

B Zhou, A Khosla - A., A. Oliva, and A. Torralba. Learning Deep Features …, 2016

Cited by 364 Related articles

Agata Lapedriza, Aude Oliva, and Antonio Torralba*

B Zhou, A Khosla - Learning deep features for discriminative localization …, 2015

Cited by 41 Related articles