View article

[PDF] from arxiv.org

Semantic understanding of scenes through the ade20k dataset

Authors

Bolei Zhou, Hang Zhao, Xavier Puig, Tete Xiao, Sanja Fidler, Adela Barriuso, Antonio Torralba

Publication date

2019/3/15

Journal

International Journal of Computer Vision

Volume

127

Pages

302-321

Publisher

Springer US

Description

Semantic understanding of visual scenes is one of the holy grails of computer vision. Despite efforts of the community in data collection, there are still few image datasets covering a wide range of scenes and object categories with pixel-wise annotations for scene understanding. In this work, we present a densely annotated dataset ADE20K, which spans diverse annotations of scenes, objects, parts of objects, and in some cases even parts of parts. Totally there are 25k images of the complex everyday scenes containing a variety of objects in their natural spatial context. On average there are 19.5 instances and 10.5 object classes per image. Based on ADE20K, we construct benchmarks for scene parsing and instance segmentation. We provide baseline performances on both of the benchmarks and re-implement state-of-the-art models for open source. We further evaluate the effect of synchronized batch …

Total citations

Cited by 1736

201820192020202120222023202460 123 167 206 316 480 314

Scholar articles

Semantic understanding of scenes through the ade20k dataset

B Zhou, H Zhao, X Puig, T Xiao, S Fidler, A Barriuso… - International Journal of Computer Vision, 2019

Cited by 1736 Related articles All 9 versions