View article

[PDF] from arxiv.org

Towards generalization across depth for monocular 3d object detection

Authors

Andrea Simonelli, Samuel Rota Bulo, Lorenzo Porzi, Elisa Ricci, Peter Kontschieder

Publication date

2020

Conference

Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXII 16

Pages

767-782

Publisher

Springer International Publishing

Description

While expensive LiDAR and stereo camera rigs have enabled the development of successful 3D object detection methods, monocular RGB-only approaches lag much behind. This work advances the state of the art by introducing MoVi-3D, a novel, single-stage deep architecture for monocular 3D object detection. MoVi-3D builds upon a novel approach which leverages geometrical information to generate, both at training and test time, virtual views where the object appearance is normalized with respect to distance. These virtually generated views facilitate the detection task as they significantly reduce the visual appearance variability associated to objects placed at different distances from the camera. As a consequence, the deep model is relieved from learning depth-specific representations and its complexity can be significantly reduced. In particular, in this work we show that, thanks to our virtual views …

Total citations

Cited by 67

20212022202320249 24 26 8

Scholar articles

Towards generalization across depth for monocular 3d object detection

A Simonelli, SR Bulo, L Porzi, E Ricci, P Kontschieder - Computer Vision–ECCV 2020: 16th European …, 2020