View article

[PDF] from arxiv.org

Self-supervised single-view 3d reconstruction via semantic consistency

Authors

Xueting Li, Sifei Liu, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz

Publication date

2020

Conference

Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIV 16

Pages

677-693

Publisher

Springer International Publishing

Description

We learn a self-supervised, single-view 3D reconstruction model that predicts the 3D mesh shape, texture and camera pose of a target object with a collection of 2D images and silhouettes. The proposed method does not necessitate 3D supervision, manually annotated keypoints, multi-view images of an object or a prior 3D template. The key insight of our work is that objects can be represented as a collection of deformable parts, and each part is semantically coherent across different instances of the same category (e.g., wings on birds and wheels on cars). Therefore, by leveraging part segmentation of a large collection of category-specific images learned via self-supervision, we can effectively enforce semantic consistency between the reconstructed meshes and the original images. This significantly reduces ambiguities during joint prediction of shape and camera pose of an object, along with texture. To …

Total citations

Cited by 162

202020212022202320245 35 43 49 30

Scholar articles

Self-supervised single-view 3d reconstruction via semantic consistency

X Li, S Liu, K Kim, S De Mello, V Jampani, MH Yang… - Computer Vision–ECCV 2020: 16th European …, 2020