View article

[PDF] from arxiv.org

Introduction to camera pose estimation with deep learning

Authors

Yoli Shavit, Ron Ferens

Publication date

2019/7/8

Journal

arXiv preprint arXiv:1907.05272

Description

Over the last two decades, deep learning has transformed the field of computer vision. Deep convolutional networks were successfully applied to learn different vision tasks such as image classification, image segmentation, object detection and many more. By transferring the knowledge learned by deep models on large generic datasets, researchers were further able to create fine-tuned models for other more specific tasks. Recently this idea was applied for regressing the absolute camera pose from an RGB image. Although the resulting accuracy was sub-optimal, compared to classic feature-based solutions, this effort led to a surge of learning-based pose estimation methods. Here, we review deep learning approaches for camera pose estimation. We describe key methods in the field and identify trends aiming at improving the original deep pose regression solution. We further provide an extensive cross-comparison of existing learning-based pose estimators, together with practical notes on their execution for reproducibility purposes. Finally, we discuss emerging solutions and potential future research directions.

Total citations

Cited by 74

2020202120222023202412 15 13 22 12

Scholar articles

Introduction to camera pose estimation with deep learning

Y Shavit, R Ferens - arXiv preprint arXiv:1907.05272, 2019