View article

[PDF] from thecvf.com

Semantic video CNNs through representation warping

Authors

Raghudeep Gadde, Varun Jampani, Peter V Gehler

Publication date

2017

Conference

International Conference on Computer Vision (ICCV)

Description

In this work, we propose a technique to convert CNN models for semantic segmentation of static images into CNNs for video data. We describe a warping method that can be used to augment existing architectures with very little extra computational cost. This module is called NetWarp and we demonstrate its use for a range of network architectures. The main design principle is to use optical flow of adjacent frames for warping internal network representations across time. A key insight of this work is that fast optical flow methods can be combined with many different CNN architectures for improved performance and end-to-end training. Experiments validate that the proposed approach incurs only little extra computational cost, while improving performance, when video streams are available. We achieve new state-of-the-art results on the CamVid and Cityscapes benchmark datasets and show consistent improvements over different baseline networks. Our code and models are available at http://segmentation. is. tue. mpg. de

Total citations

Cited by 265

2016201720182019202020212022202320241 4 26 32 29 45 48 51 25

Scholar articles

Semantic video cnns through representation warping

R Gadde, V Jampani, PV Gehler - Proceedings of the IEEE International Conference on …, 2017