View article

[PDF] from arxiv.org

Bi-directional cross-modality feature propagation with separation-and-aggregation gate for RGB-D semantic segmentation

Authors

Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, Gang Zeng

Publication date

2020/8/23

Book

European conference on computer vision

Pages

561-577

Publisher

Springer International Publishing

Description

Depth information has proven to be a useful cue in the semantic segmentation of RGB-D images for providing a geometric counterpart to the RGB representation. Most existing works simply assume that depth measurements are accurate and well-aligned with the RGB pixels and models the problem as a cross-modal feature fusion to obtain better feature representations to achieve more accurate segmentation. This, however, may not lead to satisfactory results as actual depth data are generally noisy, which might worsen the accuracy as the networks go deeper.

In this paper, we propose a unified and efficient Cross-modality Guided Encoder to not only effectively recalibrate RGB feature responses, but also to distill accurate depth information via multiple stages and aggregate the two recalibrated representations alternatively. The key of the proposed architecture is a novel Separation-and …

Total citations

Cited by 318

202020212022202320241 34 71 112 99

Scholar articles

Bi-directional cross-modality feature propagation with separation-and-aggregation gate for RGB-D semantic segmentation

X Chen, KY Lin, J Wang, W Wu, C Qian, H Li, G Zeng - European conference on computer vision, 2020