Authors
Chuanxia Zheng, Jianhua Wang, Weihai Chen, Xingming Wu
Publication date
2018/5
Journal
The Visual Computer (TVCJ)
Volume
34
Pages
735-747
Publisher
Springer Berlin Heidelberg
Description
Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes the scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling that can utilize flexible contextual information of different neighboring regions. The experimental results on the recent large SUN RGB-D dataset …
Total citations
2018201920202021202220232024122232
Scholar articles