Authors
Heliang Zheng, Jianlong Fu, Tao Mei, Jiebo Luo
Publication date
2017
Conference
Proceedings of the IEEE international conference on computer vision
Pages
5209-5217
Description
Recognizing fine-grained categories (eg, bird species) highly relies on discriminative part localization and part-based fine-grained feature learning. Existing approaches predominantly solve these challenges independently, while neglecting the fact that part localization (eg, head of a bird) and fine-grained feature learning (eg, head shape) are mutually correlated. In this paper, we propose a novel part learning approach by a multi-attention convolutional neural network (MA-CNN), where part generation and feature learning can reinforce each other. MA-CNN consists of convolution, channel grouping and part classification sub-networks. The channel grouping network takes as input feature channels from convolutional layers, and generates multiple parts by clustering, weighting and pooling from spatially-correlated channels. The part classification network further classifies an image by each individual part, through which more discriminative fine-grained features can be learned. Two losses are proposed to guide the multi-task learning of channel grouping and part classification, which encourages MA-CNN to generate more discriminative parts from feature channels and learn better fine-grained features from parts in a mutual reinforced way. MA-CNN does not need bounding box/part annotation and can be trained end-to-end. We incorporate the learned parts from MA-CNN with part-CNN for recognition, and show the best performances on three challenging published fine-grained datasets, eg, CUB-Birds, FGVC-Aircraft and Stanford-Cars.
Total citations
20182019202020212022202320245711916223918719280
Scholar articles
H Zheng, J Fu, T Mei, J Luo - Proceedings of the IEEE international conference on …, 2017