View article

[PDF] from bjtu.edu.cn

HCP: A flexible CNN framework for multi-label image classification

Authors

Yunchao Wei, Wei Xia, Min Lin, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, Shuicheng Yan

Publication date

2016/9/1

Journal

IEEE transactions on pattern analysis and machine intelligence

Volume

Issue

Pages

1901-1907

Publisher

IEEE

Description

Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks. However, how CNN best copes with multi-label images still remains an open problem, mainly due to the complex underlying object layouts and insufficient multi-label training images. In this work, we propose a flexible deep CNN infrastructure, called Hypotheses-CNN-Pooling (HCP), where an arbitrary number of object segment hypotheses are taken as the inputs, then a shared CNN is connected with each hypothesis, and finally the CNN output results from different hypotheses are aggregated with max pooling to produce the ultimate multi-label predictions. Some unique characteristics of this flexible deep CNN infrastructure include: 1) no ground-truth bounding box information is required for training; 2) the whole HCP infrastructure is robust to possibly noisy and/or redundant hypotheses; 3 …

Total citations

Cited by 994

201420152016201720182019202020212022202320248 36 66 99 127 133 124 121 111 97 44

Scholar articles

HCP: A flexible CNN framework for multi-label image classification

Y Wei, W Xia, M Lin, J Huang, B Ni, J Dong, Y Zhao… - IEEE transactions on pattern analysis and machine …, 2015

CNN: Single-label to multi-label*

Y Wei, W Xia, J Huang, B Ni, J Dong, Y Zhao, S Yan - arXiv preprint arXiv:1406.5726, 2014