Authors
Yunchao Wei, Wei Xia, Min Lin, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, Shuicheng Yan
Publication date
2016/9/1
Journal
IEEE transactions on pattern analysis and machine intelligence
Volume
38
Issue
9
Pages
1901-1907
Publisher
IEEE
Description
Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks. However, how CNN best copes with multi-label images still remains an open problem, mainly due to the complex underlying object layouts and insufficient multi-label training images. In this work, we propose a flexible deep CNN infrastructure, called Hypotheses-CNN-Pooling (HCP), where an arbitrary number of object segment hypotheses are taken as the inputs, then a shared CNN is connected with each hypothesis, and finally the CNN output results from different hypotheses are aggregated with max pooling to produce the ultimate multi-label predictions. Some unique characteristics of this flexible deep CNN infrastructure include: 1) no ground-truth bounding box information is required for training; 2) the whole HCP infrastructure is robust to possibly noisy and/or redundant hypotheses; 3 …
Total citations
2014201520162017201820192020202120222023202483666991271331241211119744
Scholar articles
Y Wei, W Xia, M Lin, J Huang, B Ni, J Dong, Y Zhao… - IEEE transactions on pattern analysis and machine …, 2015
Y Wei, W Xia, J Huang, B Ni, J Dong, Y Zhao, S Yan - arXiv preprint arXiv:1406.5726, 2014