Authors
Baoyuan Liu, Min Wang, Hassan Foroosh, Marshall Tappen, Marianna Pensky
Publication date
2015
Conference
Proceedings of the IEEE conference on computer vision and pattern recognition
Pages
806-814
Description
Deep neural networks have achieved remarkable performance in both image classification and object detection problems, at the cost of a large number of parameters and computational complexity. In this work, we show how to reduce the redundancy in these parameters using a sparse decomposition. Maximum sparsity is obtained by exploiting both inter-channel and intra-channel redundancy, with a fine-tuning step that minimize the recognition loss caused by maximizing sparsity. This procedure zeros out more than 90\% of parameters, with a drop of accuracy that is less than 1\% on the ILSVRC2012 dataset. We also propose an efficient sparse matrix multiplication algorithm on CPU for Sparse Convolutional Neural Networks (SCNN) models. Our CPU implementation demonstrates much higher efficiency than the off-the-shelf sparse matrix libraries, with a significant speedup realized over the original dense network. In addition, we apply the SCNN model to the object detection problem, in conjunction with a cascade model and sparse fully connected layers, to achieve significant speedups.
Total citations
20152016201720182019202020212022202320247399014912815015010912581
Scholar articles
B Liu, M Wang, H Foroosh, M Tappen, M Pensky - Proceedings of the IEEE conference on computer …, 2015