Authors
Erwei Wang, James J Davis, Ruizhe Zhao, Ho-Cheung Ng, Xinyu Niu, Wayne Luk, Peter YK Cheung, George A Constantinides
Publication date
2019/5/30
Source
ACM Computing Surveys (CSUR)
Volume
52
Issue
2
Pages
1-39
Publisher
ACM
Description
Deep neural networks have proven to be particularly effective in visual and audio recognition tasks. Existing models tend to be computationally expensive and memory intensive, however, and so methods for hardware-oriented approximation have become a hot topic. Research has shown that custom hardware-based neural network accelerators can surpass their general-purpose processor equivalents in terms of both throughput and energy efficiency. Application-tailored accelerators, when co-designed with approximation-based network training methods, transform large, dense, and computationally expensive networks into small, sparse, and hardware-efficient alternatives, increasing the feasibility of network deployment. In this article, we provide a comprehensive evaluation of approximation methods for high-performance network inference along with in-depth discussion of their effectiveness for custom …
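Illustrative example (not from the paper): the abstract refers to approximation methods that turn large, dense networks into small, sparse, hardware-efficient ones; two common such methods are magnitude-based weight pruning and uniform fixed-point quantization. The following minimal sketch shows both on a single weight matrix; the function names, sparsity level, and bit width are hypothetical choices for illustration only.

import numpy as np

def prune_by_magnitude(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude weights so that roughly `sparsity` of them become zero."""
    threshold = np.quantile(np.abs(weights), sparsity)
    return np.where(np.abs(weights) < threshold, 0.0, weights)

def quantize_uniform(weights: np.ndarray, bits: int) -> np.ndarray:
    """Symmetric uniform quantization to `bits` bits (kept in float here for clarity)."""
    scale = np.max(np.abs(weights)) / (2 ** (bits - 1) - 1)
    return np.round(weights / scale) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256)).astype(np.float32)            # a dense "layer"
w_small = quantize_uniform(prune_by_magnitude(w, sparsity=0.9), bits=8)
print(f"nonzeros: {np.count_nonzero(w_small)} / {w_small.size}")

The resulting matrix is both sparse (most entries zero) and representable with few bits per weight, which is the property custom hardware accelerators exploit for throughput and energy efficiency.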
Total citations
2019: 20 · 2020: 41 · 2021: 61 · 2022: 43 · 2023: 47 · 2024: 16