Authors
Erwei Wang, James J Davis, Ruizhe Zhao, Ho-Cheung Ng, Xinyu Niu, Wayne Luk, Peter YK Cheung, George A Constantinides
Publication date
2019/5/30
Source
ACM Computing Surveys (CSUR)
Volume
52
Issue
2
Pages
1-39
Publisher
ACM
Description
Deep neural networks have proven to be particularly effective in visual and audio recognition tasks. Existing models tend to be computationally expensive and memory intensive, however, and so methods for hardware-oriented approximation have become a hot topic. Research has shown that custom hardware-based neural network accelerators can surpass their general-purpose processor equivalents in terms of both throughput and energy efficiency. Application-tailored accelerators, when co-designed with approximation-based network training methods, transform large, dense, and computationally expensive networks into small, sparse, and hardware-efficient alternatives, increasing the feasibility of network deployment. In this article, we provide a comprehensive evaluation of approximation methods for high-performance network inference along with in-depth discussion of their effectiveness for custom …
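Illustrative example (not from the paper): the abstract refers to approximation methods that turn large, dense networks into small, sparse, hardware-efficient ones; two common such methods are magnitude-based weight pruning and uniform fixed-point quantization. The following minimal sketch shows both on a single weight matrix; the function names, sparsity level, and bit width are hypothetical choices for illustration only.

import numpy as np

def prune_by_magnitude(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude weights so that roughly `sparsity` of them become zero."""
    threshold = np.quantile(np.abs(weights), sparsity)
    return np.where(np.abs(weights) < threshold, 0.0, weights)

def quantize_uniform(weights: np.ndarray, bits: int) -> np.ndarray:
    """Symmetric uniform quantization to `bits` bits (kept in float here for clarity)."""
    scale = np.max(np.abs(weights)) / (2 ** (bits - 1) - 1)
    return np.round(weights / scale) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256)).astype(np.float32)            # a dense "layer"
w_small = quantize_uniform(prune_by_magnitude(w, sparsity=0.9), bits=8)
print(f"nonzeros: {np.count_nonzero(w_small)} / {w_small.size}")

The resulting matrix is both sparse (most entries zero) and representable with few bits per weight, which is the property custom hardware accelerators exploit for throughput and energy efficiency.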
Total citations
2019: 20 · 2020: 41 · 2021: 61 · 2022: 43 · 2023: 47 · 2024: 16