Authors
Brandon Reagen, Paul Whatmough, Robert Adolf, Saketh Rama, Hyunkwang Lee, Sae Kyu Lee, José Miguel Hernández-Lobato, Gu-Yeon Wei, David Brooks
Publication date
2016/6/18
Journal
ACM SIGARCH Computer Architecture News
Volume
44
Issue
3
Pages
267-278
Publisher
ACM
Description
The continued success of Deep Neural Networks (DNNs) in classification tasks has sparked a trend of accelerating their execution with specialized hardware. While published designs easily give an order of magnitude improvement over general-purpose hardware, few look beyond an initial implementation. This paper presents Minerva, a highly automated co-design approach across the algorithm, architecture, and circuit levels to optimize DNN hardware accelerators. Compared to an established fixed-point accelerator baseline, we show that fine-grained, heterogeneous datatype optimization reduces power by 1.5×; aggressive, inline predication and pruning of small activity values further reduces power by 2.0×; and active hardware fault detection coupled with domain-aware error mitigation eliminates an additional 2.7× through lowering SRAM voltages. Across five datasets, these optimizations provide a collective …
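Note: the three power savings quoted above compose multiplicatively, roughly 1.5 × 2.0 × 2.7 ≈ 8.1×, which is the sense in which the optimizations are collective. As a behavioral illustration of the second optimization (inline predication and pruning of small activity values), the Python sketch below skips the multiply-accumulate for near-zero activations. This is a toy model, not Minerva's hardware; the function name `pruned_neuron` and the `threshold` value are hypothetical choices for this sketch, whereas the paper tunes the actual pruning threshold per network so that model accuracy is preserved.

```python
import numpy as np

def pruned_neuron(weights, activations, threshold=0.05):
    """Dot product that predicates out small activations.

    In hardware, skipping the MAC (and the associated weight-SRAM read)
    for near-zero activations is what saves power; here the skip is
    modeled as a simple magnitude comparison.
    """
    acc = 0.0
    for w, a in zip(weights, activations):
        if abs(a) <= threshold:   # small activity value: skip the MAC
            continue
        acc += w * a
    return acc

# Usage: ReLU-style activations have many zero/near-zero values,
# so most MACs are skipped while the result stays close to the
# dense dot product for a small threshold.
rng = np.random.default_rng(0)
w = rng.normal(size=256)
a = np.maximum(rng.normal(size=256), 0.0)
print(pruned_neuron(w, a), float(w @ a))
```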
Total citations
2016: 15, 2017: 65, 2018: 104, 2019: 107, 2020: 130, 2021: 106, 2022: 100, 2023: 77, 2024: 25
Scholar articles
B Reagen, P Whatmough, R Adolf, S Rama, H Lee… - ACM SIGARCH Computer Architecture News, 2016