View article

[HTML] from springer.com

Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

Authors

Christian Heidorn, Muhammad Sabih, Nicolai Meyerhöfer, Christian Schinabeck, Jürgen Teich, Frank Hannig

Publication date

2024/4

Journal

International Journal of Parallel Programming

Volume

Issue

Pages

40-58

Publisher

Springer US

Description

Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the -norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our …

Total citations

Cited by 2

20242

Scholar articles

Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

C Heidorn, M Sabih, N Meyerhöfer, C Schinabeck… - International Journal of Parallel Programming, 2024