Authors
Marco Carreras, Gianfranco Deriu, Paolo Meloni
Publication date
2020/9
Journal
Journal of Networking Technology Volume
Volume
11
Issue
3
Pages
93
Description
Computer Vision leads to large applications for image and video classification and segmentation. It provides the increasing focus of the community on this topic, has generated awide scope of approaches that use different kernel shapes and techniques for executing convolutions with respect to the classic one, such as for example separable convolutions, deformable convolutions or deconvolutions ([4, 5]), frequently used in semantic segmentation tasks ([23, 13]). It has been found that it is common knowledge that FPGAs can be used to accelerate classic Convolutional layers in CNNs, there is limited literature about FPGA-based accelerators supporting less regular and common processing kernels ([20]). This work begins from the previous experience acquired developing NEURAghe, we try to improve exibility of CNN accelerators and to study new methodologies to improve efficiency on the previously mentioned use-cases. Initially we focus on layered approaches based on 1D convolutions, that, as indicated by several recent research results, can be effectively used to classify and segment time series and sequences, as well as in tasks involving sequence modeling. In multiple scenarios a convolution approach applied on the time dimension, hereafter called Temporal Convolution Network (TCN) can outperform classic strategies relying on recurrent networks in terms of accuracy and training time. We modified NEURAghe to support TCN and validate results on an ECG-classification benchmark, achieving up to 95% efficiency in terms of GOPS/s with respect to the accelerator peak performance.
Scholar articles
M Carreras, G Deriu, P Meloni - Journal of Networking Technology Volume, 2020