Authors
Daniele Cesarini, Andrea Marongiu, Luca Benini
Publication date
2016/3/14
Conference
2016 Design, Automation & Test in Europe Conference & Exhibition (DATE)
Pages
1261-1266
Publisher
IEEE
Description
Manycore accelerators have recently proven a promising solution for increasingly powerful and energy efficient computing systems. This raises the need for parallel programming models capable of effectively leveraging hundreds to thousands of processors. Task-based parallelism has the potential to provide such capabilities, offering flexible support to fine-grained and irregular parallelism. However, efficiently supporting this programming paradigm on resource-constrained parallel accelerators is a challenging task. In this paper, we present an optimized implementation of the OpenMP tasking model for embedded parallel accelerators, discussing the key design solution that guarantee small memory (footprint) and minimize performance overheads. We validate our design by comparing to several state-of-the-art tasking implementations, using the most representative parallelization patterns. The experimental …
Total citations
2016201720182019202020212022111
Scholar articles
D Cesarini, A Marongiu, L Benini - 2016 Design, Automation & Test in Europe Conference …, 2016