Authors
Harald Lang, Andreas Kipf, Linnea Passing, Peter Boncz, Thomas Neumann, Alfons Kemper
Publication date
2018/6/11
Conference
Proceedings of the 14th International Workshop on Data Management on New Hardware
Pages
5
Publisher
ACM
Description
Increasing single instruction multiple data (SIMD) capabilities in modern hardware allows for compiling efficient data-parallel query pipelines. This means GPU-alike challenges arise: control flow divergence causes underutilization of vector-processing units. In this paper, we present efficient algorithms for the AVX-512 architecture to address this issue. These algorithms allow for fine-grained assignment of new tuples to idle SIMD lanes. Furthermore, we present strategies for their integration with compiled query pipelines without introducing inefficient memory materializations. We evaluate our approach with a high-performance geospatial join query, which shows performance improvements of up to 35%.
Total citations
20182019202020212022202320241499432
Scholar articles
H Lang, A Kipf, L Passing, P Boncz, T Neumann… - Proceedings of the 14th international workshop on data …, 2018