Authors
Dominik Durner, Viktor Leis, Thomas Neumann
Publication date
2023/7/1
Journal
Proceedings of the VLDB Endowment
Volume
16
Issue
11
Pages
2769-2782
Publisher
VLDB Endowment
Description
Elasticity of compute and storage is crucial for analytical cloud database systems. All cloud vendors provide disaggregated object stores, which can be used as storage backend for analytical query engines. Until recently, local storage was unavoidable to process large tables efficiently due to the bandwidth limitations of the network infrastructure in public clouds. However, the gap between remote network and local NVMe bandwidth is closing, making cloud storage more attractive. This paper presents a blueprint for performing efficient analytics directly on cloud object stores. We derive cost- and performance-optimal retrieval configurations for cloud object stores with the first in-depth study of this foundational service in the context of analytical query processing. For achieving high retrieval performance, we present AnyBlob, a novel download manager for query engines that optimizes throughput while minimizing CPU …
Total citations
Scholar articles
D Durner, V Leis, T Neumann - Proceedings of the VLDB Endowment, 2023