View article

[PDF] from openproceedings.org

Exploiting the query structure for efficient join ordering in SPARQL queries.

Authors

Andrey Gubichev, Thomas Neumann

Publication date

2014/3/24

Journal

EDBT

Volume

Pages

439-450

Description

The join ordering problem is a fundamental challenge that has to be solved by any query optimizer. Since the high-performance RDF systems are often implemented as triple stores (ie, they represent RDF data as a single table with three attributes, at least conceptually), the query optimization strategies employed by such systems are often adopted from relational query optimization. In this paper we show that the techniques borrowed from traditional SQL query optimization (such as Dynamic Programming algorithm or greedy heuristics) are not immediately capable of handling large SPARQL queries. We introduce a new join ordering algorithm that performs a SPARQL-tailored query simplification. Furthermore, we present a novel RDF statistical synopsis that accurately estimates cardinalities in large SPARQL queries. Our experiments show that this algorithm is highly superior to the state-of-the-art SPARQL optimization approaches, including the RDF-3X’s original Dynamic Programming strategy.

Total citations

Cited by 103

201420152016201720182019202020212022202320244 13 11 15 18 6 11 9 9 5 2

Scholar articles

Exploiting the query structure for efficient join ordering in SPARQL queries.

A Gubichev, T Neumann - EDBT, 2014