View article

[PDF] from psu.edu

PigSPARQL: A SPARQL Query Processing Baseline for Big Data.

Authors

Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen

Publication date

2013/10/23

Conference

ISWC (Posters & Demos)

Pages

241-244

Description

In this paper we discuss PigSPARQL, a competitive yet easy to use SPARQL query processing system on MapReduce that allows adhoc SPARQL query processing on large RDF graphs out of the box. Instead of a direct mapping, PigSPARQL uses the query language of Pig, a data analysis platform on top of Hadoop MapReduce, as an intermediate layer between SPARQL and MapReduce. This additional level of abstraction makes our approach independent of the actual Hadoop version and thus ensures the compatibility to future changes of the Hadoop framework as they will be covered by the underlying Pig layer. We revisit PigSPARQL and demonstrate the performance improvement when simply switching the underlying version of Pig from 0.5. 0 to 0.11. 0 without any changes to PigSPARQL itself. Because of this sustainability, PigSPARQL is an attractive long-term baseline for comparing various MapReduce based SPARQL implementations which is also underpinned by its competitiveness with existing systems, eg HadoopRDF.

Total citations

Cited by 61

20142015201620172018201920202021202220233 6 9 8 13 2 10 6 2 1

Scholar articles

PigSPARQL: A SPARQL Query Processing Baseline for Big Data.

A Schätzle, M Przyjaciel-Zablocki, T Hornung… - ISWC (Posters & Demos), 2013