View article

Sempala: Interactive SPARQL query processing on hadoop

Authors

Alexander Schätzle, Martin Przyjaciel-Zablocki, Antony Neu, Georg Lausen

Publication date

2014

Conference

The Semantic Web–ISWC 2014: 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I 13

Pages

164-179

Publisher

Springer International Publishing

Description

Driven by initiatives like Schema.org, the amount of semantically annotated data is expected to grow steadily towards massive scale, requiring cluster-based solutions to query it. At the same time, Hadoop has become dominant in the area of Big Data processing with large infrastructures being already deployed and used in manifold application fields. For Hadoop-based applications, a common data pool (HDFS) provides many synergy benefits, making it very attractive to use these infrastructures for semantic data processing as well. Indeed, existing SPARQL-on- Hadoop (MapReduce) approaches have already demonstrated very good scalability, however, query runtimes are rather slow due to the underlying batch processing framework. While this is acceptable for data-intensive queries, it is not satisfactory for the majority of SPARQL queries that are typically much more selective requiring only small …

Total citations

Cited by 110

20152016201720182019202020212022202320249 15 12 22 11 18 6 7 5 3

Scholar articles

Sempala: Interactive SPARQL query processing on hadoop

A Schätzle, M Przyjaciel-Zablocki, A Neu, G Lausen - The Semantic Web–ISWC 2014: 13th International …, 2014