Authors
Padmashree Ravindra, HyeongSik Kim, Kemafor Anyanwu
Publication date
2011
Conference
The Semanic Web: Research and Applications: 8th Extended Semantic Web Conference, ESWC 2011, Heraklion, Crete, Greece, May 29–June 2, 2011, Proceedings, Part II 8
Pages
46-61
Publisher
Springer Berlin Heidelberg
Description
Existing MapReduce systems support relational style join operators which translate multi-join query plans into several Map-Reduce cycles. This leads to high I/O and communication costs due to the multiple data transfer steps between map and reduce phases. SPARQL graph pattern matching is dominated by join operations, and is unlikely to be efficiently processed using existing techniques. This cost is prohibitive for RDF graph pattern matching queries which typically involve several join operations. In this paper, we propose an approach for optimizing graph pattern matching by reinterpreting certain join tree structures as grouping operations. This enables a greater degree of parallelism in join processing resulting in more “bushy” like query execution plans with fewer Map-Reduce cycles. This approach requires that the intermediate results are managed as sets of groups of triples or TripleGroups. We …
Total citations
2011201220132014201520162017201820192020202120222023202436121315911123113311
Scholar articles
P Ravindra, HS Kim, K Anyanwu - The Semanic Web: Research and Applications: 8th …, 2011