Authors
Makoto Onizuka, Toshimasa Fujimori, Hiroaki Shiokawa
Publication date
2017/3
Journal
Data Science and Engineering
Volume
2
Pages
94-105
Publisher
Springer Berlin Heidelberg
Description
There is strong demand for distributed engines that efficiently process large-scale graph data, such as social graphs and web graphs. Distributed graph engines partition the input graph and assign the partitions to distributed computers before running the analysis, so the quality of the graph partitioning largely affects the communication cost and load balance among computers during the analysis. We propose an effective graph partitioning technique that achieves low communication cost and good load balance among computers at the same time. We first generate more clusters than the number of computers by extending modularity-based clustering, and then merge those clusters into balanced-size clusters, using techniques designed for the graph packing problem, until the number of clusters equals the number of computers. We implemented our technique on top of distributed graph engine …
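The merge step described above (many small clusters packed into as many balanced groups as there are computers) can be illustrated with a minimal sketch. This is not the authors' method: it swaps in the generic "largest first" greedy heuristic from bin packing, looks only at cluster sizes, and ignores communication cost between clusters. The function name `pack_clusters`, its parameters, and the example sizes are all hypothetical.

```python
import heapq


def pack_clusters(cluster_sizes, num_machines):
    """Greedily merge clusters into `num_machines` size-balanced groups.

    Assumption: a plain largest-first greedy heuristic, used here only for
    illustration. It balances sizes and does not model edge cuts.
    """
    if num_machines <= 0:
        raise ValueError("num_machines must be positive")

    # Min-heap of (current load, machine index): the lightest machine is on top.
    heap = [(0, m) for m in range(num_machines)]
    heapq.heapify(heap)

    assignment = {}
    # Place the largest clusters first; ties are broken by cluster id.
    ordered = sorted(cluster_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for cluster_id, size in ordered:
        load, machine = heapq.heappop(heap)
        assignment[cluster_id] = machine
        heapq.heappush(heap, (load + size, machine))

    # Recompute the final load of each machine from the assignment.
    loads = [0] * num_machines
    for cluster_id, machine in assignment.items():
        loads[machine] += cluster_sizes[cluster_id]
    return assignment, loads


if __name__ == "__main__":
    # Hypothetical cluster sizes (number of vertices per cluster).
    sizes = {"a": 7, "b": 5, "c": 4, "d": 3, "e": 1}
    assignment, loads = pack_clusters(sizes, num_machines=2)
    print(assignment, loads)
```

A production partitioner would also weigh the edges cut between clusters when choosing which ones to merge, since the description names low communication cost as a goal alongside load balance.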
Total citations
[Chart: citations per year, 2017-2023]
Scholar articles
M Onizuka, T Fujimori, H Shiokawa - Data Science and Engineering, 2017