Authors
Marco Bressan, Flavio Chierichetti, Ravi Kumar, Stefano Leucci, Alessandro Panconesi
Publication date
2017/2/2
Conference
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining
Pages
557-566
Publisher
ACM
Description
Counting graphlets is a well-studied problem in graph mining and social network analysis. Recently, several papers explored very simple and natural approaches based on Monte Carlo sampling of Markov Chains (MC), and reported encouraging results. We show, perhaps surprisingly, that this approach is outperformed by a carefully engineered version of color coding (CC) [1], a sophisticated algorithmic technique that we extend to the case of graphlet sampling and for which we prove strong statistical guarantees. Our computational experiments on graphs with millions of nodes show CC to be more accurate than MC. Furthermore, we formally show that the mixing time of the MC approach is too high in general, even when the input graph has high conductance. All this comes at a price however. While MC is very efficient in terms of space, CC's memory requirements become demanding when the size of the input …
Total citations
2017201820192020202120222023202451315152310155
Scholar articles
M Bressan, F Chierichetti, R Kumar, S Leucci… - Proceedings of the tenth ACM international conference …, 2017