Authors
Stephen Bonner, John Brennan, Georgios Theodoropoulos, Ibad Kureshi, Andrew Stephen McGough
Publication date
2016/12/5
Conference
2016 IEEE international conference on big data (big data)
Pages
3298-3307
Publisher
IEEE
Description
The problem of how to compare empirical graphs is an area of great interest within the field of network science. The ability to accurately but efficiently compare graphs has a significant impact in such areas as temporal graph evolution, anomaly detection and protein comparison. The comparison problem is compounded when working with massive graphs containing millions of vertices and edges. This paper introduces a parallel feature extraction based approach for the efficient comparison of large unlabelled graph datasets using Apache Spark. The approach acts by producing a `Graph Fingerprint' which represents both vertex level and global level topological features from a graph. By using Spark we are able to efficiently compare graphs considered unmanageably large to other approaches. The runtime of the approach is shown to scale sub-linearly with the size and complexity of the graphs being fingerprinted …
Total citations
2018201920202021202223241
Scholar articles
S Bonner, J Brennan, G Theodoropoulos, I Kureshi… - 2016 IEEE international conference on big data (big …, 2016