Authors
Pan Peng
Publication date
2020
Book
Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms
Pages
2953-2972
Publisher
Society for Industrial and Applied Mathematics
Description
We develop sublinear time algorithms for analyzing the cluster structure of graphs with noisy partial information. A graph G with maximum degree at most d is called (k, φin, φout)-clusterable, if it can be partitioned into at most k parts, such that each part has inner conductance at least φin and outer conductance at most φout, where d is assumed to be constant. A graph G is called to be an -perturbation of a (k,φin, φout)-clusterable graph if there is partition of G with at most k parts (called clusters), such that one can insert/delete at most ϵdn intra-cluster edges to make it a (k,φinout)-clusterable graph. We are given query access to the adjacency list of such a graph.
We show that one can construct in time a robust clustering oracle for a bounded-degree graph G that is an -perturbation of a -clusterable graph. Using such an oracle, a typical clustering query (e.g., IsOutlier(s), SameCluster(s, t)) can be answered in …
Total citations
20212022202320241533
Scholar articles