Authors
Arindam Banerjee, Chase Krumpelman, Joydeep Ghosh, Sugato Basu, Raymond J Mooney
Publication date
2005/8/21
Book
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Pages
532-537
Description
While the vast majority of clustering algorithms are partitional, many real world datasets have inherently overlapping clusters. Several approaches to finding overlapping clusters have come from work on analysis of biological datasets. In this paper, we interpret an overlapping clustering model proposed by Segal et al. [23] as a generalization of Gaussian mixture models, and we extend it to an overlapping clustering model based on mixtures of any regular exponential family distribution and the corresponding Bregman divergence. We provide the necessary algorithm modifications for this extension, and present results on synthetic data as well as subsets of 20-Newsgroups and EachMovie datasets.
Total citations
20052006200720082009201020112012201320142015201620172018201920202021202220232024218151791315181717212225162191714144
Scholar articles
A Banerjee, C Krumpelman, J Ghosh, S Basu… - Proceedings of the eleventh ACM SIGKDD …, 2005