Authors
Sugato Basu, Mikhail Bilenko, Raymond J Mooney
Publication date
2004/8/22
Book
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Pages
59-68
Description
Unsupervised clustering can be significantly improved using supervision in the form of pairwise constraints, i.e., pairs of instances labeled as belonging to same or different clusters. In recent years, a number of algorithms have been proposed for enhancing clustering quality by employing such supervision. Such methods use the constraints to either modify the objective function, or to learn the distance measure. We propose a probabilistic model for semi-supervised clustering based on Hidden Markov Random Fields (HMRFs) that provides a principled framework for incorporating supervision into prototype-based clustering. The model generalizes a previous approach that combines constraints and Euclidean distance learning, and allows the use of a broad range of clustering distortion measures, including Bregman divergences (e.g., Euclidean distance and I-divergence) and directional similarity measures (e.g …
Total citations
20052006200720082009201020112012201320142015201620172018201920202021202220232024295366896595868370628060615343362532229
Scholar articles
S Basu, M Bilenko, RJ Mooney - Proceedings of the tenth ACM SIGKDD international …, 2004