Authors
Sugato Basu, Arindam Banerjee, Raymond J Mooney
Publication date
2004/4/22
Book
Proceedings of the 2004 SIAM international conference on data mining
Pages
333-344
Publisher
Society for Industrial and Applied Mathematics
Description
Semi-supervised clustering uses a small amount of supervised data to aid unsupervised learning. One typical approach specifies a limited number of must-link and cannot-link constraints between pairs of examples. This paper presents a pairwise constrained clustering framework and a new method for actively selecting informative pairwise constraints to get improved clustering performance. The clustering and active learning methods are both easily scalable to large datasets, and can handle very high dimensional data. Experimental and theoretical results confirm that this active querying of pairwise constraints significantly improves the accuracy of clustering when given a relatively small amount of supervision.
Total citations
20042005200620072008200920102011201220132014201520162017201820192020202120222023202461217284336564466434668454640344646455229
Scholar articles
S Basu, A Banerjee, RJ Mooney - Proceedings of the 2004 SIAM international conference …, 2004