Authors
Steven CH Hoi, Rong Jin, Michael R Lyu
Publication date
2006/5/23
Book
Proceedings of the 15th international conference on World Wide Web
Pages
633-642
Description
Large-scale text categorization is an important research topic for Web data mining. One of the challenges in large-scale text categorization is how to reduce the human efforts in labeling text documents for building reliable classification models. In the past, there have been many studies on applying active learning methods to automatic text categorization, which try to select the most informative documents for labeling manually. Most of these studies focused on selecting a single unlabeled document in each iteration. As a result, the text categorization model has to be retrained after each labeled document is solicited. In this paper, we present a novel active learning algorithm that selects a batch of text documents for labeling manually in each iteration. The key of the batch mode active learning is how to reduce the redundancy among the selected examples such that each example provides unique information for model …
Total citations
2006200720082009201020112012201320142015201620172018201920202021202220232024291423141824231817232325232122181810
Scholar articles
SCH Hoi, R Jin, MR Lyu - Proceedings of the 15th international conference on …, 2006