View article

[PDF] from uwindsor.ca

A survey of text clustering algorithms

Authors

Charu C Aggarwal, ChengXiang Zhai

Publication date

2012

Journal

Mining text data

Pages

77-128

Publisher

Springer US

Description

Clustering is a widely studied data mining problem in the text domains. The problem finds numerous applications in customer segmentation, classification, collaborative filtering, visualization, document organization, and indexing. In this chapter, we will provide a detailed survey of the problem of text clustering. We will study the key challenges of the clustering problem, as it applies to the text domain. We will discuss the key methods used for text clustering, and their relative advantages. We will also discuss a number of recent advances in the area in the context of social network and linked data.

Total citations

Cited by 942

20122013201420152016201720182019202020212022202320248 23 62 61 93 108 116 94 87 85 93 62 41

Scholar articles

A survey of text clustering algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012