Authors
Shantanu Godbole, Indrajit Bhattacharya, Ajay Gupta, Ashish Verma
Publication date
2010/10/26
Book
Proceedings of the 19th ACM International Conference on Information and Knowledge Management
Pages
1189-1198
Description
Text mining, though still a nascent industry, has been growing quickly along with awareness of the importance of unstructured data in business analytics, customer retention and extension, social media, and legal applications. There has been a recent increase in the number of commercial text mining product and service offerings, but successful or widespread deployments are rare, mainly because they depend on the expertise and skill of practitioners. Accordingly, there is a growing need for reusable repositories for text mining. In this paper, we focus on dictionary-based text mining and its role in enabling practitioners to understand and analyze large text datasets. We motivate and define the problem of exploratory dictionary construction for capturing concepts of interest, and propose a framework for efficient construction, tuning, and reuse of these dictionaries across datasets. The construction framework …
Total citations
[Chart: citations per year, 2009 to 2023]
Scholar articles
S Godbole, I Bhattacharya, A Gupta, A Verma - Proceedings of the 19th ACM international conference …, 2010