Authors
L Venkata Subramaniam, Shourya Roy, Tanveer A Faruquie, Sumit Negi
Publication date
2009/7/23
Book
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
Pages
115-122
Description
In the real world, noise is ubiquitous in text communications. Text produced by processing signals intended for human use is often noisy for automated computer processing. Automatic speech recognition, optical character recognition and machine translation all introduce processing noise. Digital text produced in informal settings such as online chat, SMS, emails, message boards, newsgroups, blogs, wikis and web pages also contains considerable noise. In this paper, we present a survey of the existing measures for noise in text. We also cover application areas that ingest this noisy text for tasks such as Information Retrieval and Information Extraction.
Total citations
[Bar chart: citations per year, 2010-2024]
Scholar articles
LV Subramaniam, S Roy, TA Faruquie, S Negi - Proceedings of The Third Workshop on Analytics for …, 2009