View article

[PDF] from psu.edu

A study of smoothing methods for language models applied to information retrieval

Authors

Chengxiang Zhai, John Lafferty

Publication date

2004/4/1

Journal

ACM Transactions on Information Systems (TOIS)

Volume

Issue

Pages

179-214

Publisher

ACM

Description

Language modeling approaches to information retrieval are attractive and promising because they connect the problem of retrieval with that of language model estimation, which has been studied extensively in other application areas such as speech recognition. The basic idea of these approaches is to estimate a language model for each document, and to then rank documents by the likelihood of the query according to the estimated language model. A central issue in language model estimation is smoothing, the problem of adjusting the maximum likelihood estimator to compensate for data sparseness. In this article, we study the problem of language model smoothing and its influence on retrieval performance. We examine the sensitivity of retrieval performance to the smoothing parameters and compare several popular smoothing methods on different test collections. Experimental results show that not only is the …

Total citations

Cited by 1594

2004200520062007200820092010201120122013201420152016201720182019202020212022202320248 19 34 50 84 107 141 121 127 132 100 116 107 94 96 76 48 45 37 25 14

Scholar articles

A study of smoothing methods for language models applied to information retrieval

C Zhai, J Lafferty - ACM Transactions on Information Systems (TOIS), 2004

Cited by 1594 Related articles All 13 versions