Authors
Chengxiang Zhai, John Lafferty
Publication date
2004/4/1
Journal
ACM Transactions on Information Systems (TOIS)
Volume
22
Issue
2
Pages
179-214
Publisher
ACM
Description
Language modeling approaches to information retrieval are attractive and promising because they connect the problem of retrieval with that of language model estimation, which has been studied extensively in other application areas such as speech recognition. The basic idea of these approaches is to estimate a language model for each document, and to then rank documents by the likelihood of the query according to the estimated language model. A central issue in language model estimation is smoothing, the problem of adjusting the maximum likelihood estimator to compensate for data sparseness. In this article, we study the problem of language model smoothing and its influence on retrieval performance. We examine the sensitivity of retrieval performance to the smoothing parameters and compare several popular smoothing methods on different test collections. Experimental results show that not only is the …
Total citations
2004200520062007200820092010201120122013201420152016201720182019202020212022202320248193450841071411211271321001161079496764845372514
Scholar articles