View article

[PDF] from academia.edu

A simple and efficient sampling method for estimating AP and NDCG

Authors

Emine Yilmaz, Evangelos Kanoulas, Javed A Aslam

Publication date

2008/7/20

Book

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval

Pages

603-610

Description

We consider the problem of large scale retrieval evaluation. Recently two methods based on random sampling were proposed as a solution to the extensive effort required to judge tens of thousands of documents. While the first method proposed by Aslam et al. [1] is quite accurate and efficient, it is overly complex, making it difficult to be used by the community, and while the second method proposed by Yilmaz et al., infAP [14], is relatively simple, it is less efficient than the former since it employs uniform random sampling from the set of complete judgments. Further, none of these methods provide confidence intervals on the estimated values.

The contribution of this paper is threefold: (1) we derive confidence intervals for infAP, (2) we extend infAP to incorporate nonrandom relevance judgments by employing stratified random sampling, hence combining the efficiency of stratification with the simplicity of random …

Total citations

Cited by 352

200820092010201120122013201420152016201720182019202020212022202320242 9 12 25 15 28 33 27 33 42 25 26 20 19 15 13 3

Scholar articles

A simple and efficient sampling method for estimating AP and NDCG

E Yilmaz, E Kanoulas, JA Aslam - Proceedings of the 31st annual international ACM …, 2008