Authors
Emine Yilmaz, Evangelos Kanoulas, Javed A Aslam
Publication date
2008/7/20
Book
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Pages
603-610
Description
We consider the problem of large-scale retrieval evaluation. Recently, two methods based on random sampling were proposed as a solution to the extensive effort required to judge tens of thousands of documents. The first method, proposed by Aslam et al. [1], is quite accurate and efficient but overly complex, making it difficult for the community to use. The second method, infAP, proposed by Yilmaz et al. [14], is relatively simple but less efficient than the former, since it employs uniform random sampling from the set of complete judgments. Further, neither of these methods provides confidence intervals on the estimated values.
The contribution of this paper is threefold: (1) we derive confidence intervals for infAP, (2) we extend infAP to incorporate nonrandom relevance judgments by employing stratified random sampling, hence combining the efficiency of stratification with the simplicity of random …
Total citations
[Chart: citations per year, 2008 to 2024]
Scholar articles
E Yilmaz, E Kanoulas, JA Aslam - Proceedings of the 31st annual international ACM …, 2008