Authors
Tetsuya Sakai
Publication date
2006/8/6
Book
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Pages
525-532
Description
This paper describes how the Bootstrap approach to statistics can be applied to the evaluation of IR effectiveness metrics. First, we argue that Bootstrap Hypothesis Tests deserve more attention from the IR community, as they are based on fewer assumptions than traditional statistical significance tests. We then describe straightforward methods for comparing the sensitivity of IR metrics based on Bootstrap Hypothesis Tests. Unlike the heuristics-based "swap" method proposed by Voorhees and Buckley, our method estimates the performance difference required to achieve a given significance level directly from Bootstrap Hypothesis Test results. In addition, we describe a simple way of examining the accuracy of rank correlation between two metrics based on the Bootstrap Estimate of Standard Error. We demonstrate the usefulness of our methods using test collections and runs from the NTCIR CLIR track for …
Total citations
2006200720082009201020112012201320142015201620172018201920202021202220232024615969121029171817171815131711142
Scholar articles
T Sakai - Proceedings of the 29th annual international ACM …, 2006