Statistical reform in information retrieval?

Authors

Tetsuya Sakai

Publication date

2014/6/26

Journal

ACM SIGIR Forum

Volume

Issue

Pages

3-12

Publisher

ACM

Description

IR revolves around evaluation. Therefore, IR researchers should employ sound evaluation practices. Nowadays many of us know that statistical significance testing is not enough, but not all of us know exactly what to do about it. This paper provides suggestions on how to report effect sizes and confidence intervals along with p-values, in the context of comparing IR systems using test collections. Hopefully, these practices will make IR papers more informative, and help researchers form more reliable conclusions that "add up." Finally, I pose a specific question for the IR community: should IR journal editors and SIGIR PC chairs require (rather than encourage) reporting of effect sizes and confidence intervals?

Total citations

Cited by 87

2014: 3, 2015: 7, 2016: 12, 2017: 19, 2018: 18, 2019: 8, 2020: 3, 2021: 7, 2022: 2, 2023: 7, 2024: 1
