Authors
Mark Sanderson, Hideo Joho
Publication date
2004/7/25
Book
Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval
Pages
33-40
Description
Forming test collection relevance judgments from the pooled output of multiple retrieval systems has become the standard process for creating resources such as the TREC, CLEF, and NTCIR test collections. This paper presents a series of experiments examining three different ways of building test collections where no system pooling is used. First, a collection formation technique combining manual feedback and multiple systems is adapted to work with a single retrieval system. Second, an existing method based on pooling the output of multiple manual searches is re-examined: testing a wider range of searchers and retrieval systems than has been examined before. Third, a new approach is explored where the ranked output of a single automatic search on a single retrieval system is assessed for relevance: no pooling whatsoever. Using established techniques for evaluating the quality of relevance judgments, in …
Total citations
200420052006200720082009201020112012201320142015201620172018201920202021202220232024361320111110448741268354132
Scholar articles
M Sanderson, H Joho - Proceedings of the 27th annual international ACM …, 2004