Comparing the performance of database selection algorithms

Authors

James C French, Allison L Powell, Jamie Callan, Charles L Viles, Travis Emmitt, Kevin J Prey, Yun Mou

Publication date

1999/8/1

Book

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval

Pages

238-245

Description

We compare the performance of two database selection algorithms reported in the literature. Their performance is compared using a common testbed designed specifically for database selection techniques. The testbed is a decomposition of the TREC/TIPSTER data into 236 subcollections. The databases from our testbed were ranked using both the gGlOSS and CORI techniques and compared to a baseline derived from TREC relevance judgements. We examined the degree to which CORI and gGlOSS approximate this baseline. Our results confirm our earlier observation that the gGlOSS Ideal(l) ranks do not estimate relevance-based ranks well. We also find that CORI is a uniformly better estimator of relevance-based ranks than gGlOSS for the test environment used in this study. Part of the advantage of the CORI algorithm can be explained by a strong correlation between gGlOSS and a size-based baseline …

Total citations

Cited by 206

[Chart: citations per year, 1999 to 2024]
