Authors
Michael Persin, Justin Zobel, Ron Sacks‐Davis
Publication date
1996/10
Journal
Journal of the American Society for Information Science
Volume
47
Issue
10
Pages
749-764
Publisher
Wiley Subscription Services, Inc., A Wiley Company
Description
Ranking techniques are effective at finding answers in document collections but can be expensive to evaluate. We propose an evaluation technique that uses early recognition of which documents are likely to be highly ranked to reduce costs; for our test data, queries are evaluated in 2% of the memory of the standard implementation without degradation in retrieval effectiveness. Cpu time and disk traffic can also be dramatically reduced by designing inverted indexes explicitly to support the technique. The principle of the index design is that inverted lists are sorted by decreasing within‐document frequency rather than by document number, and this method experimentally reduces cpu time and disk traffic to around one third of the original requirement. We also show that frequency sorting can lead to a net reduction in index size, regardless of whether the index is compressed. © 1996 John Wiley & Sons, Inc.
Total citations
1997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202228828107626292324161521182391916162431
Scholar articles
M Persin, J Zobel, R Sacks‐Davis - Journal of the American Society for Information Science, 1996