Authors
Tetsuya Sakai, Ruihua Song
Publication date
2011/7/24
Book
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Pages
1043-1052
Description
Search queries are often ambiguous and/or underspecified. To accomodate different user needs, search result diversification has received attention in the past few years. Accordingly, several new metrics for evaluating diversification have been proposed, but their properties are little understood. We compare the properties of existing metrics given the premises that (1) queries may have multiple intents; (2) the likelihood of each intent given a query is available; and (3) graded relevance assessments are available for each intent. We compare a wide range of traditional and diversified IR metrics after adding graded relevance assessments to the TREC 2009 Web track diversity task test collection which originally had binary relevance assessments. Our primary criterion is discriminative power, which represents the reliability of a metric in an experiment. Our results show that diversified IR experiments with a given …
Total citations
20112012201320142015201620172018201920202021202220232024101927231114181161710371
Scholar articles
T Sakai, R Song - Proceedings of the 34th international ACM SIGIR …, 2011