Authors
Qinglei Wang, Yanan Qian, Ruihua Song, Zhicheng Dou, Fan Zhang, Tetsuya Sakai, Qinghua Zheng
Publication date
2013/8
Journal
Information retrieval
Volume
16
Pages
484-503
Publisher
Springer Netherlands
Description
Web search queries are often ambiguous or faceted, and the task of identifying the major underlying senses and facets of queries has received much attention in recent years. We refer to this task as query subtopic mining. In this paper, we propose to use surrounding text of query terms in top retrieved documents to mine subtopics and rank them. We first extract text fragments containing query terms from different parts of documents. Then we group similar text fragments into clusters and generate a readable subtopic for each cluster. Based on the cluster and the language model trained from a query log, we calculate three features and combine them into a relevance score for each subtopic. Subtopics are finally ranked by balancing relevance and novelty. Our evaluation experiments with the NTCIR-9 INTENT Chinese Subtopic Mining test collection show that our method significantly outperforms a query log …
Total citations
20142015201620172018201920202021202220235352423223
Scholar articles
Q Wang, Y Qian, R Song, Z Dou, F Zhang, T Sakai… - Information retrieval, 2013