Authors
Yunhua Hu, Yanan Qian, Hang Li, Daxin Jiang, Jian Pei, Qinghua Zheng
Publication date
2012/8/12
Book
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Pages
305-314
Description
Most queries in web search are ambiguous and multifaceted. Identifying the major senses and facets of queries from search log data, referred to as query subtopic mining in this paper, is a very important issue in web search. Through search log analysis, we show that there are two interesting phenomena of user behavior that can be leveraged to identify query subtopics, referred to as `one subtopic per search' and `subtopic clarification by keyword'. One subtopic per search means that if a user clicks multiple URLs in one query, then the clicked URLs tend to represent the same sense or facet. Subtopic clarification by keyword means that users often add an additional keyword or keywords to expand the query in order to clarify their search intent. Thus, the keywords tend to be indicative of the sense or facet. We propose a clustering algorithm that can effectively leverage the two phenomena to automatically mine the …
Total citations
20132014201520162017201820192020202120222023115121681134314
Scholar articles
Y Hu, Y Qian, H Li, D Jiang, J Pei, Q Zheng - Proceedings of the 35th international ACM SIGIR …, 2012