Authors
Jian-Tao Sun, Dou Shen, Hua-Jun Zeng, Qiang Yang, Yuchang Lu, Zheng Chen
Publication date
2005/8/15
Conference
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Pages
194-201
Publisher
ACM
Description
Most previous Web-page summarization methods treat a Web page as plain text. However, such methods fail to uncover the full knowledge associated with a Web page needed in building a high-quality summary, because many of these methods do not consider the hidden relationships in the Web. Uncovering the hidden knowledge is important in building good Web-page summarizers. In this paper, we extract the extra knowledge from the clickthrough data of a Web search engine to improve Web-page summarization. Wefirst analyze the feasibility in utilizing the clickthrough data to enhance Web-page summarization and then propose two adapted summarization methods that take advantage of the relationships discovered from the clickthrough data. For those pages that are not covered by the clickthrough data, we design a thematic lexicon approach to generate implicit knowledge for them. Our methods are …
Total citations
20052006200720082009201020112012201320142015201620172018201920202021202214141212111315128812676641
Scholar articles
JT Sun, D Shen, HJ Zeng, Q Yang, Y Lu, Z Chen - Proceedings of the 28th annual international ACM …, 2005