Authors
Athena Vakali, Jaroslav Pokorný, Theodore Dalamagas
Publication date
2004/3/14
Source
International conference on extending database technology
Pages
597-606
Publisher
Springer Berlin Heidelberg
Description
Clustering is a challenging topic in the area of Web data management. Various forms of clustering are required in a wide range of applications, including finding mirrored Web pages, detecting copyright violations, and reporting search results in a structured way. Clustering can either be performed once offline, (independently to search queries), or online (on the results of search queries). Important efforts have focused on mining Web access logs and to cluster search engine results on the fly. Online methods based on link structure and text have been applied successfully to finding pages on related topics. This paper presents an overview of the most popular methodologies and implementations in terms of clustering either Web users or Web sources and presents a survey about current status and future trends in clustering employed over the Web.
Total citations
2005200620072008200920102011201220132014201520162017201820192020202111097929895281315
Scholar articles
A Vakali, J Pokorný, T Dalamagas - International conference on extending database …, 2004