Authors
Carlos Castillo, Debora Donato, Luca Becchetti, Paolo Boldi, Stefano Leonardi, Massimo Santini, Sebastiano Vigna
Publication date
2006/12/1
Journal
ACM SIGIR Forum
Volume
40
Issue
2
Pages
11-24
Publisher
ACM
Description
We describe the WEBSPAM-UK2006 collection, a large set of Web pages that have been manually annotated with labels indicating whether the hosts include Web spam aspects or not. This is the first publicly available Web spam collection that includes page contents and links, and that has been labelled by a large and diverse set of judges.
Total citations
Citations per year, 2007-2024 (chart)