Authors
Paolo Boldi, Massimo Santini, Sebastiano Vigna
Publication date
2008/11/30
Journal
ACM SIGIR Forum
Volume
42
Issue
2
Pages
33-38
Publisher
ACM
Description
We describe the techniques developed to gather and distribute in a highly compressed, yet accessible, form a series of twelve snapshot of the .uk web domain. Ad hoc compression techniques made it possible to store the twelve snapshots using just 1:9 bits per link, with constant-time access to temporal information. Our collection makes it possible to study the temporal evolution link-based scores (e.g., PageRank), the growth of online communities, and in general time-dependent phenomena related to the link structure.
Total citations
2008200920102011201220132014201520162017201820192020202120222023202444547111916171612141113888
Scholar articles
P Boldi, M Santini, S Vigna - ACM SIGIR Forum, 2008