Authors
Hugh E Williams, Justin Zobel
Publication date
1999/1/1
Journal
The Computer Journal
Volume
42
Issue
3
Pages
193-201
Publisher
Oxford University Press
Description
Fast access to files of integers is crucial for the efficient resolution of queries to databases. Integers are the basis of indexes used to resolve queries, for example, in large internet search systems, and numeric data forms a large part of most databases. Disk access costs can be reduced by compression, if the cost of retrieving a compressed representation from disk and the CPU cost of decoding such a representation is less than that of retrieving uncompressed data. In this paper we show experimentally that, for large or small collections, storing integers in a compressed format reduces the time required for either sequential stream access or random access. We compare different approaches to compressing integers, including the Elias gamma and delta codes, Golomb coding, and a variable-byte integer scheme. As a conclusion, we recommend that, for fast access to integers, files be stored compressed.
Total citations
19992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024236817101015181062013191719191526189142793
Scholar articles