Authors
Edo Liberty
Publication date
2013/8/11
Book
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Pages
581-588
Description
A sketch of a matrix A is another matrix B which is significantly smaller than A but still approximates it well. Finding such sketches efficiently is an important building block in modern algorithms for approximating, for example, the PCA of massive matrices. This task is made more challenging in the streaming model, where each row of the input matrix can only be processed once and storage is severely limited.
In this paper we adapt a well known streaming algorithm for approximating item frequencies to the matrix sketching setting. The algorithm receives n rows of a large matrix A ε ℜ n x m one after the other in a streaming fashion. It maintains a sketch B ℜ l x m containing only l << n rows but still guarantees that ATA BTB. More accurately, ∀x || x,||=1 0≤||Ax||2 - ||Bx||2 ≤ 2||A||_f 2 l Or BTB prec ATA and ||ATA - BTB|| ≤ 2 ||A||f2 l.
This gives a streaming algorithm whose error decays proportional to 1/l using O(ml …
Total citations
Scholar articles
E Liberty - Proceedings of the 19th ACM SIGKDD international …, 2013