Authors
Edo Liberty
Publication date
2013/8/11
Book
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Pages
581-588
Description
A sketch of a matrix A is another matrix B which is significantly smaller than A but still approximates it well. Finding such sketches efficiently is an important building block in modern algorithms for approximating, for example, the PCA of massive matrices. This task is made more challenging in the streaming model, where each row of the input matrix can only be processed once and storage is severely limited.
In this paper we adapt a well known streaming algorithm for approximating item frequencies to the matrix sketching setting. The algorithm receives n rows of a large matrix A ε ℜ n x m one after the other in a streaming fashion. It maintains a sketch Bl x m containing only l << n rows but still guarantees that ATA BTB. More accurately, ∀x || x,||=1 0≤||Ax||2 - ||Bx||2 ≤ 2||A||_f 2 l Or BTB prec ATA and ||ATA - BTB|| ≤ 2 ||A||f2 l.
This gives a streaming algorithm whose error decays proportional to 1/l using O(ml …
Total citations
2012201320142015201620172018201920202021202220232024232032363746323042362311
Scholar articles
E Liberty - Proceedings of the 19th ACM SIGKDD international …, 2013