144

Near-Optimal Entrywise Sampling for Data Matrices

Neural Information Processing Systems (NeurIPS), 2013
Abstract

We consider the problem of selecting non-zero entries of a matrix AA in order to produce a sparse sketch of it, BB, that minimizes AB2\|A-B\|_2. For large m×nm \times n matrices, such that nmn \gg m (for example, representing nn observations over mm attributes) we give sampling distributions that exhibit four important properties. First, they have closed forms computable from minimal information regarding AA. Second, they allow sketching of matrices whose non-zeros are presented to the algorithm in arbitrary order as a stream, with O(1)O(1) computation per non-zero. Third, the resulting sketch matrices are not only sparse, but their non-zero entries are highly compressible. Lastly, and most importantly, under mild assumptions, our distributions are provably competitive with the optimal offline distribution. Note that the probabilities in the optimal offline distribution may be complex functions of all the entries in the matrix. Therefore, regardless of computational complexity, the optimal distribution might be impossible to compute in the streaming model.

View on arXiv
Comments on this paper