28
35
v1v2 (latest)

Sparse Approximation via Generating Point Sets

Abstract

\newcommand{\kalg}{{k_{\mathrm{alg}}}} \newcommand{\kopt}{{k_{\mathrm{opt}}}} \newcommand{\algset}{{T}} \renewcommand{\Re}{\mathbb{R}} \newcommand{\eps}{\varepsilon} \newcommand{\pth}[2][\!]{#1\left({#2}\right)} \newcommand{\npoints}{n} \newcommand{\ballD}{\mathsf{b}} \newcommand{\dataset}{{P}} For a set \dataset\dataset of \npoints\npoints points in the unit ball \ballDd\ballD \subseteq \Re^d, consider the problem of finding a small subset \algset\dataset\algset \subseteq \dataset such that its convex-hull \eps\eps-approximates the convex-hull of the original set. We present an efficient algorithm to compute such a \eps\eps'-approximation of size \kalg\kalg, where \eps\eps' is function of \eps\eps, and \kalg\kalg is a function of the minimum size \kopt\kopt of such an \eps\eps-approximation. Surprisingly, there is no dependency on the dimension dd in both bounds. Furthermore, every point of \dataset\dataset can be \eps\eps-approximated by a convex-combination of points of \algset\algset that is O(1/\eps2)O(1/\eps^2)-sparse. Our result can be viewed as a method for sparse, convex autoencoding: approximately representing the data in a compact way using sparse combinations of a small subset \algset\algset of the original data. The new algorithm can be kernelized, and it preserves sparsity in the original input.

View on arXiv
Comments on this paper