Scaling Submodular Maximization via Pruned Submodularity Graphs

Abstract

We propose a new random pruning method (called "submodular sparsification (SS)") to reduce the cost of submodular maximization. The pruning is applied via a "submodularity graph" over the $n$ ground elements, where each directed edge is associated with a pairwise dependency defined by the submodular function. In each step, SS prunes a $1-1/\sqrt{c}$ (for $c>1$) fraction of the nodes using weights on edges computed based on only a small number ($O(\log n)$) of randomly sampled nodes. The algorithm requires $\log_{\sqrt{c}} n$ steps with a small and highly parallelizable per-step computation. An accuracy-speed tradeoff parameter $c$, set as $c=8$, leads to a fast shrink rate $\sqrt{2}/4$ and small iteration complexity $\log_{2\sqrt{2}} n$. Analysis shows that w.h.p., the greedy algorithm on the pruned set of size $O(\log^2 n)$ can achieve a guarantee similar to that of processing the original dataset. In news and video summarization tasks, SS is able to substantially reduce both computational costs and memory usage, while maintaining (or even slightly exceeding) the quality of the original (and much more costly) greedy algorithm.
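
The sketch below is a rough, non-authoritative illustration of the pruning loop described in the abstract: repeatedly sample $O(\log n)$ probe nodes, score every remaining node against the probes only, and discard a $1-1/\sqrt{c}$ fraction per step until roughly $O(\log^2 n)$ candidates remain. The oracle interface `f_gain(u, S)` and the "minimum gain against any probe" score are assumptions made for illustration; the paper's exact edge-weight definition and stopping rule are not specified in the abstract.

```python
import math
import random


def submodular_sparsification(V, f_gain, c=8, rng=None):
    """Prune the ground set V down to a small candidate set (illustrative sketch).

    V      : list of ground-set elements
    f_gain : f_gain(u, S) -> marginal gain of u given the elements in S
             (a hypothetical oracle interface assumed for this sketch)
    c      : accuracy-speed tradeoff parameter, c > 1
    """
    rng = rng or random.Random(0)
    n = len(V)
    keep_frac = 1.0 / math.sqrt(c)                   # fraction kept per step
    target = max(1, int(math.log(max(n, 2)) ** 2))   # stop near O(log^2 n) nodes
    remaining = list(V)

    while len(remaining) > target:
        # Sample O(log n) probe nodes; pairwise weights are estimated
        # only against these probes rather than against all pairs.
        k = max(1, int(math.ceil(math.log(len(remaining) + 1))))
        probes = rng.sample(remaining, min(k, len(remaining)))

        # Assumed pairwise dependency: the smallest marginal gain of u
        # given any single probe.  Nodes well covered by a probe score low.
        def score(u):
            return min(f_gain(u, [r]) for r in probes)

        # Keep the 1/sqrt(c) fraction that the probes explain worst; the
        # other 1 - 1/sqrt(c) fraction of nodes is pruned in this step.
        remaining.sort(key=score, reverse=True)
        remaining = remaining[:max(target, int(keep_frac * len(remaining)))]

    return remaining
```

The standard greedy algorithm would then be run on the returned candidate set instead of the full ground set, which is where the reported reductions in computation and memory come from.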
