24
14

Efficient Convex Optimization Requires Superlinear Memory

Abstract

We show that any memory-constrained, first-order algorithm which minimizes dd-dimensional, 11-Lipschitz convex functions over the unit ball to 1/poly(d)1/\mathrm{poly}(d) accuracy using at most d1.25δd^{1.25 - \delta} bits of memory must make at least Ω~(d1+(4/3)δ)\tilde{\Omega}(d^{1 + (4/3)\delta}) first-order queries (for any constant δ[0,1/4]\delta \in [0, 1/4]). Consequently, the performance of such memory-constrained algorithms are a polynomial factor worse than the optimal O~(d)\tilde{O}(d) query bound for this problem obtained by cutting plane methods that use O~(d2)\tilde{O}(d^2) memory. This resolves a COLT 2019 open problem of Woodworth and Srebro.

View on arXiv
Comments on this paper