29
512

Regularized M-estimators with nonconvexity: Statistical and algorithmic theory for local optima

Abstract

We provide novel theoretical results regarding local optima of regularized MM-estimators, allowing for nonconvexity in both loss and penalty functions. Under restricted strong convexity on the loss and suitable regularity conditions on the penalty, we prove that \emph{any stationary point} of the composite objective function will lie within statistical precision of the underlying parameter vector. Our theory covers many nonconvex objective functions of interest, including the corrected Lasso for errors-in-variables linear models; regression for generalized linear models with nonconvex penalties such as SCAD, MCP, and capped-1\ell_1; and high-dimensional graphical model estimation. We quantify statistical accuracy by providing bounds on the 1\ell_1-, 2\ell_2-, and prediction error between stationary points and the population-level optimum. We also propose a simple modification of composite gradient descent that may be used to obtain a near-global optimum within statistical precision ϵ\epsilon in log(1/ϵ)\log(1/\epsilon) steps, which is the fastest possible rate of any first-order method. We provide simulation studies illustrating the sharpness of our theoretical results.

View on arXiv
Comments on this paper