65
36
v1v2 (latest)

Efficient active learning of sparse halfspaces

Abstract

We study the problem of efficient PAC active learning of homogeneous linear classifiers (halfspaces) in Rd\mathbb{R}^d, where the goal is to learn a halfspace with low error using as few label queries as possible. Under the extra assumption that there is a tt-sparse halfspace that performs well on the data (tdt \ll d), we would like our active learning algorithm to be {\em attribute efficient}, i.e. to have label requirements sublinear in dd. In this paper, we provide a computationally efficient algorithm that achieves this goal. Under certain distributional assumptions on the data, our algorithm achieves a label complexity of O(tpolylog(d,1ϵ))O(t \cdot \mathrm{polylog}(d, \frac 1 \epsilon)). In contrast, existing algorithms in this setting are either computationally inefficient, or subject to label requirements polynomial in dd or 1ϵ\frac 1 \epsilon.

View on arXiv
Comments on this paper