44
8
v1v2 (latest)

Sample compression schemes for balls in graphs

Abstract

One of the open problems in machine learning is whether any set-family of VC-dimension dd admits a sample compression scheme of size O(d)O(d). In this paper, we study this problem for balls in graphs. For a ball B=Br(x)B=B_r(x) of a graph G=(V,E)G=(V,E), a realizable sample for BB is a signed subset X=(X+,X)X=(X^+,X^-) of VV such that BB contains X+X^+ and is disjoint from XX^-. A proper sample compression scheme of size kk consists of a compressor and a reconstructor. The compressor maps any realizable sample XX to a subsample XX' of size at most kk. The reconstructor maps each such subsample XX' to a ball BB' of GG such that BB' includes X+X^+ and is disjoint from XX^-. For balls of arbitrary radius rr, we design proper labeled sample compression schemes of size 22 for trees, of size 33 for cycles, of size 44 for interval graphs, of size 66 for trees of cycles, and of size 2222 for cube-free median graphs. For balls of a given radius, we design proper labeled sample compression schemes of size 22 for trees and of size 44 for interval graphs. We also design approximate sample compression schemes of size 2 for balls of δ\delta-hyperbolic graphs.

View on arXiv
Comments on this paper