Nearly Optimal Risk Bounds for Kernel K-Means
Abstract
In this paper, we study the statistical properties of the kernel -means and obtain a nearly optimal excess risk bound, substantially improving the state-of-art bounds in the existing clustering risk analyses. We further analyze the statistical effect of computational approximations of the Nystr\"{o}m kernel -means, and demonstrate that it achieves the same statistical accuracy as the exact kernel -means considering only Nystr\"{o}m landmark points. To the best of our knowledge, such sharp excess risk bounds for kernel (or approximate kernel) -means have never been seen before.
View on arXivComments on this paper
