Nearly Optimal Risk Bounds for Kernel K-Means

9 March 2020

Yong Liu

Lizhong Ding

Weiping Wang

Wenqi Ren

Xiao Zhang

Shali Jiang

Xinwang Liu

Weiping Wang

ArXiv (abs)PDF HTML

Abstract

In this paper, we study the statistical properties of the kernel $k$ -means and obtain a nearly optimal excess risk bound, substantially improving the state-of-art bounds in the existing clustering risk analyses. We further analyze the statistical effect of computational approximations of the Nystr\"{o}m kernel $k$ -means, and demonstrate that it achieves the same statistical accuracy as the exact kernel $k$ -means considering only $\sqrt{nk}$ Nystr\"{o}m landmark points. To the best of our knowledge, such sharp excess risk bounds for kernel (or approximate kernel) $k$ -means have never been seen before.

View on arXiv

Comments on this paper