arXiv:1306.2194

Adaptive Noisy Clustering

10 June 2013
M. Chichignoud
S. Loustau
Abstract

The problem of adaptive noisy clustering is investigated. Given a set of noisy observations $Z_i = X_i + \epsilon_i$, $i = 1, \ldots, n$, the goal is to design clusters associated with the law of the $X_i$'s, with unknown density $f$ with respect to the Lebesgue measure. Since we observe a corrupted sample, a direct approach such as the popular $k$-means is not suitable in this case. In this paper, we propose a noisy $k$-means minimization, which is based on the $k$-means loss function and a deconvolution estimator of the density $f$. In particular, this approach suffers from its dependence on a bandwidth involved in the deconvolution kernel. Fast rates of convergence for the excess risk are obtained for a particular choice of the bandwidth, which depends on the smoothness of the density $f$. We then turn to the main issue of the paper: the data-driven choice of the bandwidth. We state an adaptive upper bound for a new selection rule, called ERC (Empirical Risk Comparison). This selection rule is based on Lepski's principle, where empirical risks associated with different bandwidths are compared. Finally, we illustrate that this adaptive rule can be used in many statistical problems of $M$-estimation where the empirical risk depends on a nuisance parameter.
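
To make the idea of comparing empirical risks across bandwidths concrete, here is a minimal sketch of a generic Lepski-type selection rule. It is not the paper's ERC rule: the function name `lepski_select`, the callables `fit`, `risk`, and `threshold`, and the exact form of the comparison are all assumptions made for illustration. The sketch keeps the largest bandwidth whose minimizer stays within a tolerance of the minimizer at every smaller bandwidth, measured in the empirical risk of the smaller bandwidth.

```python
from typing import Callable, Dict, Sequence, TypeVar

C = TypeVar("C")  # a candidate solution, e.g. a tuple of cluster centers


def lepski_select(
    bandwidths: Sequence[float],
    fit: Callable[[float], C],
    risk: Callable[[float, C], float],
    threshold: Callable[[float], float],
) -> float:
    """Generic Lepski-type bandwidth selection (illustrative, hypothetical API).

    fit(h)        -> minimizer of the empirical risk built with bandwidth h
    risk(h, c)    -> empirical risk of candidate c under bandwidth h
    threshold(h)  -> tolerance allowed when comparing against bandwidth h
    """
    hs = sorted(bandwidths)
    fitted: Dict[float, C] = {h: fit(h) for h in hs}

    selected = hs[0]  # the smallest bandwidth is always admissible
    for h in hs:
        # h is admissible if its minimizer is nearly as good as the
        # minimizer at every smaller bandwidth hp, judged by the hp-risk.
        admissible = all(
            risk(hp, fitted[h]) - risk(hp, fitted[hp]) <= threshold(hp)
            for hp in hs
            if hp < h
        )
        if admissible:
            selected = h  # hs is sorted, so this ends at the largest one
    return selected
```

In a toy setting where `fit(h)` returns `h` itself and `risk(h, c)` is `(c - h) ** 2`, a constant tolerance of `0.05` over the grid `[0.1, 0.2, 0.4, 0.8]` selects `0.2`: moving to `0.4` would cost `0.09` in the risk at bandwidth `0.1`, which exceeds the tolerance. In a real noisy clustering problem the tolerance would have to reflect the variability of the empirical risk at each bandwidth; the constant used here is only a placeholder.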
