ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10923
19
13

ERM and RERM are optimal estimators for regression problems when malicious outliers corrupt the labels

24 October 2019
Chinot Geoffrey
ArXivPDFHTML
Abstract

We study Empirical Risk Minimizers (ERM) and Regularized Empirical Risk Minimizers (RERM) for regression problems with convex and LLL-Lipschitz loss functions. We consider a setting where ∣\cO∣|\cO|∣\cO∣ malicious outliers contaminate the labels. In that case, under a local Bernstein condition, we show that the L2L_2L2​-error rate is bounded by rN+AL∣\cO∣/N r_N + AL |\cO|/NrN​+AL∣\cO∣/N, where NNN is the total number of observations, rNr_NrN​ is the L2L_2L2​-error rate in the non-contaminated setting and AAA is a parameter coming from the local Bernstein condition. When rNr_NrN​ is minimax-rate-optimal in a non-contaminated setting, the rate rN+AL∣\cO∣/Nr_N + AL|\cO|/NrN​+AL∣\cO∣/N is also minimax-rate-optimal when ∣\cO∣|\cO|∣\cO∣ outliers contaminate the label. The main results of the paper can be used for many non-regularized and regularized procedures under weak assumptions on the noise. We present results for Huber's M-estimators (without penalization or regularized by the ℓ1\ell_1ℓ1​-norm) and for general regularized learning problems in reproducible kernel Hilbert spaces when the noise can be heavy-tailed.

View on arXiv
Comments on this paper