ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.01547
27
2

Near-Optimal Algorithms for Gaussians with Huber Contamination: Mean Estimation and Linear Regression

4 December 2023
Ilias Diakonikolas
Daniel M. Kane
Ankit Pensia
Thanasis Pittas
ArXivPDFHTML
Abstract

We study the fundamental problems of Gaussian mean estimation and linear regression with Gaussian covariates in the presence of Huber contamination. Our main contribution is the design of the first sample near-optimal and almost linear-time algorithms with optimal error guarantees for both of these problems. Specifically, for Gaussian robust mean estimation on Rd\mathbb{R}^dRd with contamination parameter ϵ∈(0,ϵ0)\epsilon \in (0, \epsilon_0)ϵ∈(0,ϵ0​) for a small absolute constant ϵ0\epsilon_0ϵ0​, we give an algorithm with sample complexity n=O~(d/ϵ2)n = \tilde{O}(d/\epsilon^2)n=O~(d/ϵ2) and almost linear runtime that approximates the target mean within ℓ2\ell_2ℓ2​-error O(ϵ)O(\epsilon)O(ϵ). This improves on prior work that achieved this error guarantee with polynomially suboptimal sample and time complexity. For robust linear regression, we give the first algorithm with sample complexity n=O~(d/ϵ2)n = \tilde{O}(d/\epsilon^2)n=O~(d/ϵ2) and almost linear runtime that approximates the target regressor within ℓ2\ell_2ℓ2​-error O(ϵ)O(\epsilon)O(ϵ). This is the first polynomial sample and time algorithm achieving the optimal error guarantee, answering an open question in the literature. At the technical level, we develop a methodology that yields almost-linear time algorithms for multi-directional filtering that may be of broader interest.

View on arXiv
Comments on this paper