ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.12289
19
2

Beyond Moments: Robustly Learning Affine Transformations with Asymptotically Optimal Error

23 February 2023
He Jia
Pravesh Kothari
Santosh Vempala
ArXivPDFHTML
Abstract

We present a polynomial-time algorithm for robustly learning an unknown affine transformation of the standard hypercube from samples, an important and well-studied setting for independent component analysis (ICA). Specifically, given an ϵ\epsilonϵ-corrupted sample from a distribution DDD obtained by applying an unknown affine transformation x→Ax+sx \rightarrow Ax+sx→Ax+s to the uniform distribution on a ddd-dimensional hypercube [−1,1]d[-1,1]^d[−1,1]d, our algorithm constructs A^,s^\hat{A}, \hat{s}A^,s^ such that the total variation distance of the distribution D^\hat{D}D^ from DDD is O(ϵ)O(\epsilon)O(ϵ) using poly(d)(d)(d) time and samples. Total variation distance is the information-theoretically strongest possible notion of distance in our setting and our recovery guarantees in this distance are optimal up to the absolute constant factor multiplying ϵ\epsilonϵ. In particular, if the columns of AAA are normalized to be unit length, our total variation distance guarantee implies a bound on the sum of the ℓ2\ell_2ℓ2​ distances between the column vectors of AAA and A′A'A′, ∑i=1d∥ai−a^i∥2=O(ϵ)\sum_{i =1}^d \|a_i-\hat{a}_i\|_2 = O(\epsilon)∑i=1d​∥ai​−a^i​∥2​=O(ϵ). In contrast, the strongest known prior results only yield a ϵO(1)\epsilon^{O(1)}ϵO(1) (relative) bound on the distance between individual aia_iai​'s and their estimates and translate into an O(dϵ)O(d\epsilon)O(dϵ) bound on the total variation distance. Our key innovation is a new approach to ICA (even to outlier-free ICA) that circumvents the difficulties in the classical method of moments and instead relies on a new geometric certificate of correctness of an affine transformation. Our algorithm is based on a new method that iteratively improves an estimate of the unknown affine transformation whenever the requirements of the certificate are not met.

View on arXiv
Comments on this paper