ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.02828
32
22

Minimax Lower Bounds for Ridge Combinations Including Neural Nets

9 February 2017
Jason M. Klusowski
Andrew R. Barron
ArXivPDFHTML
Abstract

Estimation of functions of d d d variables is considered using ridge combinations of the form ∑k=1mc1,kϕ(∑j=1dc0,j,kxj−bk) \textstyle\sum_{k=1}^m c_{1,k} \phi(\textstyle\sum_{j=1}^d c_{0,j,k}x_j-b_k) ∑k=1m​c1,k​ϕ(∑j=1d​c0,j,k​xj​−bk​) where the activation function ϕ \phi ϕ is a function with bounded value and derivative. These include single-hidden layer neural networks, polynomials, and sinusoidal models. From a sample of size n n n of possibly noisy values at random sites X∈B=[−1,1]d X \in B = [-1,1]^d X∈B=[−1,1]d, the minimax mean square error is examined for functions in the closure of the ℓ1 \ell_1 ℓ1​ hull of ridge functions with activation ϕ \phi ϕ. It is shown to be of order d/n d/n d/n to a fractional power (when d d d is of smaller order than n n n), and to be of order (log⁡d)/n (\log d)/n (logd)/n to a fractional power (when d d d is of larger order than n n n). Dependence on constraints v0 v_0 v0​ and v1 v_1 v1​ on the ℓ1 \ell_1 ℓ1​ norms of inner parameter c0 c_0 c0​ and outer parameter c1 c_1 c1​, respectively, is also examined. Also, lower and upper bounds on the fractional power are given. The heart of the analysis is development of information-theoretic packing numbers for these classes of functions.

View on arXiv
Comments on this paper