Learning a Single Neuron with Gradient Methods

15 January 2020
Gilad Yehudai
Ohad Shamir
Abstract

We consider the fundamental problem of learning a single neuron $x \mapsto \sigma(w^\top x)$ using standard gradient methods. As opposed to previous works, which considered specific (and not always realistic) input distributions and activation functions $\sigma(\cdot)$, we ask whether a more general result is attainable under milder assumptions. On the one hand, we show that some assumptions on the distribution and the activation function are necessary. On the other hand, we prove positive guarantees under mild assumptions, which go beyond those studied in the literature so far. We also point out and study the challenges in further strengthening and generalizing our results.
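
To make the setting concrete, here is a minimal sketch of learning a single neuron with gradient descent. Every specific choice below (ReLU activation, standard Gaussian inputs, realizable noiseless labels, squared loss, step size, sample size) is an illustrative assumption for this example only; the paper's point is precisely to ask what holds beyond such specific choices.

```python
import numpy as np

# Sketch: learn a single neuron x -> sigma(w^T x) by gradient descent
# on the empirical squared loss. ReLU activation, Gaussian inputs, and
# all hyperparameters are illustrative assumptions, not the paper's
# (deliberately more general) setting.

rng = np.random.default_rng(0)
d, n = 10, 5000

def sigma(z):
    return np.maximum(z, 0.0)  # ReLU (assumed activation)

X = rng.standard_normal((n, d))    # assumed input distribution: N(0, I)
w_star = rng.standard_normal(d)    # unknown target neuron
y = sigma(X @ w_star)              # realizable labels, no noise (assumption)

w = 0.01 * rng.standard_normal(d)  # small random initialization
lr = 0.1

for _ in range(1000):
    z = X @ w
    pred = sigma(z)
    # Gradient of (1/2n) * sum_i (sigma(w^T x_i) - y_i)^2 with respect
    # to w, using sigma'(z) = 1{z > 0} for ReLU.
    grad = (X * ((pred - y) * (z > 0.0))[:, None]).mean(axis=0)
    w -= lr * grad

print("squared loss:", np.mean((sigma(X @ w) - y) ** 2))
print("distance to target:", np.linalg.norm(w - w_star))
```

In this benign regime the loss typically drives toward zero; the paper's negative results indicate that without some assumptions on the input distribution and on $\sigma(\cdot)$, such gradient dynamics can fail.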
