ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.01028
  4. Cited By
Can SGD Learn Recurrent Neural Networks with Provable Generalization?
v1v2 (latest)

Can SGD Learn Recurrent Neural Networks with Provable Generalization?

4 February 2019
Zeyuan Allen-Zhu
Yuanzhi Li
    MLTLRM
ArXiv (abs)PDFHTML

Papers citing "Can SGD Learn Recurrent Neural Networks with Provable Generalization?"

26 / 26 papers shown
Title
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
Can Jin
Ying Li
Mingyu Zhao
Shiyu Zhao
Zhenting Wang
Xiaoxiao He
Ligong Han
Tong Che
Dimitris N. Metaxas
VPVLMVLM
287
2
0
02 Feb 2025
On Generalization Bounds of a Family of Recurrent Neural Networks
On Generalization Bounds of a Family of Recurrent Neural Networks
Minshuo Chen
Xingguo Li
T. Zhao
61
71
0
28 Oct 2019
Learning and Generalization in Overparameterized Neural Networks, Going
  Beyond Two Layers
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
Zeyuan Allen-Zhu
Yuanzhi Li
Yingyu Liang
MLT
201
775
0
12 Nov 2018
A Convergence Theory for Deep Learning via Over-Parameterization
A Convergence Theory for Deep Learning via Over-Parameterization
Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
AI4CEODL
266
1,469
0
09 Nov 2018
On the Convergence Rate of Training Recurrent Neural Networks
On the Convergence Rate of Training Recurrent Neural Networks
Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
178
192
0
29 Oct 2018
Learning Overparameterized Neural Networks via Stochastic Gradient
  Descent on Structured Data
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
Yuanzhi Li
Yingyu Liang
MLT
219
653
0
03 Aug 2018
Stabilizing Gradients for Deep Neural Networks via Efficient SVD
  Parameterization
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization
Jiong Zhang
Qi Lei
Inderjit S. Dhillon
55
112
0
25 Mar 2018
Recent Advances in Recurrent Neural Networks
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad
Sharan Sankar
Joseph Barfett
E. Colak
S. Valaee
AI4TS
105
584
0
29 Dec 2017
Algorithmic Regularization in Over-parameterized Matrix Sensing and
  Neural Networks with Quadratic Activations
Algorithmic Regularization in Over-parameterized Matrix Sensing and Neural Networks with Quadratic Activations
Yuanzhi Li
Tengyu Ma
Hongyang R. Zhang
61
31
0
26 Dec 2017
Learning One-hidden-layer Neural Networks with Landscape Design
Learning One-hidden-layer Neural Networks with Landscape Design
Rong Ge
Jason D. Lee
Tengyu Ma
MLT
206
262
0
01 Nov 2017
Theoretical properties of the global optimizer of two layer neural
  network
Theoretical properties of the global optimizer of two layer neural network
Digvijay Boob
Guanghui Lan
MLT
148
34
0
30 Oct 2017
Theoretical insights into the optimization landscape of
  over-parameterized shallow neural networks
Theoretical insights into the optimization landscape of over-parameterized shallow neural networks
Mahdi Soltanolkotabi
Adel Javanmard
Jason D. Lee
177
423
0
16 Jul 2017
Recovery Guarantees for One-hidden-layer Neural Networks
Recovery Guarantees for One-hidden-layer Neural Networks
Kai Zhong
Zhao Song
Prateek Jain
Peter L. Bartlett
Inderjit S. Dhillon
MLT
181
337
0
10 Jun 2017
Convergence Analysis of Two-layer Neural Networks with ReLU Activation
Convergence Analysis of Two-layer Neural Networks with ReLU Activation
Yuanzhi Li
Yang Yuan
MLT
155
652
0
28 May 2017
Machine Learning on Sequential Data Using a Recurrent Weighted Average
Machine Learning on Sequential Data Using a Recurrent Weighted Average
Jared Ostmeyer
L. Cowell
43
32
0
03 Mar 2017
An Analytical Formula of Population Gradient for two-layered ReLU
  network and its Applications in Convergence and Critical Point Analysis
An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis
Yuandong Tian
MLT
197
217
0
02 Mar 2017
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs
Alon Brutzkus
Amir Globerson
MLT
171
313
0
26 Feb 2017
Diverse Neural Network Learns True Target Functions
Diverse Neural Network Learns True Target Functions
Bo Xie
Yingyu Liang
Le Song
179
140
0
09 Nov 2016
Gradient Descent Learns Linear Dynamical Systems
Gradient Descent Learns Linear Dynamical Systems
Moritz Hardt
Tengyu Ma
Benjamin Recht
110
240
0
16 Sep 2016
No bad local minima: Data independent training error guarantees for
  multilayer neural networks
No bad local minima: Data independent training error guarantees for multilayer neural networks
Daniel Soudry
Y. Carmon
200
236
0
26 May 2016
Deep Learning without Poor Local Minima
Deep Learning without Poor Local Minima
Kenji Kawaguchi
ODL
224
925
0
23 May 2016
A vector-contraction inequality for Rademacher complexities
A vector-contraction inequality for Rademacher complexities
Andreas Maurer
77
261
0
01 May 2016
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
446
20,590
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
578
27,327
0
01 Sep 2014
Speech Recognition with Deep Recurrent Neural Networks
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
230
8,526
0
22 Mar 2013
On the difficulty of training Recurrent Neural Networks
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
204
5,360
0
21 Nov 2012
1