Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.01028
Cited By
v1
v2 (latest)
Can SGD Learn Recurrent Neural Networks with Provable Generalization?
4 February 2019
Zeyuan Allen-Zhu
Yuanzhi Li
MLT
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Can SGD Learn Recurrent Neural Networks with Provable Generalization?"
26 / 26 papers shown
Title
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
Can Jin
Ying Li
Mingyu Zhao
Shiyu Zhao
Zhenting Wang
Xiaoxiao He
Ligong Han
Tong Che
Dimitris N. Metaxas
VPVLM
VLM
287
2
0
02 Feb 2025
On Generalization Bounds of a Family of Recurrent Neural Networks
Minshuo Chen
Xingguo Li
T. Zhao
61
71
0
28 Oct 2019
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
Zeyuan Allen-Zhu
Yuanzhi Li
Yingyu Liang
MLT
201
775
0
12 Nov 2018
A Convergence Theory for Deep Learning via Over-Parameterization
Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
AI4CE
ODL
266
1,469
0
09 Nov 2018
On the Convergence Rate of Training Recurrent Neural Networks
Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
178
192
0
29 Oct 2018
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
Yuanzhi Li
Yingyu Liang
MLT
219
653
0
03 Aug 2018
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization
Jiong Zhang
Qi Lei
Inderjit S. Dhillon
55
112
0
25 Mar 2018
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad
Sharan Sankar
Joseph Barfett
E. Colak
S. Valaee
AI4TS
105
584
0
29 Dec 2017
Algorithmic Regularization in Over-parameterized Matrix Sensing and Neural Networks with Quadratic Activations
Yuanzhi Li
Tengyu Ma
Hongyang R. Zhang
61
31
0
26 Dec 2017
Learning One-hidden-layer Neural Networks with Landscape Design
Rong Ge
Jason D. Lee
Tengyu Ma
MLT
206
262
0
01 Nov 2017
Theoretical properties of the global optimizer of two layer neural network
Digvijay Boob
Guanghui Lan
MLT
148
34
0
30 Oct 2017
Theoretical insights into the optimization landscape of over-parameterized shallow neural networks
Mahdi Soltanolkotabi
Adel Javanmard
Jason D. Lee
177
423
0
16 Jul 2017
Recovery Guarantees for One-hidden-layer Neural Networks
Kai Zhong
Zhao Song
Prateek Jain
Peter L. Bartlett
Inderjit S. Dhillon
MLT
181
337
0
10 Jun 2017
Convergence Analysis of Two-layer Neural Networks with ReLU Activation
Yuanzhi Li
Yang Yuan
MLT
155
652
0
28 May 2017
Machine Learning on Sequential Data Using a Recurrent Weighted Average
Jared Ostmeyer
L. Cowell
43
32
0
03 Mar 2017
An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis
Yuandong Tian
MLT
197
217
0
02 Mar 2017
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs
Alon Brutzkus
Amir Globerson
MLT
171
313
0
26 Feb 2017
Diverse Neural Network Learns True Target Functions
Bo Xie
Yingyu Liang
Le Song
179
140
0
09 Nov 2016
Gradient Descent Learns Linear Dynamical Systems
Moritz Hardt
Tengyu Ma
Benjamin Recht
110
240
0
16 Sep 2016
No bad local minima: Data independent training error guarantees for multilayer neural networks
Daniel Soudry
Y. Carmon
200
236
0
26 May 2016
Deep Learning without Poor Local Minima
Kenji Kawaguchi
ODL
224
925
0
23 May 2016
A vector-contraction inequality for Rademacher complexities
Andreas Maurer
77
261
0
01 May 2016
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
446
20,590
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
578
27,327
0
01 Sep 2014
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
230
8,526
0
22 Mar 2013
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
204
5,360
0
21 Nov 2012
1