v1v2 (latest)

Can SGD Learn Recurrent Neural Networks with Provable Generalization?

4 February 2019

Papers citing "Can SGD Learn Recurrent Neural Networks with Provable Generalization?"

26 / 26 papers shown

Title
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation Can Jin Ying Li Mingyu Zhao Shiyu Zhao Zhenting Wang Xiaoxiao He Ligong Han Tong Che Dimitris N. Metaxas VPVLM VLM 287 2 0 02 Feb 2025
On Generalization Bounds of a Family of Recurrent Neural Networks Minshuo Chen Xingguo Li T. Zhao 61 71 0 28 Oct 2019
Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers Zeyuan Allen-Zhu Yuanzhi Li Yingyu Liang MLT 201 775 0 12 Nov 2018
A Convergence Theory for Deep Learning via Over-Parameterization Zeyuan Allen-Zhu Yuanzhi Li Zhao Song AI4CE ODL 266 1,469 0 09 Nov 2018
On the Convergence Rate of Training Recurrent Neural Networks Zeyuan Allen-Zhu Yuanzhi Li Zhao Song 178 192 0 29 Oct 2018
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data Yuanzhi Li Yingyu Liang MLT 219 653 0 03 Aug 2018
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization Jiong Zhang Qi Lei Inderjit S. Dhillon 55 112 0 25 Mar 2018
Recent Advances in Recurrent Neural Networks Hojjat Salehinejad Sharan Sankar Joseph Barfett E. Colak S. Valaee AI4TS 105 584 0 29 Dec 2017
Algorithmic Regularization in Over-parameterized Matrix Sensing and Neural Networks with Quadratic Activations Yuanzhi Li Tengyu Ma Hongyang R. Zhang 61 31 0 26 Dec 2017
Learning One-hidden-layer Neural Networks with Landscape Design Rong Ge Jason D. Lee Tengyu Ma MLT 206 262 0 01 Nov 2017
Theoretical properties of the global optimizer of two layer neural network Digvijay Boob Guanghui Lan MLT 148 34 0 30 Oct 2017
Theoretical insights into the optimization landscape of over-parameterized shallow neural networks Mahdi Soltanolkotabi Adel Javanmard Jason D. Lee 177 423 0 16 Jul 2017
Recovery Guarantees for One-hidden-layer Neural Networks Kai Zhong Zhao Song Prateek Jain Peter L. Bartlett Inderjit S. Dhillon MLT 181 337 0 10 Jun 2017
Convergence Analysis of Two-layer Neural Networks with ReLU Activation Yuanzhi Li Yang Yuan MLT 155 652 0 28 May 2017
Machine Learning on Sequential Data Using a Recurrent Weighted Average Jared Ostmeyer L. Cowell 43 32 0 03 Mar 2017
An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis Yuandong Tian MLT 197 217 0 02 Mar 2017
Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs Alon Brutzkus Amir Globerson MLT 171 313 0 26 Feb 2017
Diverse Neural Network Learns True Target Functions Bo Xie Yingyu Liang Le Song 179 140 0 09 Nov 2016
Gradient Descent Learns Linear Dynamical Systems Moritz Hardt Tengyu Ma Benjamin Recht 110 240 0 16 Sep 2016
No bad local minima: Data independent training error guarantees for multilayer neural networks Daniel Soudry Y. Carmon 200 236 0 26 May 2016
Deep Learning without Poor Local Minima Kenji Kawaguchi ODL 224 925 0 23 May 2016
A vector-contraction inequality for Rademacher complexities Andreas Maurer 77 261 0 01 May 2016
Sequence to Sequence Learning with Neural Networks Ilya Sutskever Oriol Vinyals Quoc V. Le AIMat 446 20,590 0 10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate Dzmitry Bahdanau Kyunghyun Cho Yoshua Bengio AIMat 578 27,327 0 01 Sep 2014
Speech Recognition with Deep Recurrent Neural Networks Alex Graves Abdel-rahman Mohamed Geoffrey E. Hinton 230 8,526 0 22 Mar 2013
On the difficulty of training Recurrent Neural Networks Razvan Pascanu Tomas Mikolov Yoshua Bengio ODL 204 5,360 0 21 Nov 2012