v1v2v3v4 (latest)

Gradient Descent Finds Global Minima of Deep Neural Networks

9 November 2018

Papers citing "Gradient Descent Finds Global Minima of Deep Neural Networks"

50 / 466 papers shown

Title
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation Pan Xu Quanquan Gu 90 68 0 10 Dec 2019
Neural Tangents: Fast and Easy Infinite Neural Networks in Python Roman Novak Lechao Xiao Jiri Hron Jaehoon Lee Alexander A. Alemi Jascha Narain Sohl-Dickstein S. Schoenholz 100 231 0 05 Dec 2019
Towards Understanding the Spectral Bias of Deep Learning Yuan Cao Zhiying Fang Yue Wu Ding-Xuan Zhou Quanquan Gu 124 220 0 03 Dec 2019
Variable Selection with Rigorous Uncertainty Quantification using Deep Bayesian Neural Networks: Posterior Concentration and Bernstein-von Mises Phenomenon Jeremiah Zhe Liu BDL 143 8 0 03 Dec 2019
How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks? Zixiang Chen Yuan Cao Difan Zou Quanquan Gu 77 123 0 27 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints Jingliang Duan Zhengyu Liu Shengbo Eben Li Qi Sun Zhenzhong Jia B. Cheng 70 65 0 26 Nov 2019
Information-Theoretic Local Minima Characterization and Regularization Zhiwei Jia Hao Su 75 19 0 19 Nov 2019
Convex Formulation of Overparameterized Deep Neural Networks Cong Fang Yihong Gu Weizhong Zhang Tong Zhang 80 28 0 18 Nov 2019
Asymptotics of Reinforcement Learning with Neural Networks Justin A. Sirignano K. Spiliopoulos MLT 98 14 0 13 Nov 2019
Tight Sample Complexity of Learning One-hidden-layer Convolutional Neural Networks Yuan Cao Quanquan Gu MLT 78 19 0 12 Nov 2019
Neural Contextual Bandits with UCB-based Exploration Dongruo Zhou Lihong Li Quanquan Gu 132 15 0 11 Nov 2019
Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations Tian Ding Dawei Li Ruoyu Sun 91 13 0 04 Nov 2019
Enhanced Convolutional Neural Tangent Kernels Zhiyuan Li Ruosong Wang Dingli Yu S. Du Wei Hu Ruslan Salakhutdinov Sanjeev Arora 76 133 0 03 Nov 2019
Poincaré Recurrence, Cycles and Spurious Equilibria in Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games Lampros Flokas Emmanouil-Vasileios Vlatakis-Gkaragkounis Georgios Piliouras MLT 88 41 0 28 Oct 2019
Image recognition from raw labels collected without annotators Fatih Yilmaz Reinhard Heckel NoLa 73 7 0 20 Oct 2019
Active Learning for Graph Neural Networks via Node Feature Propagation Yuexin Wu Yichong Xu Aarti Singh Yiming Yang A. Dubrawski GNN AI4CE 101 65 0 16 Oct 2019
The Local Elasticity of Neural Networks Hangfeng He Weijie J. Su 149 46 0 15 Oct 2019
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks Spencer Frei Yuan Cao Quanquan Gu ODL 82 31 0 07 Oct 2019
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks Sanjeev Arora S. Du Zhiyuan Li Ruslan Salakhutdinov Ruosong Wang Dingli Yu AAML 78 162 0 03 Oct 2019
Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks Yu Bai Jason D. Lee 67 116 0 03 Oct 2019
How noise affects the Hessian spectrum in overparameterized neural networks Ming-Bo Wei D. Schwab 85 28 0 01 Oct 2019
Overparameterized Neural Networks Implement Associative Memory Adityanarayanan Radhakrishnan M. Belkin Caroline Uhler BDL 97 74 0 26 Sep 2019
Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks Ziwei Ji Matus Telgarsky 98 178 0 26 Sep 2019
Asymptotics of Wide Networks from Feynman Diagrams Ethan Dyer Guy Gur-Ari 109 115 0 25 Sep 2019
Classification Logit Two-sample Testing by Neural Networks Xiuyuan Cheng A. Cloninger 89 33 0 25 Sep 2019
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction Pan Xu F. Gao Quanquan Gu 117 89 0 18 Sep 2019
Dynamics of Deep Neural Networks and Neural Tangent Hierarchy Jiaoyang Huang H. Yau 62 151 0 18 Sep 2019
Relaxed Actor-Critic with Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems Jingliang Duan Jie Li Qiang Ge Shengbo Eben Li Monimoy Bujarbaruah Fei Ma Dezhao Zhang 21 16 0 11 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence Lingxiao Wang Qi Cai Zhuoran Yang Zhaoran Wang 113 242 0 29 Aug 2019
Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective Guan-Horng Liu Evangelos A. Theodorou AI4CE 121 72 0 28 Aug 2019
Stochastic AUC Maximization with Deep Neural Networks Mingrui Liu Zhuoning Yuan Yiming Ying Tianbao Yang 57 109 0 28 Aug 2019
Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization T. Poggio Andrzej Banburski Q. Liao ODL 126 165 0 25 Aug 2019
Effect of Activation Functions on the Training of Overparametrized Neural Nets A. Panigrahi Abhishek Shetty Navin Goyal 81 21 0 16 Aug 2019
The generalization error of random features regression: Precise asymptotics and double descent curve Song Mei Andrea Montanari 162 640 0 14 Aug 2019
Convergence Rates of Variational Inference in Sparse Deep Learning Badr-Eddine Chérief-Abdellatif BDL 122 39 0 09 Aug 2019
How Does Learning Rate Decay Help Modern Neural Networks? Kaichao You Mingsheng Long Jianmin Wang Michael I. Jordan 66 4 0 05 Aug 2019
Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization Xinyan Li Qilong Gu Yingxue Zhou Tiancong Chen A. Banerjee ODL 88 52 0 24 Jul 2019
Trainability of ReLU networks and Data-dependent Initialization Yeonjong Shin George Karniadakis 50 8 0 23 Jul 2019
Surfing: Iterative optimization over incrementally trained deep networks Ganlin Song Z. Fan John D. Lafferty 73 20 0 19 Jul 2019
Two-block vs. Multi-block ADMM: An empirical evaluation of convergence A. Gonçalves Xiaoliang Liu A. Banerjee 33 5 0 10 Jul 2019
Scaling Limit of Neural Networks with the Xavier Initialization and Convergence to a Global Minimum Justin A. Sirignano K. Spiliopoulos 58 14 0 09 Jul 2019
Deep Learning based Wireless Resource Allocation with Application to Vehicular Networks Le Liang Hao Ye Guanding Yu Geoffrey Ye Li 71 200 0 07 Jul 2019
Benign Overfitting in Linear Regression Peter L. Bartlett Philip M. Long Gábor Lugosi Alexander Tsigler MLT 126 780 0 26 Jun 2019
Limitations of Lazy Training of Two-layers Neural Networks Behrooz Ghorbani Song Mei Theodor Misiakiewicz Andrea Montanari MLT 62 143 0 21 Jun 2019
Convergence of Adversarial Training in Overparametrized Neural Networks Ruiqi Gao Tianle Cai Haochuan Li Liwei Wang Cho-Jui Hsieh Jason D. Lee AAML 113 109 0 19 Jun 2019
Gradient Dynamics of Shallow Univariate ReLU Networks Francis Williams Matthew Trager Claudio Silva Daniele Panozzo Denis Zorin Joan Bruna 70 80 0 18 Jun 2019
Approximation power of random neural networks Bolton Bailey Ziwei Ji Matus Telgarsky Ruicheng Xian 55 6 0 18 Jun 2019
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks Kaifeng Lyu Jian Li 114 336 0 13 Jun 2019
Kernel and Rich Regimes in Overparametrized Models Blake E. Woodworth Suriya Gunasekar Pedro H. P. Savarese E. Moroshko Itay Golan Jason D. Lee Daniel Soudry Nathan Srebro 91 367 0 13 Jun 2019
Generalization Guarantees for Neural Networks via Harnessing the Low-rank Structure of the Jacobian Samet Oymak Zalan Fabian Mingchen Li Mahdi Soltanolkotabi MLT 89 88 0 12 Jun 2019