ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.03804
  4. Cited By
Gradient Descent Finds Global Minima of Deep Neural Networks
v1v2v3v4 (latest)

Gradient Descent Finds Global Minima of Deep Neural Networks

9 November 2018
S. Du
Jason D. Lee
Haochuan Li
Liwei Wang
Masayoshi Tomizuka
    ODL
ArXiv (abs)PDFHTML

Papers citing "Gradient Descent Finds Global Minima of Deep Neural Networks"

50 / 466 papers shown
Title
A Finite-Time Analysis of Q-Learning with Neural Network Function
  Approximation
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation
Pan Xu
Quanquan Gu
90
68
0
10 Dec 2019
Neural Tangents: Fast and Easy Infinite Neural Networks in Python
Neural Tangents: Fast and Easy Infinite Neural Networks in Python
Roman Novak
Lechao Xiao
Jiri Hron
Jaehoon Lee
Alexander A. Alemi
Jascha Narain Sohl-Dickstein
S. Schoenholz
100
231
0
05 Dec 2019
Towards Understanding the Spectral Bias of Deep Learning
Towards Understanding the Spectral Bias of Deep Learning
Yuan Cao
Zhiying Fang
Yue Wu
Ding-Xuan Zhou
Quanquan Gu
124
220
0
03 Dec 2019
Variable Selection with Rigorous Uncertainty Quantification using Deep
  Bayesian Neural Networks: Posterior Concentration and Bernstein-von Mises
  Phenomenon
Variable Selection with Rigorous Uncertainty Quantification using Deep Bayesian Neural Networks: Posterior Concentration and Bernstein-von Mises Phenomenon
Jeremiah Zhe Liu
BDL
143
8
0
03 Dec 2019
How Much Over-parameterization Is Sufficient to Learn Deep ReLU
  Networks?
How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks?
Zixiang Chen
Yuan Cao
Difan Zou
Quanquan Gu
77
123
0
27 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control
  problem with state constraints
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
70
65
0
26 Nov 2019
Information-Theoretic Local Minima Characterization and Regularization
Information-Theoretic Local Minima Characterization and Regularization
Zhiwei Jia
Hao Su
75
19
0
19 Nov 2019
Convex Formulation of Overparameterized Deep Neural Networks
Convex Formulation of Overparameterized Deep Neural Networks
Cong Fang
Yihong Gu
Weizhong Zhang
Tong Zhang
80
28
0
18 Nov 2019
Asymptotics of Reinforcement Learning with Neural Networks
Asymptotics of Reinforcement Learning with Neural Networks
Justin A. Sirignano
K. Spiliopoulos
MLT
98
14
0
13 Nov 2019
Tight Sample Complexity of Learning One-hidden-layer Convolutional
  Neural Networks
Tight Sample Complexity of Learning One-hidden-layer Convolutional Neural Networks
Yuan Cao
Quanquan Gu
MLT
78
19
0
12 Nov 2019
Neural Contextual Bandits with UCB-based Exploration
Neural Contextual Bandits with UCB-based Exploration
Dongruo Zhou
Lihong Li
Quanquan Gu
132
15
0
11 Nov 2019
Sub-Optimal Local Minima Exist for Neural Networks with Almost All
  Non-Linear Activations
Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations
Tian Ding
Dawei Li
Ruoyu Sun
91
13
0
04 Nov 2019
Enhanced Convolutional Neural Tangent Kernels
Enhanced Convolutional Neural Tangent Kernels
Zhiyuan Li
Ruosong Wang
Dingli Yu
S. Du
Wei Hu
Ruslan Salakhutdinov
Sanjeev Arora
76
133
0
03 Nov 2019
Poincaré Recurrence, Cycles and Spurious Equilibria in
  Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games
Poincaré Recurrence, Cycles and Spurious Equilibria in Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games
Lampros Flokas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Georgios Piliouras
MLT
88
41
0
28 Oct 2019
Image recognition from raw labels collected without annotators
Image recognition from raw labels collected without annotators
Fatih Yilmaz
Reinhard Heckel
NoLa
73
7
0
20 Oct 2019
Active Learning for Graph Neural Networks via Node Feature Propagation
Active Learning for Graph Neural Networks via Node Feature Propagation
Yuexin Wu
Yichong Xu
Aarti Singh
Yiming Yang
A. Dubrawski
GNNAI4CE
101
65
0
16 Oct 2019
The Local Elasticity of Neural Networks
The Local Elasticity of Neural Networks
Hangfeng He
Weijie J. Su
149
46
0
15 Oct 2019
Algorithm-Dependent Generalization Bounds for Overparameterized Deep
  Residual Networks
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks
Spencer Frei
Yuan Cao
Quanquan Gu
ODL
82
31
0
07 Oct 2019
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
Sanjeev Arora
S. Du
Zhiyuan Li
Ruslan Salakhutdinov
Ruosong Wang
Dingli Yu
AAML
78
162
0
03 Oct 2019
Beyond Linearization: On Quadratic and Higher-Order Approximation of
  Wide Neural Networks
Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks
Yu Bai
Jason D. Lee
67
116
0
03 Oct 2019
How noise affects the Hessian spectrum in overparameterized neural
  networks
How noise affects the Hessian spectrum in overparameterized neural networks
Ming-Bo Wei
D. Schwab
85
28
0
01 Oct 2019
Overparameterized Neural Networks Implement Associative Memory
Overparameterized Neural Networks Implement Associative Memory
Adityanarayanan Radhakrishnan
M. Belkin
Caroline Uhler
BDL
97
74
0
26 Sep 2019
Polylogarithmic width suffices for gradient descent to achieve
  arbitrarily small test error with shallow ReLU networks
Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks
Ziwei Ji
Matus Telgarsky
98
178
0
26 Sep 2019
Asymptotics of Wide Networks from Feynman Diagrams
Asymptotics of Wide Networks from Feynman Diagrams
Ethan Dyer
Guy Gur-Ari
109
115
0
25 Sep 2019
Classification Logit Two-sample Testing by Neural Networks
Classification Logit Two-sample Testing by Neural Networks
Xiuyuan Cheng
A. Cloninger
89
33
0
25 Sep 2019
Sample Efficient Policy Gradient Methods with Recursive Variance
  Reduction
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction
Pan Xu
F. Gao
Quanquan Gu
117
89
0
18 Sep 2019
Dynamics of Deep Neural Networks and Neural Tangent Hierarchy
Dynamics of Deep Neural Networks and Neural Tangent Hierarchy
Jiaoyang Huang
H. Yau
62
151
0
18 Sep 2019
Relaxed Actor-Critic with Convergence Guarantees for Continuous-Time
  Optimal Control of Nonlinear Systems
Relaxed Actor-Critic with Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems
Jingliang Duan
Jie Li
Qiang Ge
Shengbo Eben Li
Monimoy Bujarbaruah
Fei Ma
Dezhao Zhang
21
16
0
11 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of
  Convergence
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
113
242
0
29 Aug 2019
Deep Learning Theory Review: An Optimal Control and Dynamical Systems
  Perspective
Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective
Guan-Horng Liu
Evangelos A. Theodorou
AI4CE
121
72
0
28 Aug 2019
Stochastic AUC Maximization with Deep Neural Networks
Stochastic AUC Maximization with Deep Neural Networks
Mingrui Liu
Zhuoning Yuan
Yiming Ying
Tianbao Yang
57
109
0
28 Aug 2019
Theoretical Issues in Deep Networks: Approximation, Optimization and
  Generalization
Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization
T. Poggio
Andrzej Banburski
Q. Liao
ODL
126
165
0
25 Aug 2019
Effect of Activation Functions on the Training of Overparametrized
  Neural Nets
Effect of Activation Functions on the Training of Overparametrized Neural Nets
A. Panigrahi
Abhishek Shetty
Navin Goyal
81
21
0
16 Aug 2019
The generalization error of random features regression: Precise
  asymptotics and double descent curve
The generalization error of random features regression: Precise asymptotics and double descent curve
Song Mei
Andrea Montanari
162
640
0
14 Aug 2019
Convergence Rates of Variational Inference in Sparse Deep Learning
Convergence Rates of Variational Inference in Sparse Deep Learning
Badr-Eddine Chérief-Abdellatif
BDL
122
39
0
09 Aug 2019
How Does Learning Rate Decay Help Modern Neural Networks?
How Does Learning Rate Decay Help Modern Neural Networks?
Kaichao You
Mingsheng Long
Jianmin Wang
Michael I. Jordan
66
4
0
05 Aug 2019
Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization
Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization
Xinyan Li
Qilong Gu
Yingxue Zhou
Tiancong Chen
A. Banerjee
ODL
88
52
0
24 Jul 2019
Trainability of ReLU networks and Data-dependent Initialization
Trainability of ReLU networks and Data-dependent Initialization
Yeonjong Shin
George Karniadakis
50
8
0
23 Jul 2019
Surfing: Iterative optimization over incrementally trained deep networks
Surfing: Iterative optimization over incrementally trained deep networks
Ganlin Song
Z. Fan
John D. Lafferty
73
20
0
19 Jul 2019
Two-block vs. Multi-block ADMM: An empirical evaluation of convergence
Two-block vs. Multi-block ADMM: An empirical evaluation of convergence
A. Gonçalves
Xiaoliang Liu
A. Banerjee
33
5
0
10 Jul 2019
Scaling Limit of Neural Networks with the Xavier Initialization and
  Convergence to a Global Minimum
Scaling Limit of Neural Networks with the Xavier Initialization and Convergence to a Global Minimum
Justin A. Sirignano
K. Spiliopoulos
58
14
0
09 Jul 2019
Deep Learning based Wireless Resource Allocation with Application to
  Vehicular Networks
Deep Learning based Wireless Resource Allocation with Application to Vehicular Networks
Le Liang
Hao Ye
Guanding Yu
Geoffrey Ye Li
71
200
0
07 Jul 2019
Benign Overfitting in Linear Regression
Benign Overfitting in Linear Regression
Peter L. Bartlett
Philip M. Long
Gábor Lugosi
Alexander Tsigler
MLT
126
780
0
26 Jun 2019
Limitations of Lazy Training of Two-layers Neural Networks
Limitations of Lazy Training of Two-layers Neural Networks
Behrooz Ghorbani
Song Mei
Theodor Misiakiewicz
Andrea Montanari
MLT
62
143
0
21 Jun 2019
Convergence of Adversarial Training in Overparametrized Neural Networks
Convergence of Adversarial Training in Overparametrized Neural Networks
Ruiqi Gao
Tianle Cai
Haochuan Li
Liwei Wang
Cho-Jui Hsieh
Jason D. Lee
AAML
113
109
0
19 Jun 2019
Gradient Dynamics of Shallow Univariate ReLU Networks
Gradient Dynamics of Shallow Univariate ReLU Networks
Francis Williams
Matthew Trager
Claudio Silva
Daniele Panozzo
Denis Zorin
Joan Bruna
70
80
0
18 Jun 2019
Approximation power of random neural networks
Bolton Bailey
Ziwei Ji
Matus Telgarsky
Ruicheng Xian
55
6
0
18 Jun 2019
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks
Kaifeng Lyu
Jian Li
114
336
0
13 Jun 2019
Kernel and Rich Regimes in Overparametrized Models
Blake E. Woodworth
Suriya Gunasekar
Pedro H. P. Savarese
E. Moroshko
Itay Golan
Jason D. Lee
Daniel Soudry
Nathan Srebro
91
367
0
13 Jun 2019
Generalization Guarantees for Neural Networks via Harnessing the
  Low-rank Structure of the Jacobian
Generalization Guarantees for Neural Networks via Harnessing the Low-rank Structure of the Jacobian
Samet Oymak
Zalan Fabian
Mingchen Li
Mahdi Soltanolkotabi
MLT
89
88
0
12 Jun 2019
Previous
123...10789
Next