Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.03804
Cited By
Gradient Descent Finds Global Minima of Deep Neural Networks
9 November 2018
S. Du
J. Lee
Haochuan Li
Liwei Wang
M. Tomizuka
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Gradient Descent Finds Global Minima of Deep Neural Networks"
50 / 763 papers shown
Title
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
George Andriopoulos
Soyuj Jung Basnet
Juan Guevara
Li Guo
Keith Ross
30
0
0
14 May 2025
SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures
Julian Kranz
Davide Gallon
Steffen Dereich
Arnulf Jentzen
19
0
0
14 May 2025
Divergence of Empirical Neural Tangent Kernel in Classification Problems
Zixiong Yu
Songtao Tian
Guhan Chen
23
0
0
15 Apr 2025
Statistically guided deep learning
Michael Kohler
A. Krzyżak
ODL
BDL
76
0
0
11 Apr 2025
Can Diffusion Models Disentangle? A Theoretical Perspective
Liming Wang
Muhammad Jehanzeb Mirza
Yishu Gong
Yuan Gong
Jiaqi Zhang
Brian Tracey
Katerina Placek
Marco Vilela
James Glass
DiffM
CoGe
87
0
0
31 Mar 2025
Towards Understanding the Optimization Mechanisms in Deep Learning
Binchuan Qi
Wei Gong
Li Li
49
0
0
29 Mar 2025
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou
Hang Le
Yichen Wang
SueYeon Chung
53
0
0
23 Mar 2025
On the Cone Effect in the Learning Dynamics
Zhanpeng Zhou
Yongyi Yang
Jie Ren
Mahito Sugiyama
Junchi Yan
53
0
0
20 Mar 2025
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction
Anni Zhou
Raheem Beyah
Rishikesan Kamaleswaran
43
0
0
20 Mar 2025
OFL: Opportunistic Federated Learning for Resource-Heterogeneous and Privacy-Aware Devices
Yunlong Mao
Mingyang Niu
Ziqin Dang
Chengxi Li
Hanning Xia
Yuejuan Zhu
Haoyu Bian
Yuan Zhang
Jingyu Hua
Sheng Zhong
FedML
55
0
0
19 Mar 2025
Global Convergence and Rich Feature Learning in
L
L
L
-Layer Infinite-Width Neural Networks under
μ
μ
μ
P Parametrization
Zixiang Chen
Greg Yang
Qingyue Zhao
Q. Gu
MLT
50
0
0
12 Mar 2025
A Near Complete Nonasymptotic Generalization Theory For Multilayer Neural Networks: Beyond the Bias-Variance Tradeoff
Hao Yu
Xiangyang Ji
AI4CE
60
0
0
03 Mar 2025
On the Saturation Effects of Spectral Algorithms in Large Dimensions
Weihao Lu
Haobo Zhang
Yicheng Li
Q. Lin
42
1
0
01 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
52
1
0
01 Mar 2025
Escaping from the Barren Plateau via Gaussian Initializations in Deep Variational Quantum Circuits
Kaining Zhang
Liu Liu
Min-hsiu Hsieh
Dacheng Tao
59
60
0
20 Feb 2025
Feature Learning Beyond the Edge of Stability
Dávid Terjék
MLT
46
0
0
18 Feb 2025
Stability and Generalization in Free Adversarial Training
Xiwei Cheng
Kexin Fu
Farzan Farnia
AAML
46
2
0
08 Jan 2025
How Do Artificial Intelligences Think? The Three Mathematico-Cognitive Factors of Categorical Segmentation Operated by Synthetic Neurons
Michael Pichat
William Pogrund
Armanush Gasparian
Paloma Pichat
Samuel Demarchi
Michael Veillet-Guillem
42
3
0
26 Dec 2024
What do physics-informed DeepONets learn? Understanding and improving training for scientific computing applications
Emily Williams
Amanda A. Howard
B. Meuris
P. Stinis
AI4CE
71
0
0
27 Nov 2024
Multi-Label Bayesian Active Learning with Inter-Label Relationships
Yuanyuan Qi
Jueqing Lu
Xiaohao Yang
Joanne Enticott
Lan Du
69
0
0
26 Nov 2024
Computational metaoptics for imaging
Charles Roques-Carmes
Kai Wang
Yanting Yang
A. Majumdar
Zin Lin
24
1
0
14 Nov 2024
Unraveling the Gradient Descent Dynamics of Transformers
Bingqing Song
Boran Han
Shuai Zhang
Jie Ding
Mingyi Hong
AI4CE
39
1
0
12 Nov 2024
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
108
0
0
08 Nov 2024
PageRank Bandits for Link Prediction
Yikun Ban
Jiaru Zou
Zihao Li
Yunzhe Qi
Dongqi Fu
Jian Kang
Hanghang Tong
Jingrui He
36
2
0
03 Nov 2024
Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame
Evan Markou
Thalaiyasingam Ajanthan
Stephen Gould
26
0
0
02 Nov 2024
Generalizability of Memorization Neural Networks
Lijia Yu
Xiao-Shan Gao
Lijun Zhang
Yibo Miao
33
1
0
01 Nov 2024
CaAdam: Improving Adam optimizer using connection aware methods
Remi Genet
Hugo Inzirillo
31
0
0
31 Oct 2024
Loss Landscape Characterization of Neural Networks without Over-Parametrization
Rustem Islamov
Niccolò Ajroldi
Antonio Orvieto
Aurelien Lucchi
35
4
0
16 Oct 2024
Sharper Guarantees for Learning Neural Network Classifiers with Gradient Methods
Hossein Taheri
Christos Thrampoulidis
Arya Mazumdar
MLT
36
0
0
13 Oct 2024
BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation
Peijia Qin
Ruiyi Zhang
Pengtao Xie
31
1
0
13 Oct 2024
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data
Binghui Li
Yuanzhi Li
OOD
30
2
0
11 Oct 2024
Neuropsychology of AI: Relationship Between Activation Proximity and Categorical Proximity Within Neural Categories of Synthetic Cognition
Michael Pichat
Enola Campoli
William Pogrund
Jourdan Wilson
Michael Veillet-Guillem
Anton Melkozerov
Paloma Pichat
Armanouche Gasparian
Samuel Demarchi
Judicael Poumay
NAI
51
3
0
08 Oct 2024
On the Impacts of the Random Initialization in the Neural Tangent Kernel Theory
Guhan Chen
Yicheng Li
Qian Lin
AAML
38
1
0
08 Oct 2024
Extended convexity and smoothness and their applications in deep learning
Binchuan Qi
Wei Gong
Li Li
61
0
0
08 Oct 2024
Simplicity bias and optimization threshold in two-layer ReLU networks
Etienne Boursier
Nicolas Flammarion
31
2
0
03 Oct 2024
From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks
Clémentine Dominé
Nicolas Anguita
A. Proca
Lukas Braun
D. Kunin
P. Mediano
Andrew M. Saxe
38
3
0
22 Sep 2024
Monomial Matrix Group Equivariant Neural Functional Networks
Hoang V. Tran
Thieu N. Vo
Tho H. Tran
An T. Nguyen
Tan M. Nguyen
54
5
0
18 Sep 2024
On the Convergence Analysis of Over-Parameterized Variational Autoencoders: A Neural Tangent Kernel Perspective
Li Wang
Wei Huang
DRL
21
0
0
09 Sep 2024
On the Pinsker bound of inner product kernel regression in large dimensions
Weihao Lu
Jialin Ding
Haobo Zhang
Qian Lin
52
0
0
02 Sep 2024
Absence of Closed-Form Descriptions for Gradient Flow in Two-Layer Narrow Networks
Yeachan Park
AI4CE
27
0
0
15 Aug 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
42
2
0
10 Aug 2024
Evaluating the design space of diffusion-based generative models
Yuqing Wang
Ye He
Molei Tao
DiffM
36
5
0
18 Jun 2024
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
Lu Li
T. Zhang
Zhiqi Bu
Suyuchen Wang
Huan He
Jie Fu
Yonghui Wu
Jiang Bian
Yong Chen
Yoshua Bengio
FedML
MoMe
100
3
0
11 Jun 2024
Loss Gradient Gaussian Width based Generalization and Optimization Guarantees
A. Banerjee
Qiaobo Li
Yingxue Zhou
49
0
0
11 Jun 2024
Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning
D. Kunin
Allan Raventós
Clémentine Dominé
Feng Chen
David Klindt
Andrew M. Saxe
Surya Ganguli
MLT
45
15
0
10 Jun 2024
Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability
Junqi Gao
Biqing Qi
Yao Li
Zhichang Guo
Dong Li
Yuming Xing
Dazhi Zhang
AAML
34
6
0
08 Jun 2024
Reparameterization invariance in approximate Bayesian inference
Hrittik Roy
M. Miani
Carl Henrik Ek
Philipp Hennig
Marvin Pfortner
Lukas Tatzel
Søren Hauberg
BDL
47
8
0
05 Jun 2024
Improving Generalization and Convergence by Enhancing Implicit Regularization
Mingze Wang
Haotian He
Jinbo Wang
Zilin Wang
Guanhua Huang
Feiyu Xiong
Zhiyu Li
E. Weinan
Lei Wu
45
6
0
31 May 2024
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
Mohammed Nowaz Rabbani Chowdhury
Meng Wang
Kaoutar El Maghraoui
Naigang Wang
Pin-Yu Chen
Christopher Carothers
MoE
36
4
0
26 May 2024
Novel Kernel Models and Exact Representor Theory for Neural Networks Beyond the Over-Parameterized Regime
A. Shilton
Sunil R. Gupta
Santu Rana
Svetha Venkatesh
34
0
0
24 May 2024
1
2
3
4
...
14
15
16
Next