On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport

24 May 2018

Papers citing "On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport"

33 / 483 papers shown

Title
On the Power and Limitations of Random Features for Understanding Neural Networks Gilad Yehudai Ohad Shamir MLT 18 180 0 01 Apr 2019
Gradient Descent with Early Stopping is Provably Robust to Label Noise for Overparameterized Neural Networks Mingchen Li Mahdi Soltanolkotabi Samet Oymak NoLa 45 351 0 27 Mar 2019
Surprises in High-Dimensional Ridgeless Least Squares Interpolation Trevor Hastie Andrea Montanari Saharon Rosset R. Tibshirani 31 728 0 19 Mar 2019
Stabilize Deep ResNet with A Sharp Scaling Factor $τ$ Huishuai Zhang Da Yu Mingyang Yi Wei Chen Tie-Yan Liu 16 8 0 17 Mar 2019
Mean Field Analysis of Deep Neural Networks Justin A. Sirignano K. Spiliopoulos 6 82 0 11 Mar 2019
Why Learning of Large-Scale Neural Networks Behaves Like Convex Optimization Hui Jiang 11 8 0 06 Mar 2019
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent Jaehoon Lee Lechao Xiao S. Schoenholz Yasaman Bahri Roman Novak Jascha Narain Sohl-Dickstein Jeffrey Pennington 9 1,074 0 18 Feb 2019
Mean-field theory of two-layers neural networks: dimension-free bounds and kernel limit Song Mei Theodor Misiakiewicz Andrea Montanari MLT 22 275 0 16 Feb 2019
How do infinite width bounded norm networks look in function space? Pedro H. P. Savarese Itay Evron Daniel Soudry Nathan Srebro 6 163 0 13 Feb 2019
Towards moderate overparameterization: global convergence guarantees for training shallow neural networks Samet Oymak Mahdi Soltanolkotabi 17 319 0 12 Feb 2019
Mean Field Limit of the Learning Dynamics of Multilayer Neural Networks Phan-Minh Nguyen AI4CE 22 72 0 07 Feb 2019
Global convergence of neuron birth-death dynamics Grant M. Rotskoff Samy Jelassi Joan Bruna Eric Vanden-Eijnden 6 44 0 05 Feb 2019
Generalization Error Bounds of Gradient Descent for Learning Over-parameterized Deep ReLU Networks Yuan Cao Quanquan Gu ODL MLT AI4CE 17 155 0 04 Feb 2019
Generalisation dynamics of online learning in over-parameterised neural networks Sebastian Goldt Madhu S. Advani Andrew M. Saxe Florent Krzakala Lenka Zdeborová 25 14 0 25 Jan 2019
Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks Sanjeev Arora S. Du Wei Hu Zhiyuan Li Ruosong Wang MLT 35 962 0 24 Jan 2019
On Connected Sublevel Sets in Deep Learning Quynh N. Nguyen 11 101 0 22 Jan 2019
Context Aware Machine Learning Yun Zeng 11 3 0 10 Jan 2019
Accelerated Flow for Probability Distributions Amirhossein Taghvaei P. Mehta 40 30 0 10 Jan 2019
Analysis of a Two-Layer Neural Network via Displacement Convexity Adel Javanmard Marco Mondelli Andrea Montanari MLT 45 57 0 05 Jan 2019
On the Benefit of Width for Neural Networks: Disappearance of Bad Basins Dawei Li Tian Ding Ruoyu Sun 29 37 0 28 Dec 2018
Overparameterized Nonlinear Learning: Gradient Descent Takes the Shortest Path? Samet Oymak Mahdi Soltanolkotabi ODL 6 177 0 25 Dec 2018
On Lazy Training in Differentiable Programming Lénaïc Chizat Edouard Oyallon Francis R. Bach 21 805 0 19 Dec 2018
Gradient Descent Finds Global Minima of Deep Neural Networks S. Du J. Lee Haochuan Li Liwei Wang M. Tomizuka ODL 35 1,122 0 09 Nov 2018
Fused Gromov-Wasserstein distance for structured objects: theoretical foundations and mathematical properties David Tellez G. Litjens J. A. van der Laak R. Tavenard F. Ciompi OT 20 121 0 07 Nov 2018
A Priori Estimates of the Population Risk for Two-layer Neural Networks Weinan E Chao Ma Lei Wu 29 130 0 15 Oct 2018
Regularization Matters: Generalization and Optimization of Neural Nets v.s. their Induced Kernel Colin Wei J. Lee Qiang Liu Tengyu Ma 20 243 0 12 Oct 2018
Unbiased deep solvers for linear parametric PDEs Marc Sabate Vidales David Siska Lukasz Szpruch OOD 24 7 0 11 Oct 2018
Gradient Descent Provably Optimizes Over-parameterized Neural Networks S. Du Xiyu Zhai Barnabás Póczós Aarti Singh MLT ODL 38 1,250 0 04 Oct 2018
Stochastic Gradient Descent Learns State Equations with Nonlinear Activations Samet Oymak 11 43 0 09 Sep 2018
Trainability and Accuracy of Neural Networks: An Interacting Particle System Approach Grant M. Rotskoff Eric Vanden-Eijnden 59 118 0 02 May 2018
A Mean Field View of the Landscape of Two-Layers Neural Networks Song Mei Andrea Montanari Phan-Minh Nguyen MLT 40 849 0 18 Apr 2018
Spurious Valleys in Two-layer Neural Network Optimization Landscapes Luca Venturi Afonso S. Bandeira Joan Bruna 19 74 0 18 Feb 2018
Multiscale Sparse Microcanonical Models Joan Bruna S. Mallat 21 36 0 06 Jan 2018