Theory of Deep Learning IIb: Optimization Properties of SGD

7 January 2018
Chiyuan Zhang, Q. Liao, Alexander Rakhlin, Brando Miranda, Noah Golowich, T. Poggio
ODL

Papers citing "Theory of Deep Learning IIb: Optimization Properties of SGD"

8 / 8 papers shown
Global Convergence of SGD On Two Layer Neural Nets
Pulkit Gopalani, Anirbit Mukherjee
20 Oct 2022

Multi-Objective Loss Balancing for Physics-Informed Deep Learning
Rafael Bischof, M. Kraus
PINN, AI4CE
19 Oct 2021

Noise and Fluctuation of Finite Learning Rate Stochastic Gradient Descent
Kangqiao Liu, Liu Ziyin, Masakuni Ueda
MLT
07 Dec 2020

A Random Matrix Theory Approach to Damping in Deep Learning
Diego Granziol, Nicholas P. Baskerville
AI4CE, ODL
15 Nov 2020

Orthogonal Deep Neural Networks
Kui Jia, Shuai Li, Yuxin Wen, Tongliang Liu, Dacheng Tao
15 May 2019

Deep Multi-View Learning using Neuron-Wise Correlation-Maximizing Regularizers
Kui Jia, Jiehong Lin, Mingkui Tan, Dacheng Tao
3DV
25 Apr 2019

Parameter Efficient Training of Deep Convolutional Neural Networks by Dynamic Sparse Reparameterization
Hesham Mostafa, Xin Wang
15 Feb 2019

On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar, Dheevatsa Mudigere, J. Nocedal, M. Smelyanskiy, P. T. P. Tang
ODL
15 Sep 2016