1709.04546
Normalized Direction-preserving Adam
13 September 2017
Zijun Zhang
Lin Ma
Zongpeng Li
Chuan Wu
ODL
Papers citing "Normalized Direction-preserving Adam"
6 / 6 papers shown
DEAM: Adaptive Momentum with Discriminative Weight for Stochastic Optimization
Jiyang Bai
Yuxiang Ren
Jiawei Zhang
ODL
21
1
0
25 Jul 2019
Removing the Feature Correlation Effect of Multiplicative Noise
Zijun Zhang
Yining Zhang
Zongpeng Li
13
8
0
19 Sep 2018
Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks
Jinghui Chen
Dongruo Zhou
Yiqi Tang
Ziyan Yang
Yuan Cao
Quanquan Gu
ODL
19
193
0
18 Jun 2018
Improving Generalization Performance by Switching from Adam to SGD
N. Keskar
R. Socher
ODL
41
521
0
20 Dec 2017
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
62
2,084
0
14 Nov 2017
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
308
2,890
0
15 Sep 2016