AdamP: Slowing Down the Slowdown for Momentum Optimizers on
  Scale-invariant Weights

AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights

Papers citing "AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights"

36 / 36 papers shown
Title
Layer Normalization
Layer Normalization
338
10,467
0
21 Jul 2016

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.