Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2306.00700
Cited By
v1
v2
v3 (latest)
On the Weight Dynamics of Deep Normalized Networks
International Conference on Machine Learning (ICML), 2023
1 June 2023
Christian H. X. Ali Mehmeti-Göpel
Michael Wand
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"On the Weight Dynamics of Deep Normalized Networks"
3 / 3 papers shown
Title
Can Training Dynamics of Scale-Invariant Neural Networks Be Explained by the Thermodynamics of an Ideal Gas?
Ildus Sadrtdinov
E. Lobacheva
Ivan Klimov
Mikhail I. Katsnelson
Dmitry Vetrov
AI4CE
72
0
0
10 Nov 2025
Weight Decay may matter more than muP for Learning Rate Transfer in Practice
Atli Kosson
Jeremy Welborn
Yang Liu
Martin Jaggi
Xi Chen
40
1
0
21 Oct 2025
ResNets Are Deeper Than You Think
Christian H.X. Ali Mehmeti-Göpel
Michael Wand
108
0
0
17 Jun 2025
1