2306.00700
Cited By
v3 (latest)
On the Weight Dynamics of Deep Normalized Networks
International Conference on Machine Learning (ICML), 2023
1 June 2023
Christian H. X. Ali Mehmeti-Göpel
Michael Wand
Papers citing "On the Weight Dynamics of Deep Normalized Networks"
4 / 4 papers shown
Can Training Dynamics of Scale-Invariant Neural Networks Be Explained by the Thermodynamics of an Ideal Gas?
Ildus Sadrtdinov
E. Lobacheva
Ivan Klimov
Mikhail I. Katsnelson
Dmitry Vetrov
AI4CE
96
0
0
10 Nov 2025
Weight Decay may matter more than muP for Learning Rate Transfer in Practice
Atli Kosson
Jeremy Welborn
Yang Liu
Martin Jaggi
Xi Chen
40
1
0
21 Oct 2025
ResNets Are Deeper Than You Think
Christian H. X. Ali Mehmeti-Göpel
Michael Wand
112
0
0
17 Jun 2025
Layer-wise Update Aggregation with Recycling for Communication-Efficient Federated Learning
Jisoo Kim
Sungmin Kang
Sunwoo Lee
FedML
151
0
0
14 Mar 2025