Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.07810
Cited By
Depth Dependence of
μ
μ
μ
P Learning Rates in ReLU MLPs
13 May 2023
Samy Jelassi
Boris Hanin
Ziwei Ji
Sashank J. Reddi
Srinadh Bhojanapalli
Surinder Kumar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Depth Dependence of $μ$P Learning Rates in ReLU MLPs"
1 / 1 papers shown
Title
Principled Architecture-aware Scaling of Hyperparameters
Wuyang Chen
Junru Wu
Zhangyang Wang
Boris Hanin
AI4CE
104
0
0
27 Feb 2024
1