Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.07810
Cited By
Depth Dependence of
μ
μ
μ
P Learning Rates in ReLU MLPs
13 May 2023
Samy Jelassi
Boris Hanin
Ziwei Ji
Sashank J. Reddi
Srinadh Bhojanapalli
Surinder Kumar
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Depth Dependence of $μ$P Learning Rates in ReLU MLPs"
6 / 6 papers shown
Title
Feature Learning Beyond the Edge of Stability
Dávid Terjék
MLT
46
0
0
18 Feb 2025
Scalable Optimization in the Modular Norm
Tim Large
Yang Liu
Minyoung Huh
Hyojin Bahng
Phillip Isola
Jeremy Bernstein
47
11
0
23 May 2024
Principled Architecture-aware Scaling of Hyperparameters
Wuyang Chen
Junru Wu
Zhangyang Wang
Boris Hanin
AI4CE
46
1
0
27 Feb 2024
The Feature Speed Formula: a flexible approach to scale hyper-parameters of deep neural networks
Lénaic Chizat
Praneeth Netrapalli
23
4
0
30 Nov 2023
Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks
Greg Yang
Dingli Yu
Chen Zhu
Soufiane Hayou
MLT
10
27
0
03 Oct 2023
Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit
Blake Bordelon
Lorenzo Noci
Mufan Li
Boris Hanin
Cengiz Pehlevan
32
23
0
28 Sep 2023
1