ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.07810
  4. Cited By
Depth Dependence of $μ$P Learning Rates in ReLU MLPs

Depth Dependence of μμμP Learning Rates in ReLU MLPs

13 May 2023
Samy Jelassi
Boris Hanin
Ziwei Ji
Sashank J. Reddi
Srinadh Bhojanapalli
Surinder Kumar
ArXiv (abs)PDFHTML

Papers citing "Depth Dependence of $μ$P Learning Rates in ReLU MLPs"

1 / 1 papers shown
Title
Principled Architecture-aware Scaling of Hyperparameters
Principled Architecture-aware Scaling of Hyperparameters
Wuyang Chen
Junru Wu
Zhangyang Wang
Boris Hanin
AI4CE
104
0
0
27 Feb 2024
1