Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.23971
Cited By
v1
v2 (latest)
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training
29 May 2025
William Merrill
Shane Arora
Dirk Groeneveld
Hannaneh Hajishirzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training"
Title
No papers