Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.16900
Cited By
v1
v2
v3
v4
v5 (latest)
Power-Law Decay Loss for Large Language Model Finetuning: A Theory Perspective
22 May 2025
Jintian Shao
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Power-Law Decay Loss for Large Language Model Finetuning: A Theory Perspective"
1 / 1 papers shown
Title
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
605
20,418
0
23 Oct 2019
1