Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.10824
Cited By
Decoupled Weight Decay for Any
p
p
p
Norm
16 April 2024
N. Outmezguine
Noam Levi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Decoupled Weight Decay for Any $p$ Norm"
3 / 3 papers shown
Title
Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries
Chris Kolb
T. Weber
Bernd Bischl
David Rügamer
109
0
0
04 Feb 2025
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks
Torsten Hoefler
Dan Alistarh
Tal Ben-Nun
Nikoli Dryden
Alexandra Peste
MQ
141
684
0
31 Jan 2021
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
231
4,460
0
23 Jan 2020
1