arXiv:2103.15345
FixNorm: Dissecting Weight Decay for Training Deep Neural Networks
29 March 2021
Yucong Zhou
Yunxiao Sun
Zhaobai Zhong
Papers citing "FixNorm: Dissecting Weight Decay for Training Deep Neural Networks"
3 / 3 papers shown
Title
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
102
803
0
14 Apr 2022
Bag of Tricks for Image Classification with Convolutional Neural Networks
Tong He
Zhi-Li Zhang
Hang Zhang
Zhongyue Zhang
Junyuan Xie
Mu Li
224
1,400
0
04 Dec 2018
A disciplined approach to neural network hyper-parameters: Part 1 -- learning rate, batch size, momentum, and weight decay
L. Smith
208
1,020
0
26 Mar 2018