Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.18399
Cited By
On the impact of activation and normalization in obtaining isometric embeddings at initialization
28 May 2023
Amir Joudaki
Hadi Daneshmand
Francis R. Bach
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On the impact of activation and normalization in obtaining isometric embeddings at initialization"
4 / 4 papers shown
Title
ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models
N. Jha
Brandon Reagen
OffRL
AI4CE
33
0
0
12 Oct 2024
Understanding and Minimising Outlier Features in Neural Network Training
Bobby He
Lorenzo Noci
Daniele Paliotta
Imanol Schlag
Thomas Hofmann
39
3
0
29 May 2024
Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping
James Martens
Andy Ballard
Guillaume Desjardins
G. Swirszcz
Valentin Dalibard
Jascha Narain Sohl-Dickstein
S. Schoenholz
88
43
0
05 Oct 2021
Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks
Lechao Xiao
Yasaman Bahri
Jascha Narain Sohl-Dickstein
S. Schoenholz
Jeffrey Pennington
227
348
0
14 Jun 2018
1