Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09639
Cited By
Why Deep Learning Generalizes
17 November 2022
Benjamin L. Badger
TDI
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Why Deep Learning Generalizes"
13 / 13 papers shown
Title
Masked Mixers for Language Generation and Retrieval
Benjamin L. Badger
102
0
0
02 Sep 2024
Depth and Representation in Vision Models
Benjamin L. Badger
SSL
VLM
FAtt
39
3
0
11 Nov 2022
On the Origin of Implicit Regularization in Stochastic Gradient Descent
Samuel L. Smith
Benoit Dherin
David Barrett
Soham De
MLT
34
203
0
28 Jan 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
593
40,961
0
22 Oct 2020
Implicit Gradient Regularization
David Barrett
Benoit Dherin
69
150
0
23 Sep 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
439
42,393
0
03 Dec 2019
Implicit Regularization of Stochastic Gradient Descent in Natural Language Processing: Observations and Implications
Deren Lei
Zichen Sun
Yijun Xiao
William Yang Wang
119
14
0
01 Nov 2018
Implicit Regularization in Deep Learning
Behnam Neyshabur
50
146
0
06 Sep 2017
Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes
Lei Wu
Zhanxing Zhu
E. Weinan
ODL
62
221
0
30 Jun 2017
A Closer Look at Memorization in Deep Networks
Devansh Arpit
Stanislaw Jastrzebski
Nicolas Ballas
David M. Krueger
Emmanuel Bengio
...
Tegan Maharaj
Asja Fischer
Aaron Courville
Yoshua Bengio
Simon Lacoste-Julien
TDI
120
1,817
0
16 Jun 2017
Understanding deep learning requires rethinking generalization
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
334
4,625
0
10 Nov 2016
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.7K
150,006
0
22 Dec 2014
In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
AI4CE
88
657
0
20 Dec 2014
1