1611.03530
Understanding deep learning requires rethinking generalization
10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
Papers citing "Understanding deep learning requires rethinking generalization"
50 / 885 papers shown
Title
Regularization-wise double descent: Why it occurs and how to eliminate it
Fatih Yilmaz
Reinhard Heckel
30
11
0
03 Jun 2022
Dataset Distillation using Neural Feature Regression
Yongchao Zhou
E. Nezhadarya
Jimmy Ba
DD
FedML
44
149
0
01 Jun 2022
Context-based Virtual Adversarial Training for Text Classification with Noisy Labels
Do-Myoung Lee
Yeachan Kim
Chang-gyun Seo
NoLa
21
2
0
29 May 2022
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power
Binghui Li
Jikai Jin
Han Zhong
J. Hopcroft
Liwei Wang
OOD
82
27
0
27 May 2022
Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks
Zhiwei Bai
Tao Luo
Z. Xu
Yaoyu Zhang
31
4
0
26 May 2022
VeriFi: Towards Verifiable Federated Unlearning
Xiangshan Gao
Xingjun Ma
Jingyi Wang
Youcheng Sun
Bo Li
S. Ji
Peng Cheng
Jiming Chen
MU
67
46
0
25 May 2022
On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity
Vincent Szolnoky
Viktor Andersson
Balázs Kulcsár
Rebecka Jörnsten
42
5
0
25 May 2022
Compression-aware Training of Neural Networks using Frank-Wolfe
Max Zimmer
Christoph Spiegel
Sebastian Pokutta
29
9
0
24 May 2022
Randomly Initialized One-Layer Neural Networks Make Data Linearly Separable
Promit Ghosal
Srinath Mahankali
Yihang Sun
MLT
24
4
0
24 May 2022
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Kushal Tirumala
Aram H. Markosyan
Luke Zettlemoyer
Armen Aghajanyan
TDI
29
185
0
22 May 2022
Interpolating Compressed Parameter Subspaces
Siddhartha Datta
N. Shadbolt
37
5
0
19 May 2022
Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization
C. Linse
T. Martinetz
SSL
VLM
12
4
0
18 May 2022
Learn2Weight: Parameter Adaptation against Similar-domain Adversarial Attacks
Siddhartha Datta
AAML
34
4
0
15 May 2022
HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction
Xuming Hu
Shuliang Liu
Chenwei Zhang
Shuang Li
Lijie Wen
Philip S. Yu
SSL
46
39
0
04 May 2022
A Comprehensive Survey of Image Augmentation Techniques for Deep Learning
Mingle Xu
Sook Yoon
A. Fuentes
D. Park
VLM
27
397
0
03 May 2022
FedRN: Exploiting k-Reliable Neighbors Towards Robust Federated Learning
Sangmook Kim
Wonyoung Shin
Soohyuk Jang
Hwanjun Song
Se-Young Yun
34
2
0
03 May 2022
Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning
Mayee F. Chen
Daniel Y. Fu
A. Narayan
Michael Zhang
Zhao Song
Kayvon Fatahalian
Christopher Ré
SSL
32
47
0
15 Apr 2022
Nonlocal optimization of binary neural networks
Amir Khoshaman
Giuseppe Castiglione
C. Srinivasa
18
0
0
05 Apr 2022
Learning from few examples with nonlinear feature maps
I. Tyukin
Oliver J. Sutton
Alexander N. Gorban
14
1
0
31 Mar 2022
PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs
Zehao Dong
Muhan Zhang
Fuhai Li
Yixin Chen
CML
GNN
33
17
0
19 Mar 2022
Reducing Flipping Errors in Deep Neural Networks
Xiang Deng
Yun Xiao
Bo Long
Zhongfei Zhang
AAML
38
3
0
16 Mar 2022
Deep AutoAugment
Yu Zheng
Z. Zhang
Shen Yan
Mi Zhang
ViT
23
26
0
11 Mar 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
Xin Yu
Thiago Serra
Srikumar Ramalingam
Shandian Zhe
42
48
0
09 Mar 2022
Selective-Supervised Contrastive Learning with Noisy Labels
Shikun Li
Xiaobo Xia
Shiming Ge
Tongliang Liu
NoLa
24
172
0
08 Mar 2022
Generalization Through The Lens Of Leave-One-Out Error
Gregor Bachmann
Thomas Hofmann
Aurelien Lucchi
52
7
0
07 Mar 2022
Explicitising The Implicit Intrepretability of Deep Neural Networks Via Duality
Chandrashekar Lakshminarayanan
Ashutosh Kumar Singh
A. Rajkumar
AI4CE
26
1
0
01 Mar 2022
Understanding Contrastive Learning Requires Incorporating Inductive Biases
Nikunj Saunshi
Jordan T. Ash
Surbhi Goel
Dipendra Kumar Misra
Cyril Zhang
Sanjeev Arora
Sham Kakade
A. Krishnamurthy
SSL
24
109
0
28 Feb 2022
The Spectral Bias of Polynomial Neural Networks
Moulik Choraria
L. Dadi
Grigorios G. Chrysos
Julien Mairal
V. Cevher
24
18
0
27 Feb 2022
Benign Underfitting of Stochastic Gradient Descent
Tomer Koren
Roi Livni
Yishay Mansour
Uri Sherman
MLT
20
13
0
27 Feb 2022
Adversarial robustness of sparse local Lipschitz predictors
Ramchandran Muthukumar
Jeremias Sulam
AAML
32
13
0
26 Feb 2022
ASSIST: Towards Label Noise-Robust Dialogue State Tracking
Fanghua Ye
Yue Feng
Emine Yilmaz
21
21
0
26 Feb 2022
Benefit of Interpolation in Nearest Neighbor Algorithms
Yue Xing
Qifan Song
Guang Cheng
11
28
0
23 Feb 2022
On PAC-Bayesian reconstruction guarantees for VAEs
Badr-Eddine Chérief-Abdellatif
Yuyang Shi
Arnaud Doucet
Benjamin Guedj
DRL
50
17
0
23 Feb 2022
Random Feature Amplification: Feature Learning and Generalization in Neural Networks
Spencer Frei
Niladri S. Chatterji
Peter L. Bartlett
MLT
30
29
0
15 Feb 2022
Information-Theoretic Analysis of Minimax Excess Risk
Hassan Hafez-Kolahi
Behrad Moniri
S. Kasaei
17
4
0
15 Feb 2022
Generalisation and the Risk--Entropy Curve
Dominic Belcher
Antonia Marcu
Adam Prugel-Bennett
11
0
0
15 Feb 2022
On the Origins of the Block Structure Phenomenon in Neural Network Representations
Thao Nguyen
M. Raghu
Simon Kornblith
25
14
0
15 Feb 2022
Evolving Neural Networks with Optimal Balance between Information Flow and Connections Cost
A. Khalili
A. Bouchachia
14
0
0
12 Feb 2022
The no-free-lunch theorems of supervised learning
T. Sterkenburg
Peter Grünwald
FedML
24
56
0
09 Feb 2022
A Survey on Poisoning Attacks Against Supervised Machine Learning
Wenjun Qiu
AAML
28
9
0
05 Feb 2022
Learning with Neighbor Consistency for Noisy Labels
Ahmet Iscen
Jack Valmadre
Anurag Arnab
Cordelia Schmid
NoLa
41
75
0
04 Feb 2022
Non-Vacuous Generalisation Bounds for Shallow Neural Networks
Felix Biggs
Benjamin Guedj
BDL
30
26
0
03 Feb 2022
On Regularizing Coordinate-MLPs
Sameera Ramasinghe
L. MacDonald
Simon Lucey
158
5
0
01 Feb 2022
Deep Layer-wise Networks Have Closed-Form Weights
Chieh-Tsai Wu
A. Masoomi
Arthur Gretton
Jennifer Dy
29
3
0
01 Feb 2022
Datamodels: Predicting Predictions from Training Data
Andrew Ilyas
Sung Min Park
Logan Engstrom
Guillaume Leclerc
A. Madry
TDI
47
131
0
01 Feb 2022
Backdoors Stuck At The Frontdoor: Multi-Agent Backdoor Attacks That Backfire
Siddhartha Datta
N. Shadbolt
AAML
32
7
0
28 Jan 2022
Interplay between depth of neural networks and locality of target functions
Takashi Mori
Masakuni Ueda
25
0
0
28 Jan 2022
Improved Overparametrization Bounds for Global Convergence of Stochastic Gradient Descent for Shallow Neural Networks
Bartlomiej Polaczyk
J. Cyranka
ODL
33
3
0
28 Jan 2022
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
Noam Razin
Asaf Maman
Nadav Cohen
46
29
0
27 Jan 2022
PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning
Haobo Wang
Rui Xiao
Yixuan Li
Lei Feng
Gang Niu
Gang Chen
J. Zhao
VLM
49
25
0
22 Jan 2022