ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.03530
  4. Cited By
Understanding deep learning requires rethinking generalization

Understanding deep learning requires rethinking generalization

10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
    HAI
ArXivPDFHTML

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 926 papers shown
Title
Neural networks are a priori biased towards Boolean functions with low
  entropy
Neural networks are a priori biased towards Boolean functions with low entropy
Chris Mingard
Joar Skalse
Guillermo Valle Pérez
David Martínez-Rubio
Vladimir Mikulik
A. Louis
FAtt
AI4CE
24
37
0
25 Sep 2019
Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks
Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks
Tianyu Pang
Kun Xu
Jun Zhu
AAML
28
103
0
25 Sep 2019
Deep Convolutions for In-Depth Automated Rock Typing
Deep Convolutions for In-Depth Automated Rock Typing
E. E. Baraboshkin
L. Ismailova
D. Orlov
E. Zhukovskaya
G. Kalmykov
O. V. Khotylev
E. Baraboshkin
D. Koroteev
33
84
0
23 Sep 2019
Generating Accurate Pseudo-labels in Semi-Supervised Learning and
  Avoiding Overconfident Predictions via Hermite Polynomial Activations
Generating Accurate Pseudo-labels in Semi-Supervised Learning and Avoiding Overconfident Predictions via Hermite Polynomial Activations
Vishnu Suresh Lokhande
Songwong Tasneeyapant
Abhay Venkatesh
Sathya Ravi
Vikas Singh
18
29
0
12 Sep 2019
Deep Metric Learning with Density Adaptivity
Deep Metric Learning with Density Adaptivity
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
30
11
0
09 Sep 2019
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement
  Learning
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
18
32
0
09 Sep 2019
Designing and Interpreting Probes with Control Tasks
Designing and Interpreting Probes with Control Tasks
John Hewitt
Percy Liang
32
523
0
08 Sep 2019
Deep Weakly-Supervised Learning Methods for Classification and
  Localization in Histology Images: A Survey
Deep Weakly-Supervised Learning Methods for Classification and Localization in Histology Images: A Survey
Jérôme Rony
Soufiane Belharbi
Jose Dolz
Ismail Ben Ayed
Luke McCaffrey
Eric Granger
25
70
0
08 Sep 2019
Image Captioning with Sparse Recurrent Neural Network
Image Captioning with Sparse Recurrent Neural Network
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
26
6
0
28 Aug 2019
Embracing Imperfect Datasets: A Review of Deep Learning Solutions for
  Medical Image Segmentation
Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation
Nima Tajbakhsh
Laura Jeyaseelan
Q. Li
J. Chiang
Zhihao Wu
Xiaowei Ding
33
751
0
27 Aug 2019
Transferability and Hardness of Supervised Classification Tasks
Transferability and Hardness of Supervised Classification Tasks
Anh Tran
Cuong V Nguyen
Tal Hassner
134
164
0
21 Aug 2019
Symmetric Cross Entropy for Robust Learning with Noisy Labels
Symmetric Cross Entropy for Robust Learning with Noisy Labels
Yisen Wang
Xingjun Ma
Zaiyi Chen
Yuan Luo
Jinfeng Yi
James Bailey
NoLa
39
875
0
16 Aug 2019
Needles in Haystacks: On Classifying Tiny Objects in Large Images
Needles in Haystacks: On Classifying Tiny Objects in Large Images
Nick Pawlowski
Suvrat Bhooshan
Nicolas Ballas
F. Ciompi
Ben Glocker
M. Drozdzal
24
22
0
16 Aug 2019
Regularizing CNN Transfer Learning with Randomised Regression
Regularizing CNN Transfer Learning with Randomised Regression
Yang Zhong
A. Maki
21
13
0
16 Aug 2019
The generalization error of random features regression: Precise
  asymptotics and double descent curve
The generalization error of random features regression: Precise asymptotics and double descent curve
Song Mei
Andrea Montanari
60
626
0
14 Aug 2019
Unsupervised Out-of-Distribution Detection by Maximum Classifier
  Discrepancy
Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy
Qing Yu
Kiyoharu Aizawa
OODD
19
163
0
14 Aug 2019
Visualizing the PHATE of Neural Networks
Visualizing the PHATE of Neural Networks
Scott A. Gigante
Adam S. Charles
Smita Krishnaswamy
Gal Mishne
36
37
0
07 Aug 2019
How Does Learning Rate Decay Help Modern Neural Networks?
How Does Learning Rate Decay Help Modern Neural Networks?
Kaichao You
Mingsheng Long
Jianmin Wang
Michael I. Jordan
27
4
0
05 Aug 2019
Pick-and-Learn: Automatic Quality Evaluation for Noisy-Labeled Image
  Segmentation
Pick-and-Learn: Automatic Quality Evaluation for Noisy-Labeled Image Segmentation
Haidong Zhu
Jialin Shi
Ji Wu
NoLa
24
65
0
27 Jul 2019
A Frobenius norm regularization method for convolutional kernels to
  avoid unstable gradient problem
A Frobenius norm regularization method for convolutional kernels to avoid unstable gradient problem
Pei-Chang Guo
24
5
0
25 Jul 2019
Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization
Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization
Xinyan Li
Qilong Gu
Yingxue Zhou
Tiancong Chen
A. Banerjee
ODL
42
51
0
24 Jul 2019
Residual Entropy
Residual Entropy
B. Rowe
20
7
0
08 Jul 2019
AutoCompress: An Automatic DNN Structured Pruning Framework for
  Ultra-High Compression Rates
AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates
Ning Liu
Xiaolong Ma
Zhiyuan Xu
Yanzhi Wang
Jian Tang
Jieping Ye
40
183
0
06 Jul 2019
Weight-space symmetry in deep networks gives rise to permutation
  saddles, connected by equal-loss valleys across the loss landscape
Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape
Johanni Brea
Berfin Simsek
Bernd Illing
W. Gerstner
23
55
0
05 Jul 2019
Invariant Risk Minimization
Invariant Risk Minimization
Martín Arjovsky
Léon Bottou
Ishaan Gulrajani
David Lopez-Paz
OOD
60
2,155
0
05 Jul 2019
High-Dimensional Optimization in Adaptive Random Subspaces
High-Dimensional Optimization in Adaptive Random Subspaces
Jonathan Lacotte
Mert Pilanci
Marco Pavone
27
16
0
27 Jun 2019
Benign Overfitting in Linear Regression
Benign Overfitting in Linear Regression
Peter L. Bartlett
Philip M. Long
Gábor Lugosi
Alexander Tsigler
MLT
8
762
0
26 Jun 2019
Invariance-inducing regularization using worst-case transformations
  suffices to boost accuracy and spatial robustness
Invariance-inducing regularization using worst-case transformations suffices to boost accuracy and spatial robustness
Fanny Yang
Zuowen Wang
C. Heinze-Deml
28
42
0
26 Jun 2019
Further advantages of data augmentation on convolutional neural networks
Further advantages of data augmentation on convolutional neural networks
Alex Hernández-García
Peter König
8
107
0
26 Jun 2019
Importance Estimation for Neural Network Pruning
Importance Estimation for Neural Network Pruning
Pavlo Molchanov
Arun Mallya
Stephen Tyree
I. Frosio
Jan Kautz
3DPC
36
857
0
25 Jun 2019
On the Noisy Gradient Descent that Generalizes as SGD
On the Noisy Gradient Descent that Generalizes as SGD
Jingfeng Wu
Wenqing Hu
Haoyi Xiong
Jun Huan
Vladimir Braverman
Zhanxing Zhu
MLT
24
10
0
18 Jun 2019
Empirical study of extreme overfitting points of neural networks
Empirical study of extreme overfitting points of neural networks
D. Merkulov
Ivan Oseledets
3DPC
16
7
0
14 Jun 2019
Effectiveness of Distillation Attack and Countermeasure on Neural
  Network Watermarking
Effectiveness of Distillation Attack and Countermeasure on Neural Network Watermarking
Ziqi Yang
Hung Dang
E. Chang
AAML
27
34
0
14 Jun 2019
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks
Kaifeng Lyu
Jian Li
52
322
0
13 Jun 2019
Does Learning Require Memorization? A Short Tale about a Long Tail
Does Learning Require Memorization? A Short Tale about a Long Tail
Vitaly Feldman
TDI
31
481
0
12 Jun 2019
Parameterized Structured Pruning for Deep Neural Networks
Parameterized Structured Pruning for Deep Neural Networks
Günther Schindler
Wolfgang Roth
Franz Pernkopf
Holger Froening
24
6
0
12 Jun 2019
Learning to Segment Skin Lesions from Noisy Annotations
Learning to Segment Skin Lesions from Noisy Annotations
Z. Mirikharaji
Yiqi Yan
Ghassan Hamarneh
38
77
0
10 Jun 2019
The Implicit Bias of AdaGrad on Separable Data
The Implicit Bias of AdaGrad on Separable Data
Qian Qian
Xiaoyuan Qian
37
23
0
09 Jun 2019
Disentangling neural mechanisms for perceptual grouping
Disentangling neural mechanisms for perceptual grouping
Junkyung Kim
Drew Linsley
Kalpit C. Thakkar
Thomas Serre
OCL
26
54
0
04 Jun 2019
Dimensionality compression and expansion in Deep Neural Networks
Dimensionality compression and expansion in Deep Neural Networks
Stefano Recanatesi
M. Farrell
Madhu S. Advani
Timothy Moore
Guillaume Lajoie
E. Shea-Brown
18
72
0
02 Jun 2019
The Principle of Unchanged Optimality in Reinforcement Learning
  Generalization
The Principle of Unchanged Optimality in Reinforcement Learning Generalization
A. Irpan
Xingyou Song
OffRL
30
7
0
02 Jun 2019
Are Anchor Points Really Indispensable in Label-Noise Learning?
Are Anchor Points Really Indispensable in Label-Noise Learning?
Xiaobo Xia
Tongliang Liu
N. Wang
Bo Han
Chen Gong
Gang Niu
Masashi Sugiyama
NoLa
11
369
0
01 Jun 2019
Implicit Regularization in Deep Matrix Factorization
Implicit Regularization in Deep Matrix Factorization
Sanjeev Arora
Nadav Cohen
Wei Hu
Yuping Luo
AI4CE
38
491
0
31 May 2019
On Network Design Spaces for Visual Recognition
On Network Design Spaces for Visual Recognition
Ilija Radosavovic
Justin Johnson
Saining Xie
Wan-Yen Lo
Piotr Dollár
19
134
0
30 May 2019
Generalization bounds for deep convolutional neural networks
Generalization bounds for deep convolutional neural networks
Philip M. Long
Hanie Sedghi
MLT
42
89
0
29 May 2019
Limitations of the Empirical Fisher Approximation for Natural Gradient
  Descent
Limitations of the Empirical Fisher Approximation for Natural Gradient Descent
Frederik Kunstner
Lukas Balles
Philipp Hennig
21
207
0
29 May 2019
Norm-based generalisation bounds for multi-class convolutional neural
  networks
Norm-based generalisation bounds for multi-class convolutional neural networks
Antoine Ledent
Waleed Mustafa
Yunwen Lei
Marius Kloft
18
5
0
29 May 2019
Combating Label Noise in Deep Learning Using Abstention
Combating Label Noise in Deep Learning Using Abstention
S. Thulasidasan
Tanmoy Bhattacharya
J. Bilmes
Gopinath Chennupati
J. Mohd-Yusof
NoLa
22
177
0
27 May 2019
Curriculum Loss: Robust Learning and Generalization against Label
  Corruption
Curriculum Loss: Robust Learning and Generalization against Label Corruption
Yueming Lyu
Ivor W. Tsang
NoLa
63
172
0
24 May 2019
Ensemble Model Patching: A Parameter-Efficient Variational Bayesian
  Neural Network
Ensemble Model Patching: A Parameter-Efficient Variational Bayesian Neural Network
Oscar Chang
Yuling Yao
David Williams-King
Hod Lipson
BDL
UQCV
32
8
0
23 May 2019
Previous
123...131415...171819
Next