Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.03530
Cited By
Understanding deep learning requires rethinking generalization
10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding deep learning requires rethinking generalization"
50 / 913 papers shown
Title
Stochastic Gradient/Mirror Descent: Minimax Optimality and Implicit Regularization
Navid Azizan
B. Hassibi
16
61
0
04 Jun 2018
Minnorm training: an algorithm for training over-parameterized deep neural networks
Yamini Bansal
Madhu S. Advani
David D. Cox
Andrew M. Saxe
ODL
15
18
0
03 Jun 2018
Understanding Batch Normalization
Johan Bjorck
Carla P. Gomes
B. Selman
Kilian Q. Weinberger
18
593
0
01 Jun 2018
Optimal ridge penalty for real-world high-dimensional data can be zero or negative due to the implicit ridge regularization
D. Kobak
Jonathan Lomond
Benoit Sanchez
30
89
0
28 May 2018
Investigating Label Noise Sensitivity of Convolutional Neural Networks for Fine Grained Audio Signal Labelling
Rainer Kelz
Gerhard Widmer
NoLa
16
4
0
28 May 2018
Topological Data Analysis of Decision Boundaries with Application to Model Selection
K. Ramamurthy
Kush R. Varshney
Krishnan Mody
17
40
0
25 May 2018
Adding One Neuron Can Eliminate All Bad Local Minima
Shiyu Liang
Ruoyu Sun
J. Lee
R. Srikant
34
89
0
22 May 2018
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
W. Wen
Yandan Wang
Feng Yan
Cong Xu
Chunpeng Wu
Yiran Chen
H. Li
24
50
0
21 May 2018
Tropical Geometry of Deep Neural Networks
Liwen Zhang
Gregory Naitzat
Lek-Heng Lim
29
136
0
18 May 2018
Mad Max: Affine Spline Insights into Deep Learning
Randall Balestriero
Richard Baraniuk
AI4CE
31
78
0
17 May 2018
Learnable PINs: Cross-Modal Embeddings for Person Identity
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
SSL
26
140
0
02 May 2018
Boosting Self-Supervised Learning via Knowledge Transfer
M. Noroozi
Ananth Vinjimoor
Paolo Favaro
Hamed Pirsiavash
SSL
212
292
0
01 May 2018
SHADE: Information Based Regularization for Deep Learning
Michael Blot
Thomas Robert
Nicolas Thome
Matthieu Cord
32
12
0
29 Apr 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
19
397
0
24 Apr 2018
Performance Impact Caused by Hidden Bias of Training Data for Recognizing Textual Entailment
Masatoshi Tsuchiya
21
160
0
22 Apr 2018
Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels
Bo Han
Quanming Yao
Xingrui Yu
Gang Niu
Miao Xu
Weihua Hu
Ivor Tsang
Masashi Sugiyama
NoLa
58
2,028
0
18 Apr 2018
The Limits and Potentials of Deep Learning for Robotics
Niko Sünderhauf
Oliver Brock
Walter J. Scheirer
R. Hadsell
Dieter Fox
...
B. Upcroft
Pieter Abbeel
Wolfram Burgard
Michael Milford
Peter Corke
15
522
0
18 Apr 2018
Non-Vacuous Generalization Bounds at the ImageNet Scale: A PAC-Bayesian Compression Approach
Wenda Zhou
Victor Veitch
Morgane Austern
Ryan P. Adams
Peter Orbanz
44
209
0
16 Apr 2018
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
Cenk Baykal
Lucas Liebenwein
Igor Gilitschenski
Dan Feldman
Daniela Rus
22
79
0
15 Apr 2018
Analysis on the Nonlinear Dynamics of Deep Neural Networks: Topological Entropy and Chaos
Husheng Li
19
11
0
03 Apr 2018
Joint Optimization Framework for Learning with Noisy Labels
Daiki Tanaka
Daiki Ikami
T. Yamasaki
Kiyoharu Aizawa
NoLa
39
702
0
30 Mar 2018
Learning to Reweight Examples for Robust Deep Learning
Mengye Ren
Wenyuan Zeng
Binh Yang
R. Urtasun
OOD
NoLa
57
1,410
0
24 Mar 2018
Technical Report: When Does Machine Learning FAIL? Generalized Transferability for Evasion and Poisoning Attacks
Octavian Suciu
R. Marginean
Yigitcan Kaya
Hal Daumé
Tudor Dumitras
AAML
28
283
0
19 Mar 2018
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
Marco Baity-Jesi
Levent Sagun
Mario Geiger
S. Spigler
Gerard Ben Arous
C. Cammarota
Yann LeCun
M. Wyart
Giulio Biroli
AI4CE
36
113
0
19 Mar 2018
On the importance of single directions for generalization
Ari S. Morcos
David Barrett
Neil C. Rabinowitz
M. Botvinick
15
328
0
19 Mar 2018
Deep Component Analysis via Alternating Direction Neural Networks
Calvin Murdock
Ming-Fang Chang
Simon Lucey
BDL
27
20
0
16 Mar 2018
A Kernel Theory of Modern Data Augmentation
Tri Dao
Albert Gu
Alexander J. Ratner
Virginia Smith
Christopher De Sa
Christopher Ré
21
190
0
16 Mar 2018
Essentially No Barriers in Neural Network Energy Landscape
Felix Dräxler
K. Veschgini
M. Salmhofer
Fred Hamprecht
MoMe
20
424
0
02 Mar 2018
Var-CNN: A Data-Efficient Website Fingerprinting Attack Based on Deep Learning
Sanjit Bhat
David Lu
Albert Kwon
S. Devadas
AAML
18
190
0
28 Feb 2018
Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle
Rana Ali Amjad
Bernhard C. Geiger
35
195
0
27 Feb 2018
Scalable Private Learning with PATE
Nicolas Papernot
Shuang Song
Ilya Mironov
A. Raghunathan
Kunal Talwar
Ulfar Erlingsson
27
606
0
24 Feb 2018
A Walk with SGD
Chen Xing
Devansh Arpit
Christos Tsirigotis
Yoshua Bengio
27
118
0
24 Feb 2018
Deep learning algorithm for data-driven simulation of noisy dynamical system
K. Yeo
Igor Melnyk
AI4TS
21
93
0
22 Feb 2018
Characterizing Implicit Bias in Terms of Optimization Geometry
Suriya Gunasekar
Jason D. Lee
Daniel Soudry
Nathan Srebro
AI4CE
35
399
0
22 Feb 2018
The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks
Nicholas Carlini
Chang-rui Liu
Ulfar Erlingsson
Jernej Kos
D. Song
56
1,113
0
22 Feb 2018
L2-Nonexpansive Neural Networks
Haifeng Qian
M. Wegman
22
74
0
22 Feb 2018
The Description Length of Deep Learning Models
Léonard Blier
Yann Ollivier
29
95
0
20 Feb 2018
Do deep nets really need weight decay and dropout?
Alex Hernández-García
Peter König
14
27
0
20 Feb 2018
The Role of Information Complexity and Randomization in Representation Learning
Matías Vera
Pablo Piantanida
L. Rey Vega
42
14
0
14 Feb 2018
Stronger generalization bounds for deep nets via a compression approach
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
MLT
AI4CE
23
630
0
14 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
32
389
0
13 Feb 2018
Deep Neural Networks Learn Non-Smooth Functions Effectively
Masaaki Imaizumi
Kenji Fukumizu
18
123
0
13 Feb 2018
Towards Understanding the Generalization Bias of Two Layer Convolutional Linear Classifiers with Gradient Descent
Yifan Wu
Barnabás Póczós
Aarti Singh
MLT
24
8
0
13 Feb 2018
Learning Compact Neural Networks with Regularization
Samet Oymak
MLT
41
39
0
05 Feb 2018
Semi-Supervised Convolutional Neural Networks for Human Activity Recognition
Mingzhi Zeng
Tong Yu
Tianlin Li
Le T. Nguyen
Ole J. Mengshoel
Ian Lane
SSL
HAI
11
62
0
22 Jan 2018
Faster gaze prediction with dense networks and Fisher pruning
Lucas Theis
I. Korshunova
Alykhan Tejani
Ferenc Huszár
26
204
0
17 Jan 2018
Fix your classifier: the marginal value of training the last weight layer
Elad Hoffer
Itay Hubara
Daniel Soudry
35
101
0
14 Jan 2018
Approximation beats concentration? An approximation view on inference with smooth radial kernels
M. Belkin
34
69
0
10 Jan 2018
Boundary Optimizing Network (BON)
Marco Singh
A. Pai
17
0
0
08 Jan 2018
Theory of Deep Learning IIb: Optimization Properties of SGD
Chiyuan Zhang
Q. Liao
Alexander Rakhlin
Brando Miranda
Noah Golowich
T. Poggio
ODL
28
71
0
07 Jan 2018
Previous
1
2
3
...
16
17
18
19
Next