1802.05296
Stronger generalization bounds for deep nets via a compression approach
14 February 2018
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
MLT
AI4CE
Papers citing
"Stronger generalization bounds for deep nets via a compression approach"
50 / 444 papers shown
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDL
MLT
AI4CE
92
62
0
24 Nov 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILM
MoE
90
52
0
14 Nov 2022
Do highly over-parameterized neural networks generalize since bad solutions are rare?
Julius Martinetz
T. Martinetz
98
1
0
07 Nov 2022
Instance-Dependent Generalization Bounds via Optimal Transport
Songyan Hou
Parnian Kassraie
Anastasis Kratsios
Andreas Krause
Jonas Rothfuss
125
6
0
02 Nov 2022
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun Luo
Adam White
108
0
0
25 Oct 2022
The Curious Case of Benign Memorization
Sotiris Anagnostidis
Gregor Bachmann
Lorenzo Noci
Thomas Hofmann
AAML
148
10
0
25 Oct 2022
A PAC-Bayesian Generalization Bound for Equivariant Networks
Arash Behboodi
Gabriele Cesa
Taco S. Cohen
93
19
0
24 Oct 2022
Evolution of Neural Tangent Kernels under Benign and Adversarial Training
Noel Loo
Ramin Hasani
Alexander Amini
Daniela Rus
AAML
114
13
0
21 Oct 2022
GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization for Improved Generalization
Zhiyuan Zhang
Ruixuan Luo
Qi Su
Xueting Sun
112
13
0
13 Oct 2022
Continual task learning in natural and artificial agents
Timo Flesch
Andrew M. Saxe
Christopher Summerfield
CLL
59
26
0
10 Oct 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
86
5
0
30 Sep 2022
Neural Networks Efficiently Learn Low-Dimensional Representations with SGD
Alireza Mousavi-Hosseini
Sejun Park
M. Girotti
Ioannis Mitliagkas
Murat A. Erdogdu
MLT
386
50
0
29 Sep 2022
Why neural networks find simple solutions: the many regularizers of geometric complexity
Benoit Dherin
Michael Munn
M. Rosca
David Barrett
139
32
0
27 Sep 2022
Approximate Description Length, Covering Numbers, and VC Dimension
Amit Daniely
Gal Katzhendler
37
0
0
26 Sep 2022
Stability and Generalization Analysis of Gradient Methods for Shallow Neural Networks
Yunwen Lei
Rong Jin
Yiming Ying
MLT
110
19
0
19 Sep 2022
Pruning Neural Networks via Coresets and Convex Geometry: Towards No Assumptions
M. Tukan
Loay Mualem
Alaa Maalouf
3DPC
80
23
0
18 Sep 2022
Improving Self-supervised Learning for Out-of-distribution Task via Auxiliary Classifier
Harshita Boonlia
T. Dam
Md Meftahul Ferdaus
S. Anavatti
Ankan Mullick
OODD
66
4
0
07 Sep 2022
Overparameterization from Computational Constraints
Sanjam Garg
S. Jha
Saeed Mahloujifar
Mohammad Mahmoody
Mingyuan Wang
56
2
0
27 Aug 2022
On the generalization of learning algorithms that do not converge
N. Chandramoorthy
Andreas Loukas
Khashayar Gatmiry
Stefanie Jegelka
MLT
100
11
0
16 Aug 2022
On the Strong Correlation Between Model Invariance and Generalization
Weijian Deng
Stephen Gould
Liang Zheng
OOD
91
19
0
14 Jul 2022
A law of adversarial risk, interpolation, and label noise
Daniel Paleka
Amartya Sanyal
NoLa
AAML
113
10
0
08 Jul 2022
Training Patch Analysis and Mining Skills for Image Restoration Deep Neural Networks
Jae Woong Soh
N. Cho
38
0
0
03 Jul 2022
Max-Margin Works while Large Margin Fails: Generalization without Uniform Convergence
Margalit Glasgow
Colin Wei
Mary Wootters
Tengyu Ma
103
5
0
16 Jun 2022
Benefits of Additive Noise in Composing Classes with Bounded Capacity
A. F. Pour
H. Ashtiani
77
3
0
14 Jun 2022
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Aishwarya H. Balwani
J. Krzyston
94
2
0
14 Jun 2022
Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
94
15
0
12 Jun 2022
A Theoretical Understanding of Neural Network Compression from Sparse Linear Approximation
Wenjing Yang
G. Wang
Jie Ding
Yuhong Yang
MLT
71
7
0
11 Jun 2022
Fisher SAM: Information Geometry and Sharpness Aware Minimisation
Minyoung Kim
Da Li
S. Hu
Timothy M. Hospedales
AAML
99
72
0
10 Jun 2022
Trajectory-dependent Generalization Bounds for Deep Neural Networks via Fractional Brownian Motion
Chengli Tan
Jiang Zhang
Junmin Liu
89
1
0
09 Jun 2022
Generalization Error Bounds for Deep Neural Networks Trained by SGD
Mingze Wang
Chao Ma
49
14
0
07 Jun 2022
Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees
Haotian Ju
Dongyue Li
Hongyang R. Zhang
141
30
0
06 Jun 2022
Rate-Distortion Theoretic Bounds on Generalization Error for Distributed Learning
Romain Chor
Abdellatif Zaidi
Milad Sefidgaran
FedML
87
15
0
06 Jun 2022
Long-Tailed Learning Requires Feature Learning
T. Laurent
J. V. Brecht
Xavier Bresson
VLM
93
1
0
29 May 2022
Sharpness-Aware Training for Free
Jiawei Du
Daquan Zhou
Jiashi Feng
Vincent Y. F. Tan
Qiufeng Wang
AAML
105
96
0
27 May 2022
Generalization Bounds for Gradient Methods via Discrete and Continuous Prior
Jun Yu Li
Xu Luo
Jian Li
80
4
0
27 May 2022
Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures
Emmanuel Abbe
Samy Bengio
Elisabetta Cornacchia
Jon M. Kleinberg
Aryo Lotfi
M. Raghu
Chiyuan Zhang
MLT
77
10
0
26 May 2022
Deep Active Learning with Noise Stability
Xingjian Li
Pengkun Yang
Mingkun Xu
Xueying Zhan
Tianyang Wang
Dejing Dou
Chengzhong Xu
UQCV
82
12
0
26 May 2022
A Convergence Theory for Over-parameterized Variational Quantum Eigensolvers
Xuchen You
Shouvanik Chakrabarti
Xiaodi Wu
90
34
0
25 May 2022
Towards Size-Independent Generalization Bounds for Deep Operator Nets
Pulkit Gopalani
Sayar Karmakar
Dibyakanti Kumar
Anirbit Mukherjee
AI4CE
70
5
0
23 May 2022
Deep neural networks with dependent weights: Gaussian Process mixture limit, heavy tails, sparsity and compressibility
Hoileong Lee
Fadhel Ayed
Paul Jung
Juho Lee
Hongseok Yang
François Caron
112
10
0
17 May 2022
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
DD
100
21
0
17 May 2022
On the Generalization Mystery in Deep Learning
S. Chatterjee
Piotr Zielinski
OOD
79
35
0
18 Mar 2022
Error estimates for physics informed neural networks approximating the Navier-Stokes equations
Tim De Ryck
Ameya Dilip Jagtap
S. Mishra
PINN
136
118
0
17 Mar 2022
Confidence Dimension for Deep Learning based on Hoeffding Inequality and Relative Evaluation
Runqi Wang
Linlin Yang
Baochang Zhang
Wentao Zhu
David Doermann
Guodong Guo
52
1
0
17 Mar 2022
Approximability and Generalisation
A. J. Turner
Ata Kabán
64
0
0
15 Mar 2022
projUNN: efficient method for training deep networks with unitary matrices
B. Kiani
Randall Balestriero
Yann LeCun
S. Lloyd
118
32
0
10 Mar 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
Xin Yu
Thiago Serra
Srikumar Ramalingam
Shandian Zhe
111
49
0
09 Mar 2022
Generalization Through The Lens Of Leave-One-Out Error
Gregor Bachmann
Thomas Hofmann
Aurelien Lucchi
142
8
0
07 Mar 2022
Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms
Romain Chor
A. Gohari
Gaël Richard
Umut Simsekli
113
24
0
04 Mar 2022
Adaptive Discriminative Regularization for Visual Classification
Qingsong Zhao
Yi Wang
Shuguang Dou
Chen Gong
Yin Wang
Cairong Zhao
111
0
0
02 Mar 2022