Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05296
Cited By
v1
v2
v3
v4 (latest)
Stronger generalization bounds for deep nets via a compression approach
14 February 2018
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
MLT
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stronger generalization bounds for deep nets via a compression approach"
50 / 444 papers shown
Title
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen
Zhichao Wang
Wei Huang
Andi Han
Taiji Suzuki
Arya Mazumdar
MLT
33
0
0
12 Jun 2025
The Universality Lens: Why Even Highly Over-Parametrized Models Learn Well
M. Feder
Ruediger Urbanke
Yaniv Fogel
27
0
0
09 Jun 2025
Overfitting has a limitation: a model-independent generalization error bound based on Rényi entropy
Atsushi Suzuki
34
0
0
30 May 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
79
0
0
29 May 2025
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
I. Harel
Yonathan Wolanowsky
Gal Vardi
Nathan Srebro
Daniel Soudry
AI4CE
105
0
0
25 May 2025
Generalization Through Growth: Hidden Dynamics Controls Depth Dependence
Sho Sonoda
Yuka Hashimoto
Isao Ishikawa
Masahiro Ikeda
53
0
0
21 May 2025
On the Importance of Gaussianizing Representations
Daniel Eftekhari
Vardan Papyan
99
0
0
01 May 2025
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
114
1
0
25 Apr 2025
Compute-Optimal LLMs Provably Generalize Better With Scale
Marc Finzi
Sanyam Kapoor
Diego Granziol
Anming Gu
Christopher De Sa
J. Zico Kolter
Andrew Gordon Wilson
150
0
0
21 Apr 2025
Identifying Key Challenges of Hardness-Based Resampling
Pawel Pukowski
Venet Osmani
85
0
0
09 Apr 2025
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models
Khoat Than
Dat Phan
BDL
AAML
VLM
108
0
0
10 Mar 2025
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability
Lijia Yu
Yibo Miao
Yifan Zhu
Xiao-Shan Gao
Lijun Zhang
98
0
0
06 Mar 2025
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
101
6
0
03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Yoonsoo Nam
Seok Hyeong Lee
Clementine Domine
Yea Chan Park
Charles London
Wonyl Choi
Niclas Goring
Seungjai Lee
AI4CE
229
1
0
28 Feb 2025
`Generalization is hallucination' through the lens of tensor completions
Liang Ze Wong
VLM
110
0
0
24 Feb 2025
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors
Romain Chor
Milad Sefidgaran
Piotr Krasnowski
307
2
0
21 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
152
9
0
21 Feb 2025
Early Stopping Against Label Noise Without Validation Data
Suqin Yuan
Lei Feng
Tongliang Liu
NoLa
296
20
0
11 Feb 2025
Kolmogorov-Arnold Fourier Networks
Jusheng Zhang
Yijia Fan
Kaitong Cai
Keze Wang
101
0
0
09 Feb 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou
Suvrit Sra
A. Yurtsever
89
0
0
28 Jan 2025
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters
Yujie Mo
Runpeng Yu
Xiaofeng Zhu
Xinchao Wang
98
3
0
02 Nov 2024
Dimensionality-induced information loss of outliers in deep neural networks
Kazuki Uematsu
Kosuke Haruki
Taiji Suzuki
Mitsuhiro Kimura
Takahiro Takimoto
Hideyuki Nakagawa
63
0
0
29 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
99
0
0
22 Oct 2024
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
92
1
0
15 Oct 2024
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
Ke Chen
Chugang Yi
Haizhao Yang
MLT
69
0
0
03 Oct 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
89
2
0
15 Aug 2024
On the Generalization of Preference Learning with DPO
Shawn Im
Yixuan Li
90
2
0
06 Aug 2024
Tightening the Evaluation of PAC Bounds Using Formal Verification Results
Thomas Walker
A. Lomuscio
70
0
0
29 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
139
5
0
03 Jul 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
98
1
0
14 Jun 2024
Minimal Communication-Cost Statistical Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
67
0
0
12 Jun 2024
Slicing Mutual Information Generalization Bounds for Neural Networks
Kimia Nadjahi
Kristjan Greenewald
Rickard Brüel-Gabrielsson
Justin Solomon
104
4
0
06 Jun 2024
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Lijia Yu
Shuang Liu
Yibo Miao
Xiao-Shan Gao
Lijun Zhang
AAML
106
7
0
02 Jun 2024
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
80
4
0
26 May 2024
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID Federated Learning
Riyasat Ohib
Bishal Thapaliya
Gintare Karolina Dziugaite
Jingyu Liu
Vince D. Calhoun
Sergey Plis
FedML
90
1
0
15 May 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
129
6
0
04 Apr 2024
On the Generalization Ability of Unsupervised Pretraining
Yuyang Deng
Junyuan Hong
Jiayu Zhou
M. Mahdavi
SSL
96
5
0
11 Mar 2024
On the Diminishing Returns of Width for Continual Learning
E. Guha
V. Lakshman
CLL
70
6
0
11 Mar 2024
Generalization of Graph Neural Networks through the Lens of Homomorphism
Shouheng Li
Dongwoo Kim
Qing Wang
90
1
0
10 Mar 2024
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
87
0
0
24 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
69
6
0
08 Feb 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
189
6
0
06 Feb 2024
Minimum Description Length and Generalization Guarantees for Representation Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
138
10
0
05 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
67
4
0
31 Jan 2024
Improving conversion rate prediction via self-supervised pre-training in online advertising
Alex Shtoff
Yohay Kaplan
Ariel Raviv
41
0
0
25 Jan 2024
Non-Vacuous Generalization Bounds for Large Language Models
Sanae Lotfi
Marc Finzi
Yilun Kuang
Tim G. J. Rudner
Micah Goldblum
Andrew Gordon Wilson
117
25
0
28 Dec 2023
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles
Hemanth Manjunatha
Panagiotis Tsiotras
59
0
0
10 Dec 2023
PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction
Apoorva Sharma
Sushant Veer
Asher Hancock
Heng Yang
Marco Pavone
Anirudha Majumdar
210
9
0
07 Dec 2023
Scalable Federated Learning for Clients with Different Input Image Sizes and Numbers of Output Categories
Shuhei Nitta
Taiji Suzuki
Albert Rodríguez Mulet
A. Yaguchi
Ryusuke Hirai
FedML
75
0
0
15 Nov 2023
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Huayi Tang
Yong Liu
165
1
0
08 Nov 2023
1
2
3
4
5
6
7
8
9
Next