1802.05296
Stronger generalization bounds for deep nets via a compression approach
14 February 2018
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
MLT
AI4CE
Papers citing
"Stronger generalization bounds for deep nets via a compression approach"
50 / 440 papers shown
On the Importance of Gaussianizing Representations
Daniel Eftekhari
Vardan Papyan
31
0
0
01 May 2025
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
49
0
0
25 Apr 2025
Compute-Optimal LLMs Provably Generalize Better With Scale
Marc Finzi
Sanyam Kapoor
Diego Granziol
Anming Gu
Christopher De Sa
J. Zico Kolter
Andrew Gordon Wilson
35
0
0
21 Apr 2025
Identifying Key Challenges of Hardness-Based Resampling
Pawel Pukowski
Venet Osmani
36
0
0
09 Apr 2025
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models
Khoat Than
Dat Phan
BDL
AAML
VLM
60
0
0
10 Mar 2025
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability
Lijia Yu
Yibo Miao
Yifan Zhu
Xiao-Shan Gao
Lijun Zhang
53
0
0
06 Mar 2025
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
41
2
0
03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Yoonsoo Nam
Seok Hyeong Lee
Clementine Domine
Yea Chan Park
Charles London
Wonyl Choi
Niclas Goring
Seungjai Lee
AI4CE
38
0
0
28 Feb 2025
`Generalization is hallucination' through the lens of tensor completions
Liang Ze Wong
VLM
70
0
0
24 Feb 2025
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors
Romain Chor
Milad Sefidgaran
Piotr Krasnowski
93
1
0
21 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
75
8
0
21 Feb 2025
Early Stopping Against Label Noise Without Validation Data
Suqin Yuan
Lei Feng
Tongliang Liu
NoLa
104
17
0
11 Feb 2025
Kolmogorov-Arnold Fourier Networks
Jusheng Zhang
Yijia Fan
Kaitong Cai
Keze Wang
68
0
0
09 Feb 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou
Suvrit Sra
A. Yurtsever
34
0
0
28 Jan 2025
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters
Yujie Mo
Runpeng Yu
Xiaofeng Zhu
Xinchao Wang
46
1
0
02 Nov 2024
Dimensionality-induced information loss of outliers in deep neural networks
Kazuki Uematsu
Kosuke Haruki
Taiji Suzuki
Mitsuhiro Kimura
Takahiro Takimoto
Hideyuki Nakagawa
28
0
0
29 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
28
0
0
22 Oct 2024
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
25
0
0
15 Oct 2024
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
Ke Chen
Chugang Yi
Haizhao Yang
MLT
33
0
0
03 Oct 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
37
2
0
15 Aug 2024
On the Generalization of Preference Learning with DPO
Shawn Im
Yixuan Li
52
1
0
06 Aug 2024
Tightening the Evaluation of PAC Bounds Using Formal Verification Results
Thomas Walker
A. Lomuscio
26
0
0
29 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
57
3
0
03 Jul 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
40
0
0
14 Jun 2024
Minimal Communication-Cost Statistical Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
37
0
0
12 Jun 2024
Slicing Mutual Information Generalization Bounds for Neural Networks
Kimia Nadjahi
Kristjan Greenewald
Rickard Brüel-Gabrielsson
Justin Solomon
51
3
0
06 Jun 2024
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Lijia Yu
Shuang Liu
Yibo Miao
Xiao-Shan Gao
Lijun Zhang
AAML
36
6
0
02 Jun 2024
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
34
4
0
26 May 2024
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID Federated Learning
Riyasat Ohib
Bishal Thapaliya
Gintare Karolina Dziugaite
Jingyu Liu
Vince D. Calhoun
Sergey Plis
FedML
32
1
0
15 May 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
38
5
0
04 Apr 2024
On the Generalization Ability of Unsupervised Pretraining
Yuyang Deng
Junyuan Hong
Jiayu Zhou
M. Mahdavi
SSL
55
4
0
11 Mar 2024
On the Diminishing Returns of Width for Continual Learning
E. Guha
V. Lakshman
CLL
39
4
0
11 Mar 2024
Generalization of Graph Neural Networks through the Lens of Homomorphism
Shouheng Li
Dongwoo Kim
Qing Wang
42
1
0
10 Mar 2024
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
32
0
0
24 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
38
5
0
08 Feb 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
64
5
0
06 Feb 2024
Minimum Description Length and Generalization Guarantees for Representation Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
45
7
0
05 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
31
4
0
31 Jan 2024
Improving conversion rate prediction via self-supervised pre-training in online advertising
Alex Shtoff
Yohay Kaplan
Ariel Raviv
16
0
0
25 Jan 2024
Non-Vacuous Generalization Bounds for Large Language Models
Sanae Lotfi
Marc Finzi
Yilun Kuang
Tim G. J. Rudner
Micah Goldblum
Andrew Gordon Wilson
31
20
0
28 Dec 2023
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles
Hemanth Manjunatha
Panagiotis Tsiotras
17
0
0
10 Dec 2023
PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction
Apoorva Sharma
Sushant Veer
Asher Hancock
Heng Yang
Marco Pavone
Anirudha Majumdar
49
8
0
07 Dec 2023
Scalable Federated Learning for Clients with Different Input Image Sizes and Numbers of Output Categories
Shuhei Nitta
Taiji Suzuki
Albert Rodríguez Mulet
A. Yaguchi
Ryusuke Hirai
FedML
20
0
0
15 Nov 2023
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Huayi Tang
Yong Liu
62
1
0
08 Nov 2023
EKGNet: A 10.96μW Fully Analog Neural Network for Intra-Patient Arrhythmia Classification
B. Haghi
Lin Ma
Sahin Lale
A. Anandkumar
Azita Emami
21
0
0
24 Oct 2023
Bridging Information-Theoretic and Geometric Compression in Language Models
Emily Cheng
Corentin Kervadec
Marco Baroni
36
17
0
20 Oct 2023
Are GATs Out of Balance?
Nimrah Mustafa
Aleksandar Bojchevski
R. Burkholz
48
4
0
11 Oct 2023
Investigating the Ability of PINNs To Solve Burgers' PDE Near Finite-Time BlowUp
Dibyakanti Kumar
Anirbit Mukherjee
31
2
0
08 Oct 2023
A Primer on Bayesian Neural Networks: Review and Debates
Federico Danieli
Konstantinos Pitas
M. Vladimirova
Vincent Fortuin
BDL
AAML
56
18
0
28 Sep 2023
Deep Model Fusion: A Survey
Weishi Li
Yong Peng
Miao Zhang
Liang Ding
Han Hu
Li Shen
FedML
MoMe
41
52
0
27 Sep 2023