ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05296
  4. Cited By
Stronger generalization bounds for deep nets via a compression approach
v1v2v3v4 (latest)

Stronger generalization bounds for deep nets via a compression approach

14 February 2018
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
    MLTAI4CE
ArXiv (abs)PDFHTML

Papers citing "Stronger generalization bounds for deep nets via a compression approach"

50 / 444 papers shown
Title
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen
Zhichao Wang
Wei Huang
Andi Han
Taiji Suzuki
Arya Mazumdar
MLT
33
0
0
12 Jun 2025
The Universality Lens: Why Even Highly Over-Parametrized Models Learn Well
M. Feder
Ruediger Urbanke
Yaniv Fogel
27
0
0
09 Jun 2025
Overfitting has a limitation: a model-independent generalization error bound based on Rényi entropy
Overfitting has a limitation: a model-independent generalization error bound based on Rényi entropy
Atsushi Suzuki
34
0
0
30 May 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
79
0
0
29 May 2025
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
I. Harel
Yonathan Wolanowsky
Gal Vardi
Nathan Srebro
Daniel Soudry
AI4CE
105
0
0
25 May 2025
Generalization Through Growth: Hidden Dynamics Controls Depth Dependence
Generalization Through Growth: Hidden Dynamics Controls Depth Dependence
Sho Sonoda
Yuka Hashimoto
Isao Ishikawa
Masahiro Ikeda
53
0
0
21 May 2025
On the Importance of Gaussianizing Representations
On the Importance of Gaussianizing Representations
Daniel Eftekhari
Vardan Papyan
99
0
0
01 May 2025
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
114
1
0
25 Apr 2025
Compute-Optimal LLMs Provably Generalize Better With Scale
Compute-Optimal LLMs Provably Generalize Better With Scale
Marc Finzi
Sanyam Kapoor
Diego Granziol
Anming Gu
Christopher De Sa
J. Zico Kolter
Andrew Gordon Wilson
150
0
0
21 Apr 2025
Identifying Key Challenges of Hardness-Based Resampling
Identifying Key Challenges of Hardness-Based Resampling
Pawel Pukowski
Venet Osmani
85
0
0
09 Apr 2025
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models
Khoat Than
Dat Phan
BDLAAMLVLM
108
0
0
10 Mar 2025
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability
Lijia Yu
Yibo Miao
Yifan Zhu
Xiao-Shan Gao
Lijun Zhang
98
0
0
06 Mar 2025
Deep Learning is Not So Mysterious or Different
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
101
6
0
03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Yoonsoo Nam
Seok Hyeong Lee
Clementine Domine
Yea Chan Park
Charles London
Wonyl Choi
Niclas Goring
Seungjai Lee
AI4CE
229
1
0
28 Feb 2025
`Generalization is hallucination' through the lens of tensor completions
`Generalization is hallucination' through the lens of tensor completions
Liang Ze Wong
VLM
110
0
0
24 Feb 2025
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors
Romain Chor
Milad Sefidgaran
Piotr Krasnowski
307
2
0
21 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
152
9
0
21 Feb 2025
Early Stopping Against Label Noise Without Validation Data
Early Stopping Against Label Noise Without Validation Data
Suqin Yuan
Lei Feng
Tongliang Liu
NoLa
296
20
0
11 Feb 2025
Kolmogorov-Arnold Fourier Networks
Kolmogorov-Arnold Fourier Networks
Jusheng Zhang
Yijia Fan
Kaitong Cai
Keze Wang
101
0
0
09 Feb 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou
Suvrit Sra
A. Yurtsever
89
0
0
28 Jan 2025
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks
  with Dual Adapters
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters
Yujie Mo
Runpeng Yu
Xiaofeng Zhu
Xinchao Wang
98
3
0
02 Nov 2024
Dimensionality-induced information loss of outliers in deep neural
  networks
Dimensionality-induced information loss of outliers in deep neural networks
Kazuki Uematsu
Kosuke Haruki
Taiji Suzuki
Mitsuhiro Kimura
Takahiro Takimoto
Hideyuki Nakagawa
63
0
0
29 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios
  and over-parameterized regimes
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
99
0
0
22 Oct 2024
The Fair Language Model Paradox
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
92
1
0
15 Oct 2024
Towards Better Generalization: Weight Decay Induces Low-rank Bias for
  Neural Networks
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
Ke Chen
Chugang Yi
Haizhao Yang
MLT
69
0
0
03 Oct 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in
  Stable Diffusion
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
89
2
0
15 Aug 2024
On the Generalization of Preference Learning with DPO
On the Generalization of Preference Learning with DPO
Shawn Im
Yixuan Li
90
2
0
06 Aug 2024
Tightening the Evaluation of PAC Bounds Using Formal Verification
  Results
Tightening the Evaluation of PAC Bounds Using Formal Verification Results
Thomas Walker
A. Lomuscio
70
0
0
29 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODDCLLVLM
139
5
0
03 Jul 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across
  Diverse Test Conditions?
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
98
1
0
14 Jun 2024
Minimal Communication-Cost Statistical Learning
Minimal Communication-Cost Statistical Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
67
0
0
12 Jun 2024
Slicing Mutual Information Generalization Bounds for Neural Networks
Slicing Mutual Information Generalization Bounds for Neural Networks
Kimia Nadjahi
Kristjan Greenewald
Rickard Brüel-Gabrielsson
Justin Solomon
104
4
0
06 Jun 2024
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Lijia Yu
Shuang Liu
Yibo Miao
Xiao-Shan Gao
Lijun Zhang
AAML
106
7
0
02 Jun 2024
How many samples are needed to train a deep neural network?
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
80
4
0
26 May 2024
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID
  Federated Learning
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID Federated Learning
Riyasat Ohib
Bishal Thapaliya
Gintare Karolina Dziugaite
Jingyu Liu
Vince D. Calhoun
Sergey Plis
FedML
90
1
0
15 May 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
129
6
0
04 Apr 2024
On the Generalization Ability of Unsupervised Pretraining
On the Generalization Ability of Unsupervised Pretraining
Yuyang Deng
Junyuan Hong
Jiayu Zhou
M. Mahdavi
SSL
96
5
0
11 Mar 2024
On the Diminishing Returns of Width for Continual Learning
On the Diminishing Returns of Width for Continual Learning
E. Guha
V. Lakshman
CLL
70
6
0
11 Mar 2024
Generalization of Graph Neural Networks through the Lens of Homomorphism
Generalization of Graph Neural Networks through the Lens of Homomorphism
Shouheng Li
Dongwoo Kim
Qing Wang
90
1
0
10 Mar 2024
A priori Estimates for Deep Residual Network in Continuous-time
  Reinforcement Learning
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
87
0
0
24 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
69
6
0
08 Feb 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
189
6
0
06 Feb 2024
Minimum Description Length and Generalization Guarantees for
  Representation Learning
Minimum Description Length and Generalization Guarantees for Representation Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
138
10
0
05 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model
  Compression
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
67
4
0
31 Jan 2024
Improving conversion rate prediction via self-supervised pre-training in
  online advertising
Improving conversion rate prediction via self-supervised pre-training in online advertising
Alex Shtoff
Yohay Kaplan
Ariel Raviv
41
0
0
25 Jan 2024
Non-Vacuous Generalization Bounds for Large Language Models
Non-Vacuous Generalization Bounds for Large Language Models
Sanae Lotfi
Marc Finzi
Yilun Kuang
Tim G. J. Rudner
Micah Goldblum
Andrew Gordon Wilson
117
25
0
28 Dec 2023
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous
  Vehicles
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles
Hemanth Manjunatha
Panagiotis Tsiotras
59
0
0
10 Dec 2023
PAC-Bayes Generalization Certificates for Learned Inductive Conformal
  Prediction
PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction
Apoorva Sharma
Sushant Veer
Asher Hancock
Heng Yang
Marco Pavone
Anirudha Majumdar
210
9
0
07 Dec 2023
Scalable Federated Learning for Clients with Different Input Image Sizes
  and Numbers of Output Categories
Scalable Federated Learning for Clients with Different Input Image Sizes and Numbers of Output Categories
Shuhei Nitta
Taiji Suzuki
Albert Rodríguez Mulet
A. Yaguchi
Ryusuke Hirai
FedML
75
0
0
15 Nov 2023
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Huayi Tang
Yong Liu
165
1
0
08 Nov 2023
123456789
Next