ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05296
  4. Cited By
Stronger generalization bounds for deep nets via a compression approach

Stronger generalization bounds for deep nets via a compression approach

14 February 2018
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
    MLT
    AI4CE
ArXivPDFHTML

Papers citing "Stronger generalization bounds for deep nets via a compression approach"

50 / 440 papers shown
Title
On the Importance of Gaussianizing Representations
On the Importance of Gaussianizing Representations
Daniel Eftekhari
Vardan Papyan
31
0
0
01 May 2025
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
49
0
0
25 Apr 2025
Compute-Optimal LLMs Provably Generalize Better With Scale
Compute-Optimal LLMs Provably Generalize Better With Scale
Marc Finzi
Sanyam Kapoor
Diego Granziol
Anming Gu
Christopher De Sa
J. Zico Kolter
Andrew Gordon Wilson
35
0
0
21 Apr 2025
Identifying Key Challenges of Hardness-Based Resampling
Identifying Key Challenges of Hardness-Based Resampling
Pawel Pukowski
Venet Osmani
36
0
0
09 Apr 2025
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models
Khoat Than
Dat Phan
BDL
AAML
VLM
60
0
0
10 Mar 2025
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability
Lijia Yu
Yibo Miao
Yifan Zhu
Xiao-Shan Gao
Lijun Zhang
53
0
0
06 Mar 2025
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
41
2
0
03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Yoonsoo Nam
Seok Hyeong Lee
Clementine Domine
Yea Chan Park
Charles London
Wonyl Choi
Niclas Goring
Seungjai Lee
AI4CE
38
0
0
28 Feb 2025
`Generalization is hallucination' through the lens of tensor completions
`Generalization is hallucination' through the lens of tensor completions
Liang Ze Wong
VLM
70
0
0
24 Feb 2025
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors
Romain Chor
Milad Sefidgaran
Piotr Krasnowski
91
1
0
21 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
75
6
0
21 Feb 2025
Early Stopping Against Label Noise Without Validation Data
Early Stopping Against Label Noise Without Validation Data
Suqin Yuan
Lei Feng
Tongliang Liu
NoLa
104
16
0
11 Feb 2025
Kolmogorov-Arnold Fourier Networks
Kolmogorov-Arnold Fourier Networks
Jusheng Zhang
Yijia Fan
Kaitong Cai
Keze Wang
68
0
0
09 Feb 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou
Suvrit Sra
A. Yurtsever
34
0
0
28 Jan 2025
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks
  with Dual Adapters
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters
Yujie Mo
Runpeng Yu
Xiaofeng Zhu
Xinchao Wang
43
1
0
02 Nov 2024
Dimensionality-induced information loss of outliers in deep neural
  networks
Dimensionality-induced information loss of outliers in deep neural networks
Kazuki Uematsu
Kosuke Haruki
Taiji Suzuki
Mitsuhiro Kimura
Takahiro Takimoto
Hideyuki Nakagawa
28
0
0
29 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios
  and over-parameterized regimes
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
26
0
0
22 Oct 2024
The Fair Language Model Paradox
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
25
0
0
15 Oct 2024
Towards Better Generalization: Weight Decay Induces Low-rank Bias for
  Neural Networks
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks
Ke Chen
Chugang Yi
Haizhao Yang
MLT
33
0
0
03 Oct 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in
  Stable Diffusion
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
37
2
0
15 Aug 2024
On the Generalization of Preference Learning with DPO
On the Generalization of Preference Learning with DPO
Shawn Im
Yixuan Li
52
1
0
06 Aug 2024
Tightening the Evaluation of PAC Bounds Using Formal Verification
  Results
Tightening the Evaluation of PAC Bounds Using Formal Verification Results
Thomas Walker
A. Lomuscio
26
0
0
29 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
57
3
0
03 Jul 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across
  Diverse Test Conditions?
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
40
0
0
14 Jun 2024
Minimal Communication-Cost Statistical Learning
Minimal Communication-Cost Statistical Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
37
0
0
12 Jun 2024
Slicing Mutual Information Generalization Bounds for Neural Networks
Slicing Mutual Information Generalization Bounds for Neural Networks
Kimia Nadjahi
Kristjan Greenewald
Rickard Brüel-Gabrielsson
Justin Solomon
51
3
0
06 Jun 2024
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Lijia Yu
Shuang Liu
Yibo Miao
Xiao-Shan Gao
Lijun Zhang
AAML
34
5
0
02 Jun 2024
How many samples are needed to train a deep neural network?
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
34
4
0
26 May 2024
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID
  Federated Learning
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID Federated Learning
Riyasat Ohib
Bishal Thapaliya
Gintare Karolina Dziugaite
Jingyu Liu
Vince D. Calhoun
Sergey Plis
FedML
32
1
0
15 May 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
35
4
0
04 Apr 2024
On the Generalization Ability of Unsupervised Pretraining
On the Generalization Ability of Unsupervised Pretraining
Yuyang Deng
Junyuan Hong
Jiayu Zhou
M. Mahdavi
SSL
55
4
0
11 Mar 2024
On the Diminishing Returns of Width for Continual Learning
On the Diminishing Returns of Width for Continual Learning
E. Guha
V. Lakshman
CLL
39
4
0
11 Mar 2024
Generalization of Graph Neural Networks through the Lens of Homomorphism
Generalization of Graph Neural Networks through the Lens of Homomorphism
Shouheng Li
Dongwoo Kim
Qing Wang
42
1
0
10 Mar 2024
A priori Estimates for Deep Residual Network in Continuous-time
  Reinforcement Learning
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
32
0
0
24 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
38
5
0
08 Feb 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
64
5
0
06 Feb 2024
Minimum Description Length and Generalization Guarantees for
  Representation Learning
Minimum Description Length and Generalization Guarantees for Representation Learning
Romain Chor
Abdellatif Zaidi
Piotr Krasnowski
45
7
0
05 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model
  Compression
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
31
4
0
31 Jan 2024
Improving conversion rate prediction via self-supervised pre-training in
  online advertising
Improving conversion rate prediction via self-supervised pre-training in online advertising
Alex Shtoff
Yohay Kaplan
Ariel Raviv
16
0
0
25 Jan 2024
Non-Vacuous Generalization Bounds for Large Language Models
Non-Vacuous Generalization Bounds for Large Language Models
Sanae Lotfi
Marc Finzi
Yilun Kuang
Tim G. J. Rudner
Micah Goldblum
Andrew Gordon Wilson
31
20
0
28 Dec 2023
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous
  Vehicles
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles
Hemanth Manjunatha
Panagiotis Tsiotras
17
0
0
10 Dec 2023
PAC-Bayes Generalization Certificates for Learned Inductive Conformal
  Prediction
PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction
Apoorva Sharma
Sushant Veer
Asher Hancock
Heng Yang
Marco Pavone
Anirudha Majumdar
49
8
0
07 Dec 2023
Scalable Federated Learning for Clients with Different Input Image Sizes
  and Numbers of Output Categories
Scalable Federated Learning for Clients with Different Input Image Sizes and Numbers of Output Categories
Shuhei Nitta
Taiji Suzuki
Albert Rodríguez Mulet
A. Yaguchi
Ryusuke Hirai
FedML
20
0
0
15 Nov 2023
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
Huayi Tang
Yong Liu
62
1
0
08 Nov 2023
EKGNet: A 10.96μW Fully Analog Neural Network for Intra-Patient
  Arrhythmia Classification
EKGNet: A 10.96μW Fully Analog Neural Network for Intra-Patient Arrhythmia Classification
B. Haghi
Lin Ma
Sahin Lale
A. Anandkumar
Azita Emami
16
0
0
24 Oct 2023
Bridging Information-Theoretic and Geometric Compression in Language
  Models
Bridging Information-Theoretic and Geometric Compression in Language Models
Emily Cheng
Corentin Kervadec
Marco Baroni
34
17
0
20 Oct 2023
Are GATs Out of Balance?
Are GATs Out of Balance?
Nimrah Mustafa
Aleksandar Bojchevski
R. Burkholz
48
4
0
11 Oct 2023
Investigating the Ability of PINNs To Solve Burgers' PDE Near
  Finite-Time BlowUp
Investigating the Ability of PINNs To Solve Burgers' PDE Near Finite-Time BlowUp
Dibyakanti Kumar
Anirbit Mukherjee
31
2
0
08 Oct 2023
A Primer on Bayesian Neural Networks: Review and Debates
A Primer on Bayesian Neural Networks: Review and Debates
Federico Danieli
Konstantinos Pitas
M. Vladimirova
Vincent Fortuin
BDL
AAML
56
18
0
28 Sep 2023
Deep Model Fusion: A Survey
Deep Model Fusion: A Survey
Weishi Li
Yong Peng
Miao Zhang
Liang Ding
Han Hu
Li Shen
FedML
MoMe
39
52
0
27 Sep 2023
123456789
Next