v1v2v3v4 (latest)

Stronger generalization bounds for deep nets via a compression approach

14 February 2018

Papers citing "Stronger generalization bounds for deep nets via a compression approach"

50 / 444 papers shown

Title
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel Yilan Chen Zhichao Wang Wei Huang Andi Han Taiji Suzuki Arya Mazumdar MLT 33 0 0 12 Jun 2025
The Universality Lens: Why Even Highly Over-Parametrized Models Learn Well M. Feder Ruediger Urbanke Yaniv Fogel 27 0 0 09 Jun 2025
Overfitting has a limitation: a model-independent generalization error bound based on Rényi entropy Atsushi Suzuki 34 0 0 30 May 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs Liangkai Hang Junjie Yao Zhiwei Bai Tianyi Chen Yang Chen ... Feiyu Xiong Y. Zhang Weinan E Hongkang Yang Zhi-hai Xu LRM 79 0 0 29 May 2025
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes I. Harel Yonathan Wolanowsky Gal Vardi Nathan Srebro Daniel Soudry AI4CE 105 0 0 25 May 2025
Generalization Through Growth: Hidden Dynamics Controls Depth Dependence Sho Sonoda Yuka Hashimoto Isao Ishikawa Masahiro Ikeda 53 0 0 21 May 2025
On the Importance of Gaussianizing Representations Daniel Eftekhari Vardan Papyan 99 0 0 01 May 2025
Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior Romain Chor Abdellatif Zaidi Piotr Krasnowski 114 1 0 25 Apr 2025
Compute-Optimal LLMs Provably Generalize Better With Scale Marc Finzi Sanyam Kapoor Diego Granziol Anming Gu Christopher De Sa J. Zico Kolter Andrew Gordon Wilson 150 0 0 21 Apr 2025
Identifying Key Challenges of Hardness-Based Resampling Pawel Pukowski Venet Osmani 85 0 0 09 Apr 2025
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models Khoat Than Dat Phan BDL AAML VLM 108 0 0 10 Mar 2025
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability Lijia Yu Yibo Miao Yifan Zhu Xiao-Shan Gao Lijun Zhang 98 0 0 06 Mar 2025
Deep Learning is Not So Mysterious or Different Andrew Gordon Wilson 101 6 0 03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking) Yoonsoo Nam Seok Hyeong Lee Clementine Domine Yea Chan Park Charles London Wonyl Choi Niclas Goring Seungjai Lee AI4CE 229 1 0 28 Feb 2025
`Generalization is hallucination' through the lens of tensor completions Liang Ze Wong VLM 110 0 0 24 Feb 2025
Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors Romain Chor Milad Sefidgaran Piotr Krasnowski 307 2 0 21 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions? Tatsuya Hiraoka Kentaro Inui MILM 152 9 0 21 Feb 2025
Early Stopping Against Label Noise Without Validation Data Suqin Yuan Lei Feng Tongliang Liu NoLa 296 20 0 11 Feb 2025
Kolmogorov-Arnold Fourier Networks Jusheng Zhang Yijia Fan Kaitong Cai Keze Wang 101 0 0 09 Feb 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture Yikun Hou Suvrit Sra A. Yurtsever 89 0 0 28 Jan 2025
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters Yujie Mo Runpeng Yu Xiaofeng Zhu Xinchao Wang 98 3 0 02 Nov 2024
Dimensionality-induced information loss of outliers in deep neural networks Kazuki Uematsu Kosuke Haruki Taiji Suzuki Mitsuhiro Kimura Takahiro Takimoto Hideyuki Nakagawa 63 0 0 29 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes Julius Martinetz C. Linse Thomas Martinetz 99 0 0 22 Oct 2024
The Fair Language Model Paradox Andrea Pinto Tomer Galanti Randall Balestriero 92 1 0 15 Oct 2024
Towards Better Generalization: Weight Decay Induces Low-rank Bias for Neural Networks Ke Chen Chugang Yi Haizhao Yang MLT 69 0 0 03 Oct 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion Adi Haviv Shahar Sarfaty Uri Y. Hacohen N. Elkin-Koren Roi Livni Amit H. Bermano 89 2 0 15 Aug 2024
On the Generalization of Preference Learning with DPO Shawn Im Yixuan Li 90 2 0 06 Aug 2024
Tightening the Evaluation of PAC Bounds Using Formal Verification Results Thomas Walker A. Lomuscio 70 0 0 29 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning Bac Nguyen Stefan Uhlich Fabien Cardinaux Lukas Mauch Marzieh Edraki Aaron Courville OODD CLL VLM 139 5 0 03 Jul 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions? Weijie Tu Weijian Deng Liang Zheng Tom Gedeon 98 1 0 14 Jun 2024
Minimal Communication-Cost Statistical Learning Romain Chor Abdellatif Zaidi Piotr Krasnowski 67 0 0 12 Jun 2024
Slicing Mutual Information Generalization Bounds for Neural Networks Kimia Nadjahi Kristjan Greenewald Rickard Brüel-Gabrielsson Justin Solomon 104 4 0 06 Jun 2024
Generalization Bound and New Algorithm for Clean-Label Backdoor Attack Lijia Yu Shuang Liu Yibo Miao Xiao-Shan Gao Lijun Zhang AAML 106 7 0 02 Jun 2024
How many samples are needed to train a deep neural network? Pegah Golestaneh Mahsa Taheri Johannes Lederer 80 4 0 26 May 2024
Unmasking Efficiency: Learning Salient Sparse Models in Non-IID Federated Learning Riyasat Ohib Bishal Thapaliya Gintare Karolina Dziugaite Jingyu Liu Vince D. Calhoun Sergey Plis FedML 90 1 0 15 May 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks Haiyun He Christina Lee Yu 129 6 0 04 Apr 2024
On the Generalization Ability of Unsupervised Pretraining Yuyang Deng Junyuan Hong Jiayu Zhou M. Mahdavi SSL 96 5 0 11 Mar 2024
On the Diminishing Returns of Width for Continual Learning E. Guha V. Lakshman CLL 70 6 0 11 Mar 2024
Generalization of Graph Neural Networks through the Lens of Homomorphism Shouheng Li Dongwoo Kim Qing Wang 90 1 0 10 Mar 2024
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning Shuyu Yin Qixuan Zhou Fei Wen Tao Luo 87 0 0 24 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks Abhishek Panigrahi Nikunj Saunshi Kaifeng Lyu Sobhan Miryoosefi Sashank J. Reddi Satyen Kale Sanjiv Kumar 69 6 0 08 Feb 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods Akira Ito Masanori Yamada Atsutoshi Kumagai MoMe 189 6 0 06 Feb 2024
Minimum Description Length and Generalization Guarantees for Representation Learning Romain Chor Abdellatif Zaidi Piotr Krasnowski 138 10 0 05 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression Dong Chen Ning Liu Yichen Zhu Zhengping Che Rui Ma Fachao Zhang Xiaofeng Mou Yi Chang Jian Tang 67 4 0 31 Jan 2024
Improving conversion rate prediction via self-supervised pre-training in online advertising Alex Shtoff Yohay Kaplan Ariel Raviv 41 0 0 25 Jan 2024
Non-Vacuous Generalization Bounds for Large Language Models Sanae Lotfi Marc Finzi Yilun Kuang Tim G. J. Rudner Micah Goldblum Andrew Gordon Wilson 117 25 0 28 Dec 2023
Beyond One Model Fits All: Ensemble Deep Learning for Autonomous Vehicles Hemanth Manjunatha Panagiotis Tsiotras 59 0 0 10 Dec 2023
PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction Apoorva Sharma Sushant Veer Asher Hancock Heng Yang Marco Pavone Anirudha Majumdar 210 9 0 07 Dec 2023
Scalable Federated Learning for Clients with Different Input Image Sizes and Numbers of Output Categories Shuhei Nitta Taiji Suzuki Albert Rodríguez Mulet A. Yaguchi Ryusuke Hirai FedML 75 0 0 15 Nov 2023
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications Huayi Tang Yong Liu 165 1 0 08 Nov 2023