Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1503.00036
Cited By
v1
v2 (latest)
Norm-Based Capacity Control in Neural Networks
27 February 2015
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Norm-Based Capacity Control in Neural Networks"
50 / 407 papers shown
Title
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen
Zhichao Wang
Wei Huang
Andi Han
Taiji Suzuki
Arya Mazumdar
MLT
20
0
0
12 Jun 2025
FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling
Sifan Wang
Zehao Dou
Tong-Rui Liu
Lu Lu
DiffM
33
0
0
09 Jun 2025
Average Calibration Losses for Reliable Uncertainty in Medical Image Segmentation
Theodore Barfoot
Luis C. Garcia-Peraza-Herrera
Samet Akcay
Ben Glocker
Tom Vercauteren
UQCV
132
0
0
04 Jun 2025
NetPress: Dynamically Generated LLM Benchmarks for Network Applications
Yajie Zhou
Jiajun Ruan
Eric S. Wang
Sadjad Fouladi
Francis Y. Yan
Kevin Hsieh
Zaoxing Liu
32
0
0
03 Jun 2025
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
Zihang Liu
Tianyu Pang
Oleg Balabanov
Chaoqun Yang
Tianjin Huang
L. Yin
Yaoqing Yang
Shiwei Liu
LRM
53
1
0
01 Jun 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
52
0
0
29 May 2025
Global Minimizers of
ℓ
p
\ell^p
ℓ
p
-Regularized Objectives Yield the Sparsest ReLU Neural Networks
Julia B. Nakhleh
Robert D. Nowak
34
0
0
27 May 2025
Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift
Gihoon Kim
Hyungjin Park
Taesup Kim
DiffM
VLM
190
0
0
26 May 2025
Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Huanran Chen
Yinpeng Dong
Zeming Wei
Yao Huang
Yichi Zhang
Hang Su
Jun Zhu
MoMe
90
1
0
23 May 2025
Architecture independent generalization bounds for overparametrized deep ReLU networks
Thomas Chen
Chun-Kai Kevin Chien
Patrícia Muñoz Ewald
Andrew G. Moore
158
0
0
08 Apr 2025
ZeroLM: Data-Free Transformer Architecture Search for Language Models
Zhen-Song Chen
Hong-Wei Ding
Xian-Jia Wang
Witold Pedrycz
96
0
0
24 Mar 2025
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models
Yinan Liang
Zehua Wang
Xiuwei Xu
Jie Zhou
Jiwen Lu
VLM
LRM
75
0
0
19 Mar 2025
A Near Complete Nonasymptotic Generalization Theory For Multilayer Neural Networks: Beyond the Bias-Variance Tradeoff
Hao Yu
Xiangyang Ji
AI4CE
75
0
0
03 Mar 2025
Regularization can make diffusion models more efficient
Mahsa Taheri
Johannes Lederer
171
0
0
13 Feb 2025
Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning
Rémy Hosseinkhan Boucher
Onofrio Semeraro
L. Mathelin
125
0
0
28 Jan 2025
GradAlign for Training-free Model Performance Inference
Yuxuan Li
Yunhui Guo
100
0
0
29 Nov 2024
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models
Yunzhe Hu
Difan Zou
Dong Xu
133
1
0
26 Nov 2024
On Generalization Bounds for Neural Networks with Low Rank Layers
Andrea Pinto
Akshay Rangamani
T. Poggio
AI4CE
123
1
0
20 Nov 2024
Layer-Adaptive State Pruning for Deep State Space Models
Minseon Gwak
Seongrok Moon
Joohwan Ko
PooGyeon Park
126
1
0
05 Nov 2024
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
82
0
0
22 Oct 2024
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
87
1
0
15 Oct 2024
On Rank-Dependent Generalisation Error Bounds for Transformers
Lan V. Truong
86
2
0
15 Oct 2024
Understanding Adversarially Robust Generalization via Weight-Curvature Index
Yuelin Xu
Xiao Zhang
AAML
61
0
0
10 Oct 2024
Deep Koopman-layered Model with Universal Property Based on Toeplitz Matrices
Yuka Hashimoto
Tomoharu Iwata
78
0
0
03 Oct 2024
A General Framework of the Consistency for Large Neural Networks
Haoran Zhan
Yingcun Xia
63
0
0
21 Sep 2024
Generalization bounds for regression and classification on adaptive covering input domains
Wen-Liang Hwang
67
0
0
29 Jul 2024
Invertible Neural Warp for NeRF
Shin-Fang Chng
Ravi Garg
Hemanth Saratchandran
Simon Lucey
86
4
0
17 Jul 2024
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning
Arthur Jacot
Seok Hoan Choi
Yuxiao Wen
AI4CE
143
2
0
08 Jul 2024
Sparse deep neural networks for nonparametric estimation in high-dimensional sparse regression
Dongya Wu
Xin Li
65
0
0
26 Jun 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
89
1
0
14 Jun 2024
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
Derek Lim
Moe Putterman
Robin Walters
Haggai Maron
Stefanie Jegelka
122
9
0
30 May 2024
Spectral Truncation Kernels: Noncommutativity in
C
∗
C^*
C
∗
-algebraic Kernel Machines
Yuka Hashimoto
Ayoub Hafid
Masahiro Ikeda
Hachem Kadri
79
2
0
28 May 2024
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
76
4
0
26 May 2024
Error Analysis of Three-Layer Neural Network Trained with PGD for Deep Ritz Method
Yuling Jiao
Yanming Lai
Yang Wang
AI4CE
35
1
0
19 May 2024
Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features
Junpeng Zhang
Qing Li
Liang Lin
Quanshi Zhang
AI4CE
131
5
0
16 May 2024
Hidden Synergy:
L
1
L_1
L
1
Weight Normalization and 1-Path-Norm Regularization
Aditya Biswas
81
1
0
29 Apr 2024
Learning with Norm Constrained, Over-parameterized, Two-layer Neural Networks
Fanghui Liu
L. Dadi
Volkan Cevher
133
2
0
29 Apr 2024
Error analysis for finite element operator learning methods for solving parametric second-order elliptic PDEs
Youngjoon Hong
Seungchan Ko
Jae Yong Lee
72
1
0
27 Apr 2024
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Benjamin Dupuis
Paul Viallard
George Deligiannidis
Umut Simsekli
129
5
0
26 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
83
5
0
21 Apr 2024
Probabilistic Lipschitzness and the Stable Rank for Comparing Explanation Models
Lachlan Simpson
Kyle Millar
A. Cheng
Cheng-Chew Lim
Hong-Gunn Chew
BDL
FAtt
94
2
0
29 Feb 2024
A unified Fourier slice method to derive ridgelet transform for a variety of depth-2 neural networks
Sho Sonoda
Isao Ishikawa
Masahiro Ikeda
116
4
0
25 Feb 2024
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
74
0
0
24 Feb 2024
Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Suzanna Parkinson
Greg Ongie
Rebecca Willett
Ohad Shamir
Nathan Srebro
MDE
82
3
0
13 Feb 2024
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
G. Buzaglo
I. Harel
Mor Shpigel Nacson
Alon Brutzkus
Nathan Srebro
Daniel Soudry
124
7
0
09 Feb 2024
PAC-Bayesian Adversarially Robust Generalization Bounds for Graph Neural Network
Tan Sun
Junhong Lin
AAML
84
3
0
06 Feb 2024
Always-Sparse Training by Growing Connections with Guided Stochastic Exploration
Mike Heddes
Narayan Srinivasa
T. Givargis
Alexandru Nicolau
267
0
0
12 Jan 2024
Why "classic" Transformers are shallow and how to make them go deep
Yueyao Yu
Yin Zhang
ViT
97
0
0
11 Dec 2023
Pathway to a fully data-driven geotechnics: lessons from materials informatics
Stephen Wu
Yu Otake
Yosuke Higo
Ikumasa Yoshida
AI4CE
59
5
0
01 Dec 2023
How do Minimum-Norm Shallow Denoisers Look in Function Space?
Chen Zeno
Greg Ongie
Yaniv Blumenfeld
Nir Weinberger
Daniel Soudry
78
8
0
12 Nov 2023
1
2
3
4
5
6
7
8
9
Next