ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1503.00036
  4. Cited By
Norm-Based Capacity Control in Neural Networks
v1v2 (latest)

Norm-Based Capacity Control in Neural Networks

27 February 2015
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
ArXiv (abs)PDFHTML

Papers citing "Norm-Based Capacity Control in Neural Networks"

50 / 407 papers shown
Title
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen
Zhichao Wang
Wei Huang
Andi Han
Taiji Suzuki
Arya Mazumdar
MLT
20
0
0
12 Jun 2025
FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling
FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling
Sifan Wang
Zehao Dou
Tong-Rui Liu
Lu Lu
DiffM
33
0
0
09 Jun 2025
Average Calibration Losses for Reliable Uncertainty in Medical Image Segmentation
Average Calibration Losses for Reliable Uncertainty in Medical Image Segmentation
Theodore Barfoot
Luis C. Garcia-Peraza-Herrera
Samet Akcay
Ben Glocker
Tom Vercauteren
UQCV
132
0
0
04 Jun 2025
NetPress: Dynamically Generated LLM Benchmarks for Network Applications
NetPress: Dynamically Generated LLM Benchmarks for Network Applications
Yajie Zhou
Jiajun Ruan
Eric S. Wang
Sadjad Fouladi
Francis Y. Yan
Kevin Hsieh
Zaoxing Liu
32
0
0
03 Jun 2025
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
Zihang Liu
Tianyu Pang
Oleg Balabanov
Chaoqun Yang
Tianjin Huang
L. Yin
Yaoqing Yang
Shiwei Liu
LRM
53
1
0
01 Jun 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
52
0
0
29 May 2025
Global Minimizers of $\ell^p$-Regularized Objectives Yield the Sparsest ReLU Neural Networks
Global Minimizers of ℓp\ell^pℓp-Regularized Objectives Yield the Sparsest ReLU Neural Networks
Julia B. Nakhleh
Robert D. Nowak
34
0
0
27 May 2025
Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift
Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift
Gihoon Kim
Hyungjin Park
Taesup Kim
DiffMVLM
190
0
0
26 May 2025
Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Huanran Chen
Yinpeng Dong
Zeming Wei
Yao Huang
Yichi Zhang
Hang Su
Jun Zhu
MoMe
90
1
0
23 May 2025
Architecture independent generalization bounds for overparametrized deep ReLU networks
Architecture independent generalization bounds for overparametrized deep ReLU networks
Thomas Chen
Chun-Kai Kevin Chien
Patrícia Muñoz Ewald
Andrew G. Moore
158
0
0
08 Apr 2025
ZeroLM: Data-Free Transformer Architecture Search for Language Models
ZeroLM: Data-Free Transformer Architecture Search for Language Models
Zhen-Song Chen
Hong-Wei Ding
Xian-Jia Wang
Witold Pedrycz
96
0
0
24 Mar 2025
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models
Yinan Liang
Zehua Wang
Xiuwei Xu
Jie Zhou
Jiwen Lu
VLMLRM
75
0
0
19 Mar 2025
A Near Complete Nonasymptotic Generalization Theory For Multilayer Neural Networks: Beyond the Bias-Variance Tradeoff
Hao Yu
Xiangyang Ji
AI4CE
75
0
0
03 Mar 2025
Regularization can make diffusion models more efficient
Regularization can make diffusion models more efficient
Mahsa Taheri
Johannes Lederer
171
0
0
13 Feb 2025
Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning
Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning
Rémy Hosseinkhan Boucher
Onofrio Semeraro
L. Mathelin
125
0
0
28 Jan 2025
GradAlign for Training-free Model Performance Inference
GradAlign for Training-free Model Performance Inference
Yuxuan Li
Yunhui Guo
100
0
0
29 Nov 2024
An In-depth Investigation of Sparse Rate Reduction in Transformer-like
  Models
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models
Yunzhe Hu
Difan Zou
Dong Xu
133
1
0
26 Nov 2024
On Generalization Bounds for Neural Networks with Low Rank Layers
On Generalization Bounds for Neural Networks with Low Rank Layers
Andrea Pinto
Akshay Rangamani
T. Poggio
AI4CE
123
1
0
20 Nov 2024
Layer-Adaptive State Pruning for Deep State Space Models
Layer-Adaptive State Pruning for Deep State Space Models
Minseon Gwak
Seongrok Moon
Joohwan Ko
PooGyeon Park
126
1
0
05 Nov 2024
Rethinking generalization of classifiers in separable classes scenarios
  and over-parameterized regimes
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
Julius Martinetz
C. Linse
Thomas Martinetz
82
0
0
22 Oct 2024
The Fair Language Model Paradox
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
87
1
0
15 Oct 2024
On Rank-Dependent Generalisation Error Bounds for Transformers
On Rank-Dependent Generalisation Error Bounds for Transformers
Lan V. Truong
86
2
0
15 Oct 2024
Understanding Adversarially Robust Generalization via Weight-Curvature
  Index
Understanding Adversarially Robust Generalization via Weight-Curvature Index
Yuelin Xu
Xiao Zhang
AAML
61
0
0
10 Oct 2024
Deep Koopman-layered Model with Universal Property Based on Toeplitz Matrices
Deep Koopman-layered Model with Universal Property Based on Toeplitz Matrices
Yuka Hashimoto
Tomoharu Iwata
78
0
0
03 Oct 2024
A General Framework of the Consistency for Large Neural Networks
A General Framework of the Consistency for Large Neural Networks
Haoran Zhan
Yingcun Xia
63
0
0
21 Sep 2024
Generalization bounds for regression and classification on adaptive
  covering input domains
Generalization bounds for regression and classification on adaptive covering input domains
Wen-Liang Hwang
67
0
0
29 Jul 2024
Invertible Neural Warp for NeRF
Invertible Neural Warp for NeRF
Shin-Fang Chng
Ravi Garg
Hemanth Saratchandran
Simon Lucey
86
4
0
17 Jul 2024
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning
How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning
Arthur Jacot
Seok Hoan Choi
Yuxiao Wen
AI4CE
143
2
0
08 Jul 2024
Sparse deep neural networks for nonparametric estimation in
  high-dimensional sparse regression
Sparse deep neural networks for nonparametric estimation in high-dimensional sparse regression
Dongya Wu
Xin Li
65
0
0
26 Jun 2024
What Does Softmax Probability Tell Us about Classifiers Ranking Across
  Diverse Test Conditions?
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
89
1
0
14 Jun 2024
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
Derek Lim
Moe Putterman
Robin Walters
Haggai Maron
Stefanie Jegelka
122
9
0
30 May 2024
Spectral Truncation Kernels: Noncommutativity in $C^*$-algebraic Kernel Machines
Spectral Truncation Kernels: Noncommutativity in C∗C^*C∗-algebraic Kernel Machines
Yuka Hashimoto
Ayoub Hafid
Masahiro Ikeda
Hachem Kadri
79
2
0
28 May 2024
How many samples are needed to train a deep neural network?
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
76
4
0
26 May 2024
Error Analysis of Three-Layer Neural Network Trained with PGD for Deep
  Ritz Method
Error Analysis of Three-Layer Neural Network Trained with PGD for Deep Ritz Method
Yuling Jiao
Yanming Lai
Yang Wang
AI4CE
35
1
0
19 May 2024
Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN
  Learning Over-Fitted Features
Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features
Junpeng Zhang
Qing Li
Liang Lin
Quanshi Zhang
AI4CE
131
5
0
16 May 2024
Hidden Synergy: $L_1$ Weight Normalization and 1-Path-Norm
  Regularization
Hidden Synergy: L1L_1L1​ Weight Normalization and 1-Path-Norm Regularization
Aditya Biswas
81
1
0
29 Apr 2024
Learning with Norm Constrained, Over-parameterized, Two-layer Neural
  Networks
Learning with Norm Constrained, Over-parameterized, Two-layer Neural Networks
Fanghui Liu
L. Dadi
Volkan Cevher
133
2
0
29 Apr 2024
Error analysis for finite element operator learning methods for solving
  parametric second-order elliptic PDEs
Error analysis for finite element operator learning methods for solving parametric second-order elliptic PDEs
Youngjoon Hong
Seungchan Ko
Jae Yong Lee
72
1
0
27 Apr 2024
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets
Benjamin Dupuis
Paul Viallard
George Deligiannidis
Umut Simsekli
129
5
0
26 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision
  Transformers
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
83
5
0
21 Apr 2024
Probabilistic Lipschitzness and the Stable Rank for Comparing
  Explanation Models
Probabilistic Lipschitzness and the Stable Rank for Comparing Explanation Models
Lachlan Simpson
Kyle Millar
A. Cheng
Cheng-Chew Lim
Hong-Gunn Chew
BDLFAtt
94
2
0
29 Feb 2024
A unified Fourier slice method to derive ridgelet transform for a
  variety of depth-2 neural networks
A unified Fourier slice method to derive ridgelet transform for a variety of depth-2 neural networks
Sho Sonoda
Isao Ishikawa
Masahiro Ikeda
116
4
0
25 Feb 2024
A priori Estimates for Deep Residual Network in Continuous-time
  Reinforcement Learning
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
74
0
0
24 Feb 2024
Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Suzanna Parkinson
Greg Ongie
Rebecca Willett
Ohad Shamir
Nathan Srebro
MDE
82
3
0
13 Feb 2024
How Uniform Random Weights Induce Non-uniform Bias: Typical
  Interpolating Neural Networks Generalize with Narrow Teachers
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
G. Buzaglo
I. Harel
Mor Shpigel Nacson
Alon Brutzkus
Nathan Srebro
Daniel Soudry
124
7
0
09 Feb 2024
PAC-Bayesian Adversarially Robust Generalization Bounds for Graph Neural
  Network
PAC-Bayesian Adversarially Robust Generalization Bounds for Graph Neural Network
Tan Sun
Junhong Lin
AAML
84
3
0
06 Feb 2024
Always-Sparse Training by Growing Connections with Guided Stochastic Exploration
Always-Sparse Training by Growing Connections with Guided Stochastic Exploration
Mike Heddes
Narayan Srinivasa
T. Givargis
Alexandru Nicolau
267
0
0
12 Jan 2024
Why "classic" Transformers are shallow and how to make them go deep
Why "classic" Transformers are shallow and how to make them go deep
Yueyao Yu
Yin Zhang
ViT
97
0
0
11 Dec 2023
Pathway to a fully data-driven geotechnics: lessons from materials
  informatics
Pathway to a fully data-driven geotechnics: lessons from materials informatics
Stephen Wu
Yu Otake
Yosuke Higo
Ikumasa Yoshida
AI4CE
59
5
0
01 Dec 2023
How do Minimum-Norm Shallow Denoisers Look in Function Space?
How do Minimum-Norm Shallow Denoisers Look in Function Space?
Chen Zeno
Greg Ongie
Yaniv Blumenfeld
Nir Weinberger
Daniel Soudry
78
8
0
12 Nov 2023
123456789
Next