Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.04485
Cited By
Benefits of depth in neural networks
14 February 2016
Matus Telgarsky
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Benefits of depth in neural networks"
50 / 353 papers shown
Title
Block-Biased Mamba for Long-Range Sequence Processing
Annan Yu
N. Benjamin Erichson
Mamba
37
0
0
13 May 2025
On the Depth of Monotone ReLU Neural Networks and ICNNs
Egor Bakaev
Florestan Brunck
Christoph Hertrich
Daniel Reichman
Amir Yehudayoff
26
0
0
09 May 2025
Nonlocal techniques for the analysis of deep ReLU neural network approximations
Cornelia Schneider
Mario Ullrich
Jan Vybiral
18
0
0
07 Apr 2025
On Space Folds of ReLU Neural Networks
Michal Lewandowski
Hamid Eghbalzadeh
Bernhard Heinzl
Raphael Pisoni
Bernhard A.Moser
MLT
73
1
0
17 Feb 2025
On the Expressiveness of Rational ReLU Neural Networks With Bounded Depth
Gennadiy Averkov
Christopher Hojny
Maximilian Merkert
81
3
0
10 Feb 2025
Free-Knots Kolmogorov-Arnold Network: On the Analysis of Spline Knots and Advancing Stability
L. Zheng
W. Zhang
Lin Yue
Miao Xu
Olaf Maennel
Weitong Chen
54
1
0
17 Jan 2025
Theoretical limitations of multi-layer Transformer
Lijie Chen
Binghui Peng
Hongxun Wu
AI4CE
72
6
0
04 Dec 2024
Understanding the Effect of GCN Convolutions in Regression Tasks
Juntong Chen
Johannes Schmidt-Hieber
Claire Donnat
Olga Klopp
GNN
29
0
0
26 Oct 2024
Towards characterizing the value of edge embeddings in Graph Neural Networks
Dhruv Rohatgi
Tanya Marwah
Zachary Chase Lipton
Jianfeng Lu
Ankur Moitra
Andrej Risteski
AI4CE
16
0
0
13 Oct 2024
On the Expressive Power of Tree-Structured Probabilistic Circuits
Lang Yin
Han Zhao
TPM
24
2
0
07 Oct 2024
Identification of Mean-Field Dynamics using Transformers
Shiba Biswal
Karthik Elamvazhuthi
Rishi Sonthalia
AI4CE
27
1
0
06 Oct 2024
Deep Neural Networks: Multi-Classification and Universal Approximation
Martín Hernández
Enrique Zuazua
26
2
0
10 Sep 2024
Activation function optimization method: Learnable series linear units (LSLUs)
Chuan Feng
Xi Lin
Shiping Zhu
Hongkang Shi
Maojie Tang
Hua Huang
24
0
0
28 Aug 2024
Variance reduction of diffusion model's gradients with Taylor approximation-based control variate
Paul Jeha
Will Grathwohl
Michael Riis Andersen
Carl Henrik Ek
J. Frellsen
DiffM
29
1
0
22 Aug 2024
Graph Classification with GNNs: Optimisation, Representation and Inductive Bias
P. Krishna Kumar a
H. G. Ramaswamy
24
0
0
17 Aug 2024
The Role of Temporal Hierarchy in Spiking Neural Networks
Filippo Moro
Pau Vilimelis Aceituno
Laura Kriener
Melika Payvand
AI4CE
32
3
0
26 Jul 2024
When Can Transformers Count to n?
Gilad Yehudai
Haim Kaplan
Asma Ghandeharioun
Mor Geva
Amir Globerson
39
10
0
21 Jul 2024
The Role of Depth, Width, and Tree Size in Expressiveness of Deep Forest
Shen-Huan Lyu
Jin-Hui Wu
Qin-Cheng Zheng
Baoliu Ye
31
0
0
06 Jul 2024
Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function
Kuo Gai
Shihua Zhang
FAtt
38
0
0
02 Jul 2024
Neural Networks Trained by Weight Permutation are Universal Approximators
Yongqiang Cai
Gaohang Chen
Zhonghua Qiao
69
1
0
01 Jul 2024
1-Lipschitz Neural Distance Fields
Guillaume Coiffier
Louis Bethune
41
3
0
14 Jun 2024
Highway Value Iteration Networks
Yuhui Wang
Weida Li
Francesco Faccio
Qingyuan Wu
Jürgen Schmidhuber
32
2
0
05 Jun 2024
Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures
Jorge F. Silva
Victor Faraggi
Camilo Ramírez
Álvaro F. Egaña
Eduardo Pavez
17
1
0
30 May 2024
Tropical Expressivity of Neural Networks
Shiv Bhatia
Yueqi Cao
Paul Lezeau
Anthea Monod
21
0
0
30 May 2024
Unified Universality Theorem for Deep and Shallow Joint-Group-Equivariant Machines
Sho Sonoda
Yuka Hashimoto
Isao Ishikawa
Masahiro Ikeda
34
0
0
22 May 2024
Hyperplane Arrangements and Fixed Points in Iterated PWL Neural Networks
H. Beise
MLT
19
0
0
16 May 2024
Spectral complexity of deep neural networks
Simmaco Di Lillo
Domenico Marinucci
Michele Salvi
S. Vigogna
BDL
74
1
0
15 May 2024
Half-Space Feature Learning in Neural Networks
Mahesh Lorik Yadav
H. G. Ramaswamy
Chandrashekar Lakshminarayanan
MLT
27
0
0
05 Apr 2024
The Real Tropical Geometry of Neural Networks
Marie-Charlotte Brandenburg
Georg Loho
Guido Montúfar
54
7
0
18 Mar 2024
Linearly Constrained Weights: Reducing Activation Shift for Faster Training of Neural Networks
Takuro Kutsuna
LLMSV
19
1
0
08 Mar 2024
On Minimal Depth in Neural Networks
J. L. Valerdi
38
3
0
23 Feb 2024
Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Suzanna Parkinson
Greg Ongie
Rebecca Willett
Ohad Shamir
Nathan Srebro
MDE
43
2
0
13 Feb 2024
Depth Separations in Neural Networks: Separating the Dimension from the Accuracy
Itay Safran
Daniel Reichman
Paul Valiant
53
0
0
11 Feb 2024
Locality Sensitive Sparse Encoding for Learning World Models Online
Zi-Yan Liu
Chao Du
Wee Sun Lee
Min-Bin Lin
KELM
CLL
OffRL
31
8
0
23 Jan 2024
Nonlinear functional regression by functional deep neural network with kernel embedding
Zhongjie Shi
Jun Fan
Linhao Song
Ding-Xuan Zhou
Johan A. K. Suykens
50
5
0
05 Jan 2024
Deep Radon Prior: A Fully Unsupervised Framework for Sparse-View CT Reconstruction
Shuo Xu
Yucheng Zhang
Gang Chen
Xincheng Xiang
Peng Cong
Yuewen Sun
17
1
0
30 Dec 2023
Optimal Deep Neural Network Approximation for Korobov Functions with respect to Sobolev Norms
Yahong Yang
Yulong Lu
31
3
0
08 Nov 2023
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
28
49
0
26 Oct 2023
Topological Expressivity of ReLU Neural Networks
Ekin Ergen
Moritz Grillo
54
2
0
17 Oct 2023
Deep Ridgelet Transform: Voice with Koopman Operator Proves Universality of Formal Deep Networks
Sho Sonoda
Yuka Hashimoto
Isao Ishikawa
Masahiro Ikeda
19
3
0
05 Oct 2023
Why should autoencoders work?
Matthew D. Kvalheim
E.D. Sontag
21
0
0
03 Oct 2023
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models
Zijun Wu
Yongkang Wu
Lili Mou
VLM
25
2
0
02 Oct 2023
A Primer on Bayesian Neural Networks: Review and Debates
Federico Danieli
Konstantinos Pitas
M. Vladimirova
Vincent Fortuin
BDL
AAML
56
18
0
28 Sep 2023
Minimum width for universal approximation using ReLU networks on compact domain
Namjun Kim
Chanho Min
Sejun Park
VLM
27
10
0
19 Sep 2023
DiT: Efficient Vision Transformers with Dynamic Token Routing
Yuchen Ma
Zhengcong Fei
Junshi Huang
ViT
24
2
0
07 Aug 2023
A Distance Correlation-Based Approach to Characterize the Effectiveness of Recurrent Neural Networks for Time Series Forecasting
Christopher Salazar
A. Banerjee
AI4TS
18
2
0
28 Jul 2023
How Many Neurons Does it Take to Approximate the Maximum?
Itay Safran
Daniel Reichman
Paul Valiant
31
8
0
18 Jul 2023
Machine learning for option pricing: an empirical investigation of network architectures
Laurens Van Mieghem
A. Papapantoleon
Jonas Papazoglou-Hennig
11
2
0
14 Jul 2023
Neural Hilbert Ladders: Multi-Layer Neural Networks in Function Space
Zhengdao Chen
28
1
0
03 Jul 2023
A Constructive Approach to Function Realization by Neural Stochastic Differential Equations
Tanya Veeravalli
Maxim Raginsky
11
0
0
01 Jul 2023
1
2
3
4
5
6
7
8
Next