ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.04485
  4. Cited By
Benefits of depth in neural networks

Benefits of depth in neural networks

14 February 2016
Matus Telgarsky
ArXivPDFHTML

Papers citing "Benefits of depth in neural networks"

50 / 353 papers shown
Title
Block-Biased Mamba for Long-Range Sequence Processing
Block-Biased Mamba for Long-Range Sequence Processing
Annan Yu
N. Benjamin Erichson
Mamba
37
0
0
13 May 2025
On the Depth of Monotone ReLU Neural Networks and ICNNs
On the Depth of Monotone ReLU Neural Networks and ICNNs
Egor Bakaev
Florestan Brunck
Christoph Hertrich
Daniel Reichman
Amir Yehudayoff
26
0
0
09 May 2025
Nonlocal techniques for the analysis of deep ReLU neural network approximations
Nonlocal techniques for the analysis of deep ReLU neural network approximations
Cornelia Schneider
Mario Ullrich
Jan Vybiral
18
0
0
07 Apr 2025
On Space Folds of ReLU Neural Networks
On Space Folds of ReLU Neural Networks
Michal Lewandowski
Hamid Eghbalzadeh
Bernhard Heinzl
Raphael Pisoni
Bernhard A.Moser
MLT
73
1
0
17 Feb 2025
On the Expressiveness of Rational ReLU Neural Networks With Bounded Depth
Gennadiy Averkov
Christopher Hojny
Maximilian Merkert
81
3
0
10 Feb 2025
Free-Knots Kolmogorov-Arnold Network: On the Analysis of Spline Knots and Advancing Stability
Free-Knots Kolmogorov-Arnold Network: On the Analysis of Spline Knots and Advancing Stability
L. Zheng
W. Zhang
Lin Yue
Miao Xu
Olaf Maennel
Weitong Chen
54
1
0
17 Jan 2025
Theoretical limitations of multi-layer Transformer
Theoretical limitations of multi-layer Transformer
Lijie Chen
Binghui Peng
Hongxun Wu
AI4CE
72
6
0
04 Dec 2024
Understanding the Effect of GCN Convolutions in Regression Tasks
Understanding the Effect of GCN Convolutions in Regression Tasks
Juntong Chen
Johannes Schmidt-Hieber
Claire Donnat
Olga Klopp
GNN
29
0
0
26 Oct 2024
Towards characterizing the value of edge embeddings in Graph Neural
  Networks
Towards characterizing the value of edge embeddings in Graph Neural Networks
Dhruv Rohatgi
Tanya Marwah
Zachary Chase Lipton
Jianfeng Lu
Ankur Moitra
Andrej Risteski
AI4CE
16
0
0
13 Oct 2024
On the Expressive Power of Tree-Structured Probabilistic Circuits
On the Expressive Power of Tree-Structured Probabilistic Circuits
Lang Yin
Han Zhao
TPM
24
2
0
07 Oct 2024
Identification of Mean-Field Dynamics using Transformers
Identification of Mean-Field Dynamics using Transformers
Shiba Biswal
Karthik Elamvazhuthi
Rishi Sonthalia
AI4CE
27
1
0
06 Oct 2024
Deep Neural Networks: Multi-Classification and Universal Approximation
Deep Neural Networks: Multi-Classification and Universal Approximation
Martín Hernández
Enrique Zuazua
26
2
0
10 Sep 2024
Activation function optimization method: Learnable series linear units
  (LSLUs)
Activation function optimization method: Learnable series linear units (LSLUs)
Chuan Feng
Xi Lin
Shiping Zhu
Hongkang Shi
Maojie Tang
Hua Huang
24
0
0
28 Aug 2024
Variance reduction of diffusion model's gradients with Taylor
  approximation-based control variate
Variance reduction of diffusion model's gradients with Taylor approximation-based control variate
Paul Jeha
Will Grathwohl
Michael Riis Andersen
Carl Henrik Ek
J. Frellsen
DiffM
29
1
0
22 Aug 2024
Graph Classification with GNNs: Optimisation, Representation and
  Inductive Bias
Graph Classification with GNNs: Optimisation, Representation and Inductive Bias
P. Krishna Kumar a
H. G. Ramaswamy
24
0
0
17 Aug 2024
The Role of Temporal Hierarchy in Spiking Neural Networks
The Role of Temporal Hierarchy in Spiking Neural Networks
Filippo Moro
Pau Vilimelis Aceituno
Laura Kriener
Melika Payvand
AI4CE
32
3
0
26 Jul 2024
When Can Transformers Count to n?
When Can Transformers Count to n?
Gilad Yehudai
Haim Kaplan
Asma Ghandeharioun
Mor Geva
Amir Globerson
39
10
0
21 Jul 2024
The Role of Depth, Width, and Tree Size in Expressiveness of Deep Forest
The Role of Depth, Width, and Tree Size in Expressiveness of Deep Forest
Shen-Huan Lyu
Jin-Hui Wu
Qin-Cheng Zheng
Baoliu Ye
31
0
0
06 Jul 2024
Analytical Solution of a Three-layer Network with a Matrix Exponential
  Activation Function
Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function
Kuo Gai
Shihua Zhang
FAtt
38
0
0
02 Jul 2024
Neural Networks Trained by Weight Permutation are Universal Approximators
Neural Networks Trained by Weight Permutation are Universal Approximators
Yongqiang Cai
Gaohang Chen
Zhonghua Qiao
69
1
0
01 Jul 2024
1-Lipschitz Neural Distance Fields
1-Lipschitz Neural Distance Fields
Guillaume Coiffier
Louis Bethune
41
3
0
14 Jun 2024
Highway Value Iteration Networks
Highway Value Iteration Networks
Yuhui Wang
Weida Li
Francesco Faccio
Qingyuan Wu
Jürgen Schmidhuber
32
2
0
05 Jun 2024
Understanding Encoder-Decoder Structures in Machine Learning Using
  Information Measures
Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures
Jorge F. Silva
Victor Faraggi
Camilo Ramírez
Álvaro F. Egaña
Eduardo Pavez
17
1
0
30 May 2024
Tropical Expressivity of Neural Networks
Tropical Expressivity of Neural Networks
Shiv Bhatia
Yueqi Cao
Paul Lezeau
Anthea Monod
21
0
0
30 May 2024
Unified Universality Theorem for Deep and Shallow Joint-Group-Equivariant Machines
Unified Universality Theorem for Deep and Shallow Joint-Group-Equivariant Machines
Sho Sonoda
Yuka Hashimoto
Isao Ishikawa
Masahiro Ikeda
34
0
0
22 May 2024
Hyperplane Arrangements and Fixed Points in Iterated PWL Neural Networks
Hyperplane Arrangements and Fixed Points in Iterated PWL Neural Networks
H. Beise
MLT
19
0
0
16 May 2024
Spectral complexity of deep neural networks
Spectral complexity of deep neural networks
Simmaco Di Lillo
Domenico Marinucci
Michele Salvi
S. Vigogna
BDL
74
1
0
15 May 2024
Half-Space Feature Learning in Neural Networks
Half-Space Feature Learning in Neural Networks
Mahesh Lorik Yadav
H. G. Ramaswamy
Chandrashekar Lakshminarayanan
MLT
27
0
0
05 Apr 2024
The Real Tropical Geometry of Neural Networks
The Real Tropical Geometry of Neural Networks
Marie-Charlotte Brandenburg
Georg Loho
Guido Montúfar
54
7
0
18 Mar 2024
Linearly Constrained Weights: Reducing Activation Shift for Faster
  Training of Neural Networks
Linearly Constrained Weights: Reducing Activation Shift for Faster Training of Neural Networks
Takuro Kutsuna
LLMSV
19
1
0
08 Mar 2024
On Minimal Depth in Neural Networks
On Minimal Depth in Neural Networks
J. L. Valerdi
38
3
0
23 Feb 2024
Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Depth Separation in Norm-Bounded Infinite-Width Neural Networks
Suzanna Parkinson
Greg Ongie
Rebecca Willett
Ohad Shamir
Nathan Srebro
MDE
43
2
0
13 Feb 2024
Depth Separations in Neural Networks: Separating the Dimension from the
  Accuracy
Depth Separations in Neural Networks: Separating the Dimension from the Accuracy
Itay Safran
Daniel Reichman
Paul Valiant
53
0
0
11 Feb 2024
Locality Sensitive Sparse Encoding for Learning World Models Online
Locality Sensitive Sparse Encoding for Learning World Models Online
Zi-Yan Liu
Chao Du
Wee Sun Lee
Min-Bin Lin
KELM
CLL
OffRL
31
8
0
23 Jan 2024
Nonlinear functional regression by functional deep neural network with kernel embedding
Nonlinear functional regression by functional deep neural network with kernel embedding
Zhongjie Shi
Jun Fan
Linhao Song
Ding-Xuan Zhou
Johan A. K. Suykens
50
5
0
05 Jan 2024
Deep Radon Prior: A Fully Unsupervised Framework for Sparse-View CT
  Reconstruction
Deep Radon Prior: A Fully Unsupervised Framework for Sparse-View CT Reconstruction
Shuo Xu
Yucheng Zhang
Gang Chen
Xincheng Xiang
Peng Cong
Yuewen Sun
17
1
0
30 Dec 2023
Optimal Deep Neural Network Approximation for Korobov Functions with
  respect to Sobolev Norms
Optimal Deep Neural Network Approximation for Korobov Functions with respect to Sobolev Norms
Yahong Yang
Yulong Lu
31
3
0
08 Nov 2023
The Expressive Power of Low-Rank Adaptation
The Expressive Power of Low-Rank Adaptation
Yuchen Zeng
Kangwook Lee
28
49
0
26 Oct 2023
Topological Expressivity of ReLU Neural Networks
Topological Expressivity of ReLU Neural Networks
Ekin Ergen
Moritz Grillo
54
2
0
17 Oct 2023
Deep Ridgelet Transform: Voice with Koopman Operator Proves Universality
  of Formal Deep Networks
Deep Ridgelet Transform: Voice with Koopman Operator Proves Universality of Formal Deep Networks
Sho Sonoda
Yuka Hashimoto
Isao Ishikawa
Masahiro Ikeda
19
3
0
05 Oct 2023
Why should autoencoders work?
Why should autoencoders work?
Matthew D. Kvalheim
E.D. Sontag
21
0
0
03 Oct 2023
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across
  Language Models
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models
Zijun Wu
Yongkang Wu
Lili Mou
VLM
25
2
0
02 Oct 2023
A Primer on Bayesian Neural Networks: Review and Debates
A Primer on Bayesian Neural Networks: Review and Debates
Federico Danieli
Konstantinos Pitas
M. Vladimirova
Vincent Fortuin
BDL
AAML
56
18
0
28 Sep 2023
Minimum width for universal approximation using ReLU networks on compact
  domain
Minimum width for universal approximation using ReLU networks on compact domain
Namjun Kim
Chanho Min
Sejun Park
VLM
27
10
0
19 Sep 2023
DiT: Efficient Vision Transformers with Dynamic Token Routing
DiT: Efficient Vision Transformers with Dynamic Token Routing
Yuchen Ma
Zhengcong Fei
Junshi Huang
ViT
24
2
0
07 Aug 2023
A Distance Correlation-Based Approach to Characterize the Effectiveness
  of Recurrent Neural Networks for Time Series Forecasting
A Distance Correlation-Based Approach to Characterize the Effectiveness of Recurrent Neural Networks for Time Series Forecasting
Christopher Salazar
A. Banerjee
AI4TS
18
2
0
28 Jul 2023
How Many Neurons Does it Take to Approximate the Maximum?
How Many Neurons Does it Take to Approximate the Maximum?
Itay Safran
Daniel Reichman
Paul Valiant
31
8
0
18 Jul 2023
Machine learning for option pricing: an empirical investigation of
  network architectures
Machine learning for option pricing: an empirical investigation of network architectures
Laurens Van Mieghem
A. Papapantoleon
Jonas Papazoglou-Hennig
11
2
0
14 Jul 2023
Neural Hilbert Ladders: Multi-Layer Neural Networks in Function Space
Neural Hilbert Ladders: Multi-Layer Neural Networks in Function Space
Zhengdao Chen
28
1
0
03 Jul 2023
A Constructive Approach to Function Realization by Neural Stochastic
  Differential Equations
A Constructive Approach to Function Realization by Neural Stochastic Differential Equations
Tanya Veeravalli
Maxim Raginsky
11
0
0
01 Jul 2023
12345678
Next