Benefits of depth in neural networks

14 February 2016

Papers citing "Benefits of depth in neural networks"

50 / 353 papers shown

Title
Block-Biased Mamba for Long-Range Sequence Processing Annan Yu N. Benjamin Erichson Mamba 37 0 0 13 May 2025
On the Depth of Monotone ReLU Neural Networks and ICNNs Egor Bakaev Florestan Brunck Christoph Hertrich Daniel Reichman Amir Yehudayoff 26 0 0 09 May 2025
Nonlocal techniques for the analysis of deep ReLU neural network approximations Cornelia Schneider Mario Ullrich Jan Vybiral 18 0 0 07 Apr 2025
On Space Folds of ReLU Neural Networks Michal Lewandowski Hamid Eghbalzadeh Bernhard Heinzl Raphael Pisoni Bernhard A.Moser MLT 73 1 0 17 Feb 2025
On the Expressiveness of Rational ReLU Neural Networks With Bounded Depth Gennadiy Averkov Christopher Hojny Maximilian Merkert 81 3 0 10 Feb 2025
Free-Knots Kolmogorov-Arnold Network: On the Analysis of Spline Knots and Advancing Stability L. Zheng W. Zhang Lin Yue Miao Xu Olaf Maennel Weitong Chen 54 1 0 17 Jan 2025
Theoretical limitations of multi-layer Transformer Lijie Chen Binghui Peng Hongxun Wu AI4CE 72 6 0 04 Dec 2024
Understanding the Effect of GCN Convolutions in Regression Tasks Juntong Chen Johannes Schmidt-Hieber Claire Donnat Olga Klopp GNN 29 0 0 26 Oct 2024
Towards characterizing the value of edge embeddings in Graph Neural Networks Dhruv Rohatgi Tanya Marwah Zachary Chase Lipton Jianfeng Lu Ankur Moitra Andrej Risteski AI4CE 16 0 0 13 Oct 2024
On the Expressive Power of Tree-Structured Probabilistic Circuits Lang Yin Han Zhao TPM 24 2 0 07 Oct 2024
Identification of Mean-Field Dynamics using Transformers Shiba Biswal Karthik Elamvazhuthi Rishi Sonthalia AI4CE 27 1 0 06 Oct 2024
Deep Neural Networks: Multi-Classification and Universal Approximation Martín Hernández Enrique Zuazua 26 2 0 10 Sep 2024
Activation function optimization method: Learnable series linear units (LSLUs) Chuan Feng Xi Lin Shiping Zhu Hongkang Shi Maojie Tang Hua Huang 24 0 0 28 Aug 2024
Variance reduction of diffusion model's gradients with Taylor approximation-based control variate Paul Jeha Will Grathwohl Michael Riis Andersen Carl Henrik Ek J. Frellsen DiffM 29 1 0 22 Aug 2024
Graph Classification with GNNs: Optimisation, Representation and Inductive Bias P. Krishna Kumar a H. G. Ramaswamy 24 0 0 17 Aug 2024
The Role of Temporal Hierarchy in Spiking Neural Networks Filippo Moro Pau Vilimelis Aceituno Laura Kriener Melika Payvand AI4CE 32 3 0 26 Jul 2024
When Can Transformers Count to n? Gilad Yehudai Haim Kaplan Asma Ghandeharioun Mor Geva Amir Globerson 39 10 0 21 Jul 2024
The Role of Depth, Width, and Tree Size in Expressiveness of Deep Forest Shen-Huan Lyu Jin-Hui Wu Qin-Cheng Zheng Baoliu Ye 31 0 0 06 Jul 2024
Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function Kuo Gai Shihua Zhang FAtt 38 0 0 02 Jul 2024
Neural Networks Trained by Weight Permutation are Universal Approximators Yongqiang Cai Gaohang Chen Zhonghua Qiao 69 1 0 01 Jul 2024
1-Lipschitz Neural Distance Fields Guillaume Coiffier Louis Bethune 41 3 0 14 Jun 2024
Highway Value Iteration Networks Yuhui Wang Weida Li Francesco Faccio Qingyuan Wu Jürgen Schmidhuber 32 2 0 05 Jun 2024
Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures Jorge F. Silva Victor Faraggi Camilo Ramírez Álvaro F. Egaña Eduardo Pavez 17 1 0 30 May 2024
Tropical Expressivity of Neural Networks Shiv Bhatia Yueqi Cao Paul Lezeau Anthea Monod 21 0 0 30 May 2024
Unified Universality Theorem for Deep and Shallow Joint-Group-Equivariant Machines Sho Sonoda Yuka Hashimoto Isao Ishikawa Masahiro Ikeda 34 0 0 22 May 2024
Hyperplane Arrangements and Fixed Points in Iterated PWL Neural Networks H. Beise MLT 19 0 0 16 May 2024
Spectral complexity of deep neural networks Simmaco Di Lillo Domenico Marinucci Michele Salvi S. Vigogna BDL 74 1 0 15 May 2024
Half-Space Feature Learning in Neural Networks Mahesh Lorik Yadav H. G. Ramaswamy Chandrashekar Lakshminarayanan MLT 27 0 0 05 Apr 2024
The Real Tropical Geometry of Neural Networks Marie-Charlotte Brandenburg Georg Loho Guido Montúfar 54 7 0 18 Mar 2024
Linearly Constrained Weights: Reducing Activation Shift for Faster Training of Neural Networks Takuro Kutsuna LLMSV 19 1 0 08 Mar 2024
On Minimal Depth in Neural Networks J. L. Valerdi 38 3 0 23 Feb 2024
Depth Separation in Norm-Bounded Infinite-Width Neural Networks Suzanna Parkinson Greg Ongie Rebecca Willett Ohad Shamir Nathan Srebro MDE 43 2 0 13 Feb 2024
Depth Separations in Neural Networks: Separating the Dimension from the Accuracy Itay Safran Daniel Reichman Paul Valiant 53 0 0 11 Feb 2024
Locality Sensitive Sparse Encoding for Learning World Models Online Zi-Yan Liu Chao Du Wee Sun Lee Min-Bin Lin KELM CLL OffRL 31 8 0 23 Jan 2024
Nonlinear functional regression by functional deep neural network with kernel embedding Zhongjie Shi Jun Fan Linhao Song Ding-Xuan Zhou Johan A. K. Suykens 50 5 0 05 Jan 2024
Deep Radon Prior: A Fully Unsupervised Framework for Sparse-View CT Reconstruction Shuo Xu Yucheng Zhang Gang Chen Xincheng Xiang Peng Cong Yuewen Sun 17 1 0 30 Dec 2023
Optimal Deep Neural Network Approximation for Korobov Functions with respect to Sobolev Norms Yahong Yang Yulong Lu 31 3 0 08 Nov 2023
The Expressive Power of Low-Rank Adaptation Yuchen Zeng Kangwook Lee 28 49 0 26 Oct 2023
Topological Expressivity of ReLU Neural Networks Ekin Ergen Moritz Grillo 54 2 0 17 Oct 2023
Deep Ridgelet Transform: Voice with Koopman Operator Proves Universality of Formal Deep Networks Sho Sonoda Yuka Hashimoto Isao Ishikawa Masahiro Ikeda 19 3 0 05 Oct 2023
Why should autoencoders work? Matthew D. Kvalheim E.D. Sontag 21 0 0 03 Oct 2023
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models Zijun Wu Yongkang Wu Lili Mou VLM 25 2 0 02 Oct 2023
A Primer on Bayesian Neural Networks: Review and Debates Federico Danieli Konstantinos Pitas M. Vladimirova Vincent Fortuin BDL AAML 56 18 0 28 Sep 2023
Minimum width for universal approximation using ReLU networks on compact domain Namjun Kim Chanho Min Sejun Park VLM 27 10 0 19 Sep 2023
DiT: Efficient Vision Transformers with Dynamic Token Routing Yuchen Ma Zhengcong Fei Junshi Huang ViT 24 2 0 07 Aug 2023
A Distance Correlation-Based Approach to Characterize the Effectiveness of Recurrent Neural Networks for Time Series Forecasting Christopher Salazar A. Banerjee AI4TS 18 2 0 28 Jul 2023
How Many Neurons Does it Take to Approximate the Maximum? Itay Safran Daniel Reichman Paul Valiant 31 8 0 18 Jul 2023
Machine learning for option pricing: an empirical investigation of network architectures Laurens Van Mieghem A. Papapantoleon Jonas Papazoglou-Hennig 11 2 0 14 Jul 2023
Neural Hilbert Ladders: Multi-Layer Neural Networks in Function Space Zhengdao Chen 28 1 0 03 Jul 2023
A Constructive Approach to Function Realization by Neural Stochastic Differential Equations Tanya Veeravalli Maxim Raginsky 11 0 0 01 Jul 2023