Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.01530
Cited By
Fisher-Rao Metric, Geometry, and Complexity of Neural Networks
5 November 2017
Tengyuan Liang
T. Poggio
Alexander Rakhlin
J. Stokes
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fisher-Rao Metric, Geometry, and Complexity of Neural Networks"
43 / 43 papers shown
Title
More Optimal Fractional-Order Stochastic Gradient Descent for Non-Convex Optimization Problems
Mohammad Partohaghighi
Roummel Marcia
YangQuan Chen
19
0
0
05 May 2025
Effective Dimension Aware Fractional-Order Stochastic Gradient Descent for Convex Optimization Problems
Mohammad Partohaghighi
Roummel Marcia
YangQuan Chen
49
0
0
17 Mar 2025
Application of Langevin Dynamics to Advance the Quantum Natural Gradient Optimization Algorithm
Oleksandr Borysenko
Mykhailo Bratchenko
Ilya Lukin
Mykola Luhanko
Ihor Omelchenko
Andrii Sotnikov
Alessandro Lomi
55
0
0
17 Feb 2025
What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?
Weijie Tu
Weijian Deng
Liang Zheng
Tom Gedeon
40
0
0
14 Jun 2024
The Uncanny Valley: Exploring Adversarial Robustness from a Flatness Perspective
Nils Philipp Walter
Linara Adilova
Jilles Vreeken
Michael Kamp
AAML
48
2
0
27 May 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
38
4
0
04 Apr 2024
Randomized Adversarial Training via Taylor Expansion
Gao Jin
Xinping Yi
Dengyu Wu
Ronghui Mu
Xiaowei Huang
AAML
44
34
0
19 Mar 2023
A numerical approximation method for the Fisher-Rao distance between multivariate normal distributions
Frank Nielsen
27
17
0
16 Feb 2023
The Geometry of Neural Nets' Parameter Spaces Under Reparametrization
Agustinus Kristiadi
Felix Dangel
Philipp Hennig
32
11
0
14 Feb 2023
A Modern Look at the Relationship between Sharpness and Generalization
Maksym Andriushchenko
Francesco Croce
Maximilian Müller
Matthias Hein
Nicolas Flammarion
3DH
19
55
0
14 Feb 2023
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
Guihong Li
Yuedong Yang
Kartikeya Bhardwaj
R. Marculescu
36
61
0
26 Jan 2023
Efficient Activation Function Optimization through Surrogate Modeling
G. Bingham
Risto Miikkulainen
24
2
0
13 Jan 2023
Effects of Data Geometry in Early Deep Learning
Saket Tiwari
George Konidaris
79
7
0
29 Dec 2022
Task Discovery: Finding the Tasks that Neural Networks Generalize on
Andrei Atanov
Andrei Filatov
Teresa Yeo
Ajay Sohmshetty
Amir Zamir
OOD
45
10
0
01 Dec 2022
Exploring Temporal Information Dynamics in Spiking Neural Networks
Youngeun Kim
Yuhang Li
Hyoungseob Park
Yeshwanth Venkatesha
Anna Hambitzer
Priyadarshini Panda
19
32
0
26 Nov 2022
Do highly over-parameterized neural networks generalize since bad solutions are rare?
Julius Martinetz
T. Martinetz
30
1
0
07 Nov 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
32
4
0
30 Sep 2022
Lower and Upper Bounds for Numbers of Linear Regions of Graph Convolutional Networks
Hao Chen
Yu Wang
Huan Xiong
GNN
16
6
0
01 Jun 2022
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape
Devansh Bisla
Jing Wang
A. Choromańska
25
34
0
20 Jan 2022
On the Convergence of Shallow Neural Network Training with Randomly Masked Neurons
Fangshuo Liao
Anastasios Kyrillidis
46
16
0
05 Dec 2021
In Search of Probeable Generalization Measures
Jonathan Jaegerman
Khalil Damouni
M. M. Ankaralı
Konstantinos N. Plataniotis
27
2
0
23 Oct 2021
Rethinking Multidimensional Discriminator Output for Generative Adversarial Networks
M. Dai
Haibin Hang
A. Srivastava
18
3
0
08 Sep 2021
Minimum sharpness: Scale-invariant parameter-robustness of neural networks
Hikaru Ibayashi
Takuo Hamaguchi
Masaaki Imaizumi
25
5
0
23 Jun 2021
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
48
281
0
23 Feb 2021
Noise and Fluctuation of Finite Learning Rate Stochastic Gradient Descent
Kangqiao Liu
Liu Ziyin
Masakuni Ueda
MLT
61
37
0
07 Dec 2020
Interpreting and Disentangling Feature Components of Various Complexity from DNNs
Jie Ren
Mingjie Li
Zexu Liu
Quanshi Zhang
CoGe
19
18
0
29 Jun 2020
On the Optimal Weighted
ℓ
2
\ell_2
ℓ
2
Regularization in Overparameterized Linear Regression
Denny Wu
Ji Xu
33
121
0
10 Jun 2020
Similarity of Neural Networks with Gradients
Shuai Tang
Wesley J. Maddox
Charlie Dickens
Tom Diethe
Andreas C. Damianou
19
25
0
25 Mar 2020
Geometric Dataset Distances via Optimal Transport
David Alvarez-Melis
Nicolò Fusi
OT
80
194
0
07 Feb 2020
Information-Theoretic Local Minima Characterization and Regularization
Zhiwei Jia
Hao Su
27
19
0
19 Nov 2019
Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization
T. Poggio
Andrzej Banburski
Q. Liao
ODL
31
161
0
25 Aug 2019
The Normalization Method for Alleviating Pathological Sharpness in Wide Neural Networks
Ryo Karakida
S. Akaho
S. Amari
27
39
0
07 Jun 2019
Dimensionality compression and expansion in Deep Neural Networks
Stefano Recanatesi
M. Farrell
Madhu S. Advani
Timothy Moore
Guillaume Lajoie
E. Shea-Brown
23
72
0
02 Jun 2019
A Priori Estimates of the Population Risk for Residual Networks
E. Weinan
Chao Ma
Qingcan Wang
UQCV
37
61
0
06 Mar 2019
Understanding over-parameterized deep networks by geometrization
Xiao Dong
Ling Zhou
GNN
AI4CE
21
7
0
11 Feb 2019
Task2Vec: Task Embedding for Meta-Learning
Alessandro Achille
Michael Lam
Rahul Tewari
Avinash Ravichandran
Subhransu Maji
Charless C. Fowlkes
Stefano Soatto
Pietro Perona
SSL
28
309
0
10 Feb 2019
Optimization Problems for Machine Learning: A Survey
Claudio Gambella
Bissan Ghaddar
Joe Naoum-Sawaya
AI4CE
30
178
0
16 Jan 2019
Detecting Memorization in ReLU Networks
Edo Collins
Siavash Bigdeli
Sabine Süsstrunk
36
4
0
08 Oct 2018
Theory of Generative Deep Learning : Probe Landscape of Empirical Error via Norm Based Capacity Control
Wendi Xu
Ming Zhang
24
4
0
03 Oct 2018
Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate
M. Belkin
Daniel J. Hsu
P. Mitra
AI4CE
30
256
0
13 Jun 2018
Universal Statistics of Fisher Information in Deep Neural Networks: Mean Field Approach
Ryo Karakida
S. Akaho
S. Amari
FedML
47
140
0
04 Jun 2018
Stronger generalization bounds for deep nets via a compression approach
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
MLT
AI4CE
26
630
0
14 Feb 2018
Norm-Based Capacity Control in Neural Networks
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
127
577
0
27 Feb 2015
1