Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.00909
Cited By
Understanding the Loss Surface of Neural Networks for Binary Classification
19 February 2018
Shiyu Liang
Ruoyu Sun
Yixuan Li
R. Srikant
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding the Loss Surface of Neural Networks for Binary Classification"
50 / 60 papers shown
Title
Analyzing the Role of Permutation Invariance in Linear Mode Connectivity
Keyao Zhan
Puheng Li
Lei Wu
MoMe
82
0
0
13 Mar 2025
Process Reward Model with Q-Value Rankings
W. Li
Yixuan Li
LRM
59
15
0
15 Oct 2024
Understanding the Learning Dynamics of Alignment with Human Feedback
Shawn Im
Yixuan Li
ALM
32
11
0
27 Mar 2024
On the Emergence of Cross-Task Linearity in the Pretraining-Finetuning Paradigm
Zhanpeng Zhou
Zijun Chen
Yilan Chen
Bo-Wen Zhang
Junchi Yan
MoMe
19
10
0
06 Feb 2024
ARGS: Alignment as Reward-Guided Search
Maxim Khanov
Jirayu Burapacheep
Yixuan Li
35
46
0
23 Jan 2024
Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity
Zhanpeng Zhou
Yongyi Yang
Xiaojiang Yang
Junchi Yan
Wei Hu
39
26
0
17 Jul 2023
Empirical Loss Landscape Analysis of Neural Network Activation Functions
Anna Sergeevna Bosman
A. Engelbrecht
Mardé Helbig
10
4
0
28 Jun 2023
NTK-SAP: Improving neural network pruning by aligning training dynamics
Yite Wang
Dawei Li
Ruoyu Sun
42
19
0
06 Apr 2023
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
Shuai Zhang
Ming Wang
Pin-Yu Chen
Sijia Liu
Songtao Lu
Miaoyuan Liu
MLT
27
16
0
06 Feb 2023
Xtreme Margin: A Tunable Loss Function for Binary Classification Problems
Rayan Wali
MQ
20
3
0
31 Oct 2022
When Expressivity Meets Trainability: Fewer than
n
n
n
Neurons Can Work
Jiawei Zhang
Yushun Zhang
Mingyi Hong
Ruoyu Sun
Zhi-Quan Luo
29
10
0
21 Oct 2022
Deep learning, stochastic gradient descent and diffusion maps
Carmina Fjellström
Kaj Nyström
DiffM
25
14
0
04 Apr 2022
Global Convergence Analysis of Deep Linear Networks with A One-neuron Layer
Kun Chen
Dachao Lin
Zhihua Zhang
19
1
0
08 Jan 2022
Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
AI4CE
46
0
0
03 Jan 2022
Theoretical Exploration of Flexible Transmitter Model
Jin-Hui Wu
Shao-Qun Zhang
Yuan Jiang
Zhiping Zhou
44
3
0
11 Nov 2021
ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees
Kuan-Lin Chen
Ching-Hua Lee
H. Garudadri
Bhaskar D. Rao
AI4TS
17
6
0
10 Nov 2021
What Happens after SGD Reaches Zero Loss? --A Mathematical Framework
Zhiyuan Li
Tianhao Wang
Sanjeev Arora
MLT
90
98
0
13 Oct 2021
Noise-robust Graph Learning by Estimating and Leveraging Pairwise Interactions
Xuefeng Du
Tian Bian
Yu Rong
Bo Han
Tongliang Liu
Tingyang Xu
Wenbing Huang
Yixuan Li
Junzhou Huang
NoLa
38
11
0
14 Jun 2021
Achieving Small Test Error in Mildly Overparameterized Neural Networks
Shiyu Liang
Ruoyu Sun
R. Srikant
20
3
0
24 Apr 2021
Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations
Bo Liu
17
7
0
25 Feb 2021
Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization
Tianyi Liu
Yan Li
S. Wei
Enlu Zhou
T. Zhao
21
13
0
24 Feb 2021
When Are Solutions Connected in Deep Networks?
Quynh N. Nguyen
Pierre Bréchet
Marco Mondelli
27
9
0
18 Feb 2021
WGAN with an Infinitely Wide Generator Has No Spurious Stationary Points
Albert No
Taeho Yoon
Sehyun Kwon
Ernest K. Ryu
GAN
27
2
0
15 Feb 2021
A Convergence Theory Towards Practical Over-parameterized Deep Neural Networks
Asaf Noy
Yi Tian Xu
Y. Aflalo
Lihi Zelnik-Manor
Rong Jin
39
3
0
12 Jan 2021
Recent Theoretical Advances in Non-Convex Optimization
Marina Danilova
Pavel Dvurechensky
Alexander Gasnikov
Eduard A. Gorbunov
Sergey Guminov
Dmitry Kamzolov
Innokentiy Shibaev
33
77
0
11 Dec 2020
Towards a Better Global Loss Landscape of GANs
Ruoyu Sun
Tiantian Fang
A. Schwing
GAN
35
26
0
10 Nov 2020
Adaptive Signal Variances: CNN Initialization Through Modern Architectures
Takahiko Henmi
E. R. R. Zara
Yoshihiro Hirohashi
Tsuyoshi Kato
8
2
0
16 Aug 2020
Maximum-and-Concatenation Networks
Xingyu Xie
Hao Kong
Jianlong Wu
Wayne Zhang
Guangcan Liu
Zhouchen Lin
83
2
0
09 Jul 2020
The Global Landscape of Neural Networks: An Overview
Ruoyu Sun
Dawei Li
Shiyu Liang
Tian Ding
R. Srikant
22
84
0
02 Jul 2020
Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural Nets
Haoxiang Wang
Ruoyu Sun
Bo Li
MLT
AI4CE
27
14
0
25 Jun 2020
On the alpha-loss Landscape in the Logistic Model
Tyler Sypherd
Mario Díaz
Lalitha Sankar
Gautam Dasarathy
17
5
0
22 Jun 2020
Piecewise linear activations substantially shape the loss surfaces of neural networks
Fengxiang He
Bohan Wang
Dacheng Tao
ODL
36
28
0
27 Mar 2020
Some Geometrical and Topological Properties of DNNs' Decision Boundaries
Bo Liu
Mengya Shen
AAML
17
3
0
07 Mar 2020
Understanding Global Loss Landscape of One-hidden-layer ReLU Networks, Part 1: Theory
Bo Liu
FAtt
MLT
24
1
0
12 Feb 2020
Sharp Rate of Convergence for Deep Neural Network Classifiers under the Teacher-Student Setting
Tianyang Hu
Zuofeng Shang
Guang Cheng
32
19
0
19 Jan 2020
Revisiting Landscape Analysis in Deep Neural Networks: Eliminating Decreasing Paths to Infinity
Shiyu Liang
Ruoyu Sun
R. Srikant
35
19
0
31 Dec 2019
Deep Transfer Learning Based Downlink Channel Prediction for FDD Massive MIMO Systems
Yuwen Yang
F. Gao
Zhimeng Zhong
B. Ai
Ahmed Alkhateeb
19
133
0
27 Dec 2019
Landscape Connectivity and Dropout Stability of SGD Solutions for Over-parameterized Neural Networks
A. Shevchenko
Marco Mondelli
27
37
0
20 Dec 2019
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
25
168
0
19 Dec 2019
Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations
Tian Ding
Dawei Li
Ruoyu Sun
18
13
0
04 Nov 2019
Truth or Backpropaganda? An Empirical Investigation of Deep Learning Theory
Micah Goldblum
Jonas Geiping
Avi Schwarzschild
Michael Moeller
Tom Goldstein
18
32
0
01 Oct 2019
Are deep ResNets provably better than linear predictors?
Chulhee Yun
S. Sra
Ali Jadbabaie
19
12
0
09 Jul 2019
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
Rohith Kuditipudi
Xiang Wang
Holden Lee
Yi Zhang
Zhiyuan Li
Wei Hu
Sanjeev Arora
Rong Ge
FAtt
13
93
0
14 Jun 2019
A Tunable Loss Function for Robust Classification: Calibration, Landscape, and Generalization
Tyler Sypherd
Mario Díaz
J. Cava
Gautam Dasarathy
Peter Kairouz
Lalitha Sankar
23
27
0
05 Jun 2019
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models: Extension
Yunfei Teng
Wenbo Gao
F. Chalus
A. Choromańska
D. Goldfarb
Adrian Weller
27
12
0
24 May 2019
Mean Field Analysis of Deep Neural Networks
Justin A. Sirignano
K. Spiliopoulos
19
82
0
11 Mar 2019
Numerically Recovering the Critical Points of a Deep Linear Autoencoder
Charles G. Frye
Neha S. Wadia
M. DeWeese
K. Bouchard
27
6
0
29 Jan 2019
On Connected Sublevel Sets in Deep Learning
Quynh N. Nguyen
19
102
0
22 Jan 2019
Visualized Insights into the Optimization Landscape of Fully Convolutional Networks
Jianjie Lu
K. Tong
27
12
0
20 Jan 2019
On the Benefit of Width for Neural Networks: Disappearance of Bad Basins
Dawei Li
Tian Ding
Ruoyu Sun
29
37
0
28 Dec 2018
1
2
Next