1712.09913
Visualizing the Loss Landscape of Neural Nets
28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
Papers citing
"Visualizing the Loss Landscape of Neural Nets"
50 / 1,068 papers shown
Title
Towards Understanding the Condensation of Neural Networks at Initial Training
Hanxu Zhou
Qixuan Zhou
Yaoyu Zhang
Z. Xu
MLT
AI4CE
94
30
0
25 May 2021
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly
Yuchen Jin
Dinesh Manocha
Liangyu Zhao
Yibo Zhu
Chuanxiong Guo
Marco Canini
Arvind Krishnamurthy
84
19
0
22 May 2021
AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks
S. K. Roy
Mercedes Eugenia Paoletti
J. Haut
S. Dubey
Purushottam Kar
A. Plaza
B. B. Chaudhuri
ODL
85
18
0
21 May 2021
Variational Quantum Classifiers Through the Lens of the Hessian
Pinaki Sen
Amandeep Singh Bhatia
A. Bhatia
Ahmed Elbeltagi
54
25
0
21 May 2021
Neighborhood-Aware Neural Architecture Search
Xiaofang Wang
Shengcao Cao
Mengtian Li
Kris Kitani
148
6
0
13 May 2021
On the reproducibility of fully convolutional neural networks for modeling time-space evolving physical systems
Wagner Gonçalves Pinto
Antonio Alguacil
Michaël Bauerheim
41
2
0
12 May 2021
The layer-wise L1 Loss Landscape of Neural Nets is more complex around local minima
Peter Hinz
16
2
0
06 May 2021
Data-driven Weight Initialization with Sylvester Solvers
Debasmit Das
Yash Bhalgat
Fatih Porikli
ODL
72
3
0
02 May 2021
AG-CUResNeSt: A Novel Method for Colon Polyp Segmentation
D. V. Sang
Tran Quang Chung
P. N. Lan
D. V. Hang
D. Long
N. T. Thuy
163
21
0
02 May 2021
Focusing on Shadows for Predicting Heightmaps from Single Remotely Sensed RGB Images with Deep Learning
S. Karatsiolis
A. Kamilaris
41
1
0
22 Apr 2021
MetricOpt: Learning to Optimize Black-Box Evaluation Metrics
Chen Huang
Shuangfei Zhai
Pengsheng Guo
J. Susskind
92
12
0
21 Apr 2021
Annealing Knowledge Distillation
A. Jafari
Mehdi Rezagholizadeh
Pranav Sharma
A. Ghodsi
85
79
0
14 Apr 2021
Targeted Adversarial Training for Natural Language Understanding
L. Pereira
Xiaodong Liu
Hao Cheng
Hoifung Poon
Jianfeng Gao
Ichiro Kobayashi
65
12
0
12 Apr 2021
Relating Adversarially Robust Generalization to Flat Minima
David Stutz
Matthias Hein
Bernt Schiele
OOD
105
67
0
09 Apr 2021
Spectral Analysis of the Neural Tangent Kernel for Deep Residual Networks
Yuval Belfer
Amnon Geifman
Meirav Galun
Ronen Basri
71
17
0
07 Apr 2021
Training Deep Neural Networks via Branch-and-Bound
Yuanwei Wu
Ziming Zhang
Guanghui Wang
ODL
57
0
0
05 Apr 2021
Empirically explaining SGD from a line search perspective
Max Mutschler
A. Zell
ODL
LRM
46
4
0
31 Mar 2021
Exploiting Invariance in Training Deep Neural Networks
Chengxi Ye
Xiong Zhou
Tristan McKinney
Yanfeng Liu
Qinggang Zhou
Fedor Zhdanov
20
4
0
30 Mar 2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Hiroki Furuta
T. Matsushima
Tadashi Kozuno
Y. Matsuo
Sergey Levine
Ofir Nachum
S. Gu
OffRL
52
14
0
23 Mar 2021
Adversarial Feature Augmentation and Normalization for Visual Recognition
Tianlong Chen
Yu Cheng
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zhangyang Wang
Jingjing Liu
AAML
ViT
68
19
0
22 Mar 2021
Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges
Cynthia Rudin
Chaofan Chen
Zhi Chen
Haiyang Huang
Lesia Semenova
Chudi Zhong
FaML
AI4CE
LRM
240
675
0
20 Mar 2021
Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces
Tao Li
Lei Tan
Qinghua Tao
Yipeng Liu
Xiaolin Huang
85
10
0
20 Mar 2021
UPANets: Learning from the Universal Pixel Attention Networks
Ching-Hsun Tseng
Shin-Jye Lee
Jianxing Feng
Shengzhong Mao
Yuping Wu
Jia-Yu Shang
Mou-Chung Tseng
Xiao-Jun Zeng
52
15
0
15 Mar 2021
A Gradient Estimator for Time-Varying Electrical Networks with Non-Linear Dissipation
Jack D. Kendall
57
7
0
09 Mar 2021
QAIR: Practical Query-efficient Black-Box Attacks for Image Retrieval
Xiaodan Li
Jinfeng Li
YueFeng Chen
Shaokai Ye
Yuan He
Shuhui Wang
Hang Su
Hui Xue
75
44
0
04 Mar 2021
LQResNet: A Deep Neural Network Architecture for Learning Dynamic Processes
P. Goyal
P. Benner
AI4CE
23
11
0
03 Mar 2021
DPlis: Boosting Utility of Differentially Private Deep Learning via Randomized Smoothing
Wenxiao Wang
Tianhao Wang
Lun Wang
Nanqing Luo
Pan Zhou
Basel Alomair
R. Jia
109
16
0
02 Mar 2021
Smoothness Analysis of Adversarial Training
Sekitoshi Kanai
Masanori Yamada
Hiroshi Takahashi
Yuki Yamanaka
Yasutoshi Ida
AAML
84
6
0
02 Mar 2021
Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations
Bo Liu
45
7
0
25 Feb 2021
Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling
Gregory W. Benton
Wesley J. Maddox
Sanae Lotfi
A. Wilson
UQCV
126
70
0
25 Feb 2021
Visualizing MuZero Models
Joery A. de Vries
K. Voskuil
Thomas M. Moerland
Aske Plaat
84
9
0
25 Feb 2021
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
122
290
0
23 Feb 2021
Learning Neural Network Subspaces
Mitchell Wortsman
Maxwell Horton
Carlos Guestrin
Ali Farhadi
Mohammad Rastegari
UQCV
99
88
0
20 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
97
40
0
16 Feb 2021
WGAN with an Infinitely Wide Generator Has No Spurious Stationary Points
Albert No
Taeho Yoon
Sehyun Kwon
Ernest K. Ryu
GAN
49
2
0
15 Feb 2021
Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise
Xingyu Wang
Sewoong Oh
C. Rhee
75
17
0
08 Feb 2021
OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning Distribution
Minfang Lu
Shuai Ning
Shuangrong Liu
Fengyang Sun
Bo Zhang
Bo Yang
Linshan Wang
62
5
0
07 Feb 2021
Understanding the Interaction of Adversarial Training with Noisy Labels
Jianing Zhu
Jingfeng Zhang
Bo Han
Tongliang Liu
Gang Niu
Hongxia Yang
Mohan Kankanhalli
Masashi Sugiyama
AAML
90
27
0
06 Feb 2021
Adversarial Training Makes Weight Loss Landscape Sharper in Logistic Regression
Masanori Yamada
Sekitoshi Kanai
Tomoharu Iwata
Tomokatsu Takahashi
Yuki Yamanaka
Hiroshi Takahashi
Atsutoshi Kumagai
AAML
124
9
0
05 Feb 2021
ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums
Scott H. Hawley
Andrew C. Morrison
26
2
0
01 Feb 2021
Exploring the Geometry and Topology of Neural Network Loss Landscapes
Stefan Horoi
Je-chun Huang
Bastian Rieck
Guillaume Lajoie
Guy Wolf
Smita Krishnaswamy
45
13
0
31 Jan 2021
Visualization of Nonlinear Programming for Robot Motion Planning
David Hägele
Moataz Abdelaal
Ozgur S. Oguz
Marc Toussaint
Daniel Weiskopf
27
3
0
28 Jan 2021
Old but Gold: Reconsidering the value of feedforward learners for software analytics
Rahul Yedida
Xueqi Yang
Tim Menzies
AI4TS
38
4
0
15 Jan 2021
Spending Your Winning Lottery Better After Drawing It
Ajay Jaiswal
Haoyu Ma
Tianlong Chen
Ying Ding
Zhangyang Wang
48
6
0
08 Jan 2021
BN-invariant sharpness regularizes the training model to better generalization
Mingyang Yi
Huishuai Zhang
Wei Chen
Zhi-Ming Ma
Tie-Yan Liu
128
3
0
08 Jan 2021
Advances in Electron Microscopy with Deep Learning
Jeffrey M. Ede
77
3
0
04 Jan 2021
Recoding latent sentence representations -- Dynamic gradient-based activation modification in RNNs
Dennis Ulmer
45
0
0
03 Jan 2021
Topological obstructions in neural networks learning
S. Barannikov
Daria Voronkova
I. Trofimov
Alexander Korotin
Grigorii Sotnikov
Evgeny Burnaev
37
6
0
31 Dec 2020
BinaryBERT: Pushing the Limit of BERT Quantization
Haoli Bai
Wei Zhang
Lu Hou
Lifeng Shang
Jing Jin
Xin Jiang
Qun Liu
Michael Lyu
Irwin King
MQ
226
227
0
31 Dec 2020
Mathematical Models of Overparameterized Neural Networks
Cong Fang
Hanze Dong
Tong Zhang
173
23
0
27 Dec 2020