1601.04114
Training Recurrent Neural Networks by Diffusion
16 January 2016
H. Mobahi
ODL
Papers citing "Training Recurrent Neural Networks by Diffusion"
14 / 14 papers shown
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Aodi Li
Liansheng Zhuang
Xiao Long
Minghong Yao
Shafei Wang
246
0
0
18 Dec 2024
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape
Devansh Bisla
Jing Wang
A. Choromańska
27
34
0
20 Jan 2022
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
48
282
0
23 Feb 2021
Adversarial Training Makes Weight Loss Landscape Sharper in Logistic Regression
Masanori Yamada
Sekitoshi Kanai
Tomoharu Iwata
Tomokatsu Takahashi
Yuki Yamanaka
Hiroshi Takahashi
Atsutoshi Kumagai
AAML
16
9
0
05 Feb 2021
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
122
1,283
0
03 Oct 2020
Adaptive Regularization via Residual Smoothing in Deep Learning Optimization
Jung-Kyun Cho
Junseok Kwon
Byung-Woo Hong
31
1
0
23 Jul 2019
Detecting Synapse Location and Connectivity by Signed Proximity Estimation and Pruning with Deep Nets
T. Parag
Daniel R. Berger
L. Kamentsky
B. Staffler
D. Wei
M. Helmstaedter
J. Lichtman
Hanspeter Pfister
20
11
0
08 Jul 2018
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
W. Wen
Yandan Wang
Feng Yan
Cong Xu
Chunpeng Wu
Yiran Chen
H. Li
24
50
0
21 May 2018
Deep Relaxation: partial differential equations for optimizing deep neural networks
Pratik Chaudhari
Adam M. Oberman
Stanley Osher
Stefano Soatto
G. Carlier
27
153
0
17 Apr 2017
Mollifying Networks
Çağlar Gülçehre
Marcin Moczulski
Francesco Visin
Yoshua Bengio
23
46
0
17 Aug 2016
Noisy Activation Functions
Çağlar Gülçehre
Marcin Moczulski
Misha Denil
Yoshua Bengio
9
283
0
01 Mar 2016
Adding Gradient Noise Improves Learning for Very Deep Networks
Arvind Neelakantan
Luke Vilnis
Quoc V. Le
Ilya Sutskever
Lukasz Kaiser
Karol Kurach
James Martens
AI4CE
ODL
27
541
0
21 Nov 2015
On the energy landscape of deep networks
Pratik Chaudhari
Stefano Soatto
ODL
40
27
0
20 Nov 2015
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
266
7,639
0
03 Jul 2012