1609.04747
Cited By
v1
v2 (latest)
An overview of gradient descent optimization algorithms
15 September 2016
Sebastian Ruder
ODL
Papers citing
"An overview of gradient descent optimization algorithms"
47 / 697 papers shown
Title
AirLab: Autograd Image Registration Laboratory
Robin Sandkühler
C. Jud
Simon Andermatt
Philippe C. Cattin
63
53
0
26 Jun 2018
Deep Learning based Estimation of Weaving Target Maneuvers
Vitaly Shalumov
Itzik Klein
24
0
0
13 Jun 2018
A Deep Neural Network Surrogate for High-Dimensional Random Partial Differential Equations
M. A. Nabian
Hadi Meidani
AI4CE
76
102
0
08 Jun 2018
A Machine Learning Framework for Stock Selection
XingYu Fu
JinHong Du
Yifeng Guo
MingWen Liu
Tao Dong
XiuWen Duan
AIFin
63
31
0
05 Jun 2018
Solving the Kolmogorov PDE by means of deep learning
C. Beck
S. Becker
Philipp Grohs
Nor Jaafari
Arnulf Jentzen
83
96
0
01 Jun 2018
Dynamic learning rate using Mutual Information
Shrihari Vasudevan
30
6
0
18 May 2018
Opinion Fraud Detection via Neural Autoencoder Decision Forest
Manqing Dong
Lina Yao
Xianzhi Wang
B. Benatallah
Chaoran Huang
Xiaodong Ning
44
57
0
09 May 2018
An improvement of the convergence proof of the ADAM-Optimizer
Sebastian Bock
Josef Goppold
M. Weiß
73
143
0
27 Apr 2018
High-dimension Tensor Completion via Gradient-based Optimization Under Tensor-train Format
Longhao Yuan
Qibin Zhao
Lihua Gui
Jianting Cao
ViT
70
57
0
05 Apr 2018
Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks
Xiaoyuan Liang
Xunsheng Du
Guiling Wang
Zhu Han
67
418
0
29 Mar 2018
What Do We Understand About Convolutional Networks?
Isma Hadji
Richard P. Wildes
FAtt
66
99
0
23 Mar 2018
A high-bias, low-variance introduction to Machine Learning for physicists
Pankaj Mehta
Marin Bukov
Ching-Hao Wang
A. G. Day
C. Richardson
Charles K. Fisher
D. Schwab
AI4CE
133
885
0
23 Mar 2018
Lower error bounds for the stochastic gradient descent optimization algorithm: Sharp convergence rates for slowly and fast decaying learning rates
Arnulf Jentzen
Philippe von Wurstemberger
101
31
0
22 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning
Maurice Yang
Mahmoud Faraj
Assem Hussein
V. Gaudet
CVBM
64
12
0
15 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
134
1,320
0
12 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
133
883
0
03 Mar 2018
Slow and Stale Gradients Can Win the Race: Error-Runtime Trade-offs in Distributed SGD
Sanghamitra Dutta
Gauri Joshi
Soumyadip Ghosh
Parijat Dube
P. Nagpurkar
82
198
0
03 Mar 2018
Anticipation in Human-Robot Cooperation: A Recurrent Neural Network Approach for Multiple Action Sequences Prediction
Paul Schydlo
M. Raković
L. Jamone
J. Santos-Victor
123
64
0
28 Feb 2018
$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space
Qi Meng
Shuxin Zheng
Huishuai Zhang
Wei Chen
Zhi-Ming Ma
Tie-Yan Liu
133
39
0
11 Feb 2018
Deep learning in radiology: an overview of the concepts and a survey of the state of the art
Maciej A. Mazurowski
Mateusz Buda
Ashirbani Saha
Mustafa R. Bashir
MedIm
AI4CE
62
443
0
10 Feb 2018
Fast Point Spread Function Modeling with Deep Learning
J. Herbel
T. Kacprzak
A. Amara
Alexandre Réfrégier
Aurelien Lucchi
79
44
0
23 Jan 2018
Universal Language Model Fine-tuning for Text Classification
Jeremy Howard
Sebastian Ruder
VLM
83
276
0
18 Jan 2018
MXNET-MPI: Embedding MPI parallelism in Parameter Server Task Model for scaling Deep Learning
Amith R. Mamidala
Georgios Kollias
C. Ward
F. Artico
78
20
0
11 Jan 2018
Convergence Analysis of Gradient Descent Algorithms with Proportional Updates
Igor Gitman
D. Dilipkumar
Ben Parr
32
5
0
09 Jan 2018
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad
Sharan Sankar
Joseph Barfett
E. Colak
S. Valaee
AI4TS
153
587
0
29 Dec 2017
Deep supervised learning using local errors
Hesham Mostafa
V. Ramesh
Gert Cauwenberghs
75
115
0
17 Nov 2017
Sequential Keystroke Behavioral Biometrics for Mobile User Identification via Multi-view Deep Learning
Lichao Sun
Yuqi Wang
Bokai Cao
Philip S. Yu
W. Srisa-an
Alex Leow
HAI
68
46
0
07 Nov 2017
Estimating Historical Hourly Traffic Volumes via Machine Learning and Vehicle Probe Data: A Maryland Case Study
Przemysław Sekuła
Nikola Marković
Zachary Vander Laan
K. Sadabadi
40
65
0
02 Nov 2017
A Novel Stochastic Stratified Average Gradient Method: Convergence Rate and Its Complexity
Aixiang Chen
Bingchuan Chen
Xiaolong Chai
Rui-Ling Bian
Hengguang Li
77
22
0
21 Oct 2017
Clickbait Detection in Tweets Using Self-attentive Network
Yiwei Zhou
70
53
0
15 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
102
209
0
09 Oct 2017
Machine learning approximation algorithms for high-dimensional fully nonlinear partial differential equations and second-order backward stochastic differential equations
C. Beck
Weinan E
Arnulf Jentzen
87
333
0
18 Sep 2017
A Tutorial on Deep Learning for Music Information Retrieval
Keunwoo Choi
Gyorgy Fazekas
Kyunghyun Cho
Mark Sandler
VLM
169
91
0
13 Sep 2017
Neural Translation of Musical Style
Iman Malik
Carl Henrik Ek
83
38
0
11 Aug 2017
Argument Labeling of Explicit Discourse Relations using LSTM Neural Networks
Sohail Hooda
Leila Kosseim
18
9
0
11 Aug 2017
Forecasting day-ahead electricity prices in Europe: the importance of considering market integration
J. Lago
F. Ridder
Peter Vrancx
B. de Schutter
41
203
0
01 Aug 2017
Analysis and Optimization of Convolutional Neural Network Architectures
Martin Thoma
101
73
0
31 Jul 2017
Tensor-Based Backpropagation in Neural Networks with Non-Sequential Input
Hirsh R. Agarwal
Andrew Huang
24
0
0
13 Jul 2017
Convergence Analysis of Optimization Algorithms
Hyoungseok Kim
Jihoon Kang
Woo-Myoung Park
SukHyun Ko
Yoon-Ho Choi
Daesung Yu
YoungSook Song
JungWon Choi
31
8
0
06 Jul 2017
Variants of RMSProp and Adagrad with Logarithmic Regret Bounds
Mahesh Chandra Mukkamala
Matthias Hein
ODL
84
258
0
17 Jun 2017
Train longer, generalize better: closing the generalization gap in large batch training of neural networks
Elad Hoffer
Itay Hubara
Daniel Soudry
ODL
204
803
0
24 May 2017
Deep Learning Based Regression and Multi-class Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction
Youjun Xu
Jianfeng Pei
L. Lai
64
192
0
16 Apr 2017
Efficient Parallel Translating Embedding For Knowledge Graphs
Denghui Zhang
Manling Li
Yantao Jia
Yuanzhuo Wang
Xueqi Cheng
73
18
0
30 Mar 2017
Deep Robust Kalman Filter
Shirli Di-Castro Shashua
Shie Mannor
BDL
79
28
0
07 Mar 2017
Neural Machine Translation and Sequence-to-sequence Models: A Tutorial
Graham Neubig
AIMat
104
173
0
05 Mar 2017
A State Space Approach for Piecewise-Linear Recurrent Neural Networks for Reconstructing Nonlinear Dynamics from Neural Measurements
Daniel Durstewitz
239
55
0
23 Dec 2016
Cyclical Learning Rates for Training Neural Networks
L. Smith
ODL
246
2,548
0
03 Jun 2015