ResearchTrend.AI
An overview of gradient descent optimization algorithms

15 September 2016
Sebastian Ruder
    ODL

Papers citing "An overview of gradient descent optimization algorithms"

47 / 697 papers shown
Title
AirLab: Autograd Image Registration Laboratory
Robin Sandkühler
C. Jud
Simon Andermatt
Philippe C. Cattin
63
53
0
26 Jun 2018
Deep Learning based Estimation of Weaving Target Maneuvers
Vitaly Shalumov
Itzik Klein
24
0
0
13 Jun 2018
A Deep Neural Network Surrogate for High-Dimensional Random Partial Differential Equations
M. A. Nabian
Hadi Meidani
AI4CE
76
102
0
08 Jun 2018
A Machine Learning Framework for Stock Selection
XingYu Fu
JinHong Du
Yifeng Guo
MingWen Liu
Tao Dong
XiuWen Duan
AIFin
63
31
0
05 Jun 2018
Solving the Kolmogorov PDE by means of deep learning
C. Beck
S. Becker
Philipp Grohs
Nor Jaafari
Arnulf Jentzen
83
96
0
01 Jun 2018
Dynamic learning rate using Mutual Information
Shrihari Vasudevan
30
6
0
18 May 2018
Opinion Fraud Detection via Neural Autoencoder Decision Forest
Manqing Dong
Lina Yao
Xianzhi Wang
B. Benatallah
Chaoran Huang
Xiaodong Ning
44
57
0
09 May 2018
An improvement of the convergence proof of the ADAM-Optimizer
Sebastian Bock
Josef Goppold
M. Weiß
73
143
0
27 Apr 2018
High-dimension Tensor Completion via Gradient-based Optimization Under Tensor-train Format
Longhao Yuan
Qibin Zhao
Lihua Gui
Jianting Cao
ViT
70
57
0
05 Apr 2018
Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks
Xiaoyuan Liang
Xunsheng Du
Guiling Wang
Zhu Han
67
418
0
29 Mar 2018
What Do We Understand About Convolutional Networks?
Isma Hadji
Richard P. Wildes
FAtt
66
99
0
23 Mar 2018
A high-bias, low-variance introduction to Machine Learning for physicists
Pankaj Mehta
Marin Bukov
Ching-Hao Wang
A. G. Day
C. Richardson
Charles K. Fisher
D. Schwab
AI4CE
133
885
0
23 Mar 2018
Lower error bounds for the stochastic gradient descent optimization algorithm: Sharp convergence rates for slowly and fast decaying learning rates
Arnulf Jentzen
Philippe von Wurstemberger
101
31
0
22 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning
Maurice Yang
Mahmoud Faraj
Assem Hussein
V. Gaudet
CVBM
64
12
0
15 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
134
1,320
0
12 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
133
883
0
03 Mar 2018
Slow and Stale Gradients Can Win the Race: Error-Runtime Trade-offs in Distributed SGD
Sanghamitra Dutta
Gauri Joshi
Soumyadip Ghosh
Parijat Dube
P. Nagpurkar
82
198
0
03 Mar 2018
Anticipation in Human-Robot Cooperation: A Recurrent Neural Network Approach for Multiple Action Sequences Prediction
Paul Schydlo
M. Raković
L. Jamone
J. Santos-Victor
123
64
0
28 Feb 2018
$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space
Qi Meng
Shuxin Zheng
Huishuai Zhang
Wei Chen
Zhi-Ming Ma
Tie-Yan Liu
133
39
0
11 Feb 2018
Deep learning in radiology: an overview of the concepts and a survey of the state of the art
Maciej A. Mazurowski
Mateusz Buda
Ashirbani Saha
Mustafa R. Bashir
MedIm
AI4CE
62
443
0
10 Feb 2018
Fast Point Spread Function Modeling with Deep Learning
J. Herbel
T. Kacprzak
A. Amara
Alexandre Réfrégier
Aurelien Lucchi
79
44
0
23 Jan 2018
Universal Language Model Fine-tuning for Text Classification
Jeremy Howard
Sebastian Ruder
VLM
83
276
0
18 Jan 2018
MXNET-MPI: Embedding MPI parallelism in Parameter Server Task Model for scaling Deep Learning
Amith R. Mamidala
Georgios Kollias
C. Ward
F. Artico
78
20
0
11 Jan 2018
Convergence Analysis of Gradient Descent Algorithms with Proportional Updates
Igor Gitman
D. Dilipkumar
Ben Parr
32
5
0
09 Jan 2018
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad
Sharan Sankar
Joseph Barfett
E. Colak
S. Valaee
AI4TS
153
587
0
29 Dec 2017
Deep supervised learning using local errors
Hesham Mostafa
V. Ramesh
Gert Cauwenberghs
75
115
0
17 Nov 2017
Sequential Keystroke Behavioral Biometrics for Mobile User Identification via Multi-view Deep Learning
Lichao Sun
Yuqi Wang
Bokai Cao
Philip S. Yu
W. Srisa-an
Alex Leow
HAI
68
46
0
07 Nov 2017
Estimating Historical Hourly Traffic Volumes via Machine Learning and Vehicle Probe Data: A Maryland Case Study
Przemysław Sekuła
Nikola Marković
Zachary Vander Laan
K. Sadabadi
40
65
0
02 Nov 2017
A Novel Stochastic Stratified Average Gradient Method: Convergence Rate and Its Complexity
Aixiang Chen
Bingchuan Chen
Xiaolong Chai
Rui-Ling Bian
Hengguang Li
77
22
0
21 Oct 2017
Clickbait Detection in Tweets Using Self-attentive Network
Yiwei Zhou
70
53
0
15 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
102
209
0
09 Oct 2017
Machine learning approximation algorithms for high-dimensional fully nonlinear partial differential equations and second-order backward stochastic differential equations
C. Beck
Weinan E
Arnulf Jentzen
87
333
0
18 Sep 2017
A Tutorial on Deep Learning for Music Information Retrieval
Keunwoo Choi
Gyorgy Fazekas
Kyunghyun Cho
Mark Sandler
VLM
169
91
0
13 Sep 2017
Neural Translation of Musical Style
Iman Malik
Carl Henrik Ek
83
38
0
11 Aug 2017
Argument Labeling of Explicit Discourse Relations using LSTM Neural Networks
Sohail Hooda
Leila Kosseim
18
9
0
11 Aug 2017
Forecasting day-ahead electricity prices in Europe: the importance of considering market integration
J. Lago
F. Ridder
Peter Vrancx
B. de Schutter
41
203
0
01 Aug 2017
Analysis and Optimization of Convolutional Neural Network Architectures
Martin Thoma
101
73
0
31 Jul 2017
Tensor-Based Backpropagation in Neural Networks with Non-Sequential Input
Hirsh R. Agarwal
Andrew Huang
24
0
0
13 Jul 2017
Convergence Analysis of Optimization Algorithms
Hyoungseok Kim
Jihoon Kang
Woo-Myoung Park
SukHyun Ko
Yoon-Ho Choi
Daesung Yu
YoungSook Song
JungWon Choi
31
8
0
06 Jul 2017
Variants of RMSProp and Adagrad with Logarithmic Regret Bounds
Mahesh Chandra Mukkamala
Matthias Hein
ODL
84
258
0
17 Jun 2017
Train longer, generalize better: closing the generalization gap in large batch training of neural networks
Elad Hoffer
Itay Hubara
Daniel Soudry
ODL
204
803
0
24 May 2017
Deep Learning Based Regression and Multi-class Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction
Youjun Xu
Jianfeng Pei
L. Lai
64
192
0
16 Apr 2017
Efficient Parallel Translating Embedding For Knowledge Graphs
Denghui Zhang
Manling Li
Yantao Jia
Yuanzhuo Wang
Xueqi Cheng
73
18
0
30 Mar 2017
Deep Robust Kalman Filter
Shirli Di-Castro Shashua
Shie Mannor
BDL
79
28
0
07 Mar 2017
Neural Machine Translation and Sequence-to-sequence Models: A Tutorial
Graham Neubig
AIMat
104
173
0
05 Mar 2017
A State Space Approach for Piecewise-Linear Recurrent Neural Networks for Reconstructing Nonlinear Dynamics from Neural Measurements
Daniel Durstewitz
239
55
0
23 Dec 2016
Cyclical Learning Rates for Training Neural Networks
L. Smith
ODL
246
2,548
0
03 Jun 2015