ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.04747
  4. Cited By
An overview of gradient descent optimization algorithms
v1v2 (latest)

An overview of gradient descent optimization algorithms

15 September 2016
Sebastian Ruder
    ODL
ArXiv (abs)PDFHTML

Papers citing "An overview of gradient descent optimization algorithms"

50 / 697 papers shown
Title
FU-net: Multi-class Image Segmentation Using Feedback Weighted U-net
FU-net: Multi-class Image Segmentation Using Feedback Weighted U-net
M. Jafari
Ruizhe Li
Yue Xing
Dorothee Auer
S. Francis
J. Garibaldi
Xin Chen
SSeg
50
14
0
28 Apr 2020
A Review of Privacy-preserving Federated Learning for the
  Internet-of-Things
A Review of Privacy-preserving Federated Learning for the Internet-of-Things
Christopher Briggs
Zhong Fan
Péter András
135
15
0
24 Apr 2020
Self-Organized Operational Neural Networks with Generative Neurons
Self-Organized Operational Neural Networks with Generative Neurons
S. Kiranyaz
Junaid Malik
Habib Ben Abdallah
T. Ince
Alexandros Iosifidis
Moncef Gabbouj
55
81
0
24 Apr 2020
Supervised Contrastive Learning
Supervised Contrastive Learning
Prannay Khosla
Piotr Teterwak
Chen Wang
Aaron Sarna
Yonglong Tian
Phillip Isola
Aaron Maschinot
Ce Liu
Dilip Krishnan
SSL
222
4,615
0
23 Apr 2020
Automated diagnosis of COVID-19 with limited posteroanterior chest X-ray
  images using fine-tuned deep neural networks
Automated diagnosis of COVID-19 with limited posteroanterior chest X-ray images using fine-tuned deep neural networks
Narinder Singh Punn
Sonali Agarwal
114
193
0
23 Apr 2020
Heterogeneous CPU+GPU Stochastic Gradient Descent Algorithms
Heterogeneous CPU+GPU Stochastic Gradient Descent Algorithms
Yujing Ma
Florin Rusu
33
3
0
19 Apr 2020
Stochastic batch size for adaptive regularization in deep network
  optimization
Stochastic batch size for adaptive regularization in deep network optimization
Kensuke Nakamura
Stefano Soatto
Byung-Woo Hong
ODL
51
6
0
14 Apr 2020
Machine-Learning Dessins dÉnfants: Explorations via Modular and
  Seiberg-Witten Curves
Machine-Learning Dessins dÉnfants: Explorations via Modular and Seiberg-Witten Curves
Yang-Hui He
Edward Hirst
Toby Peterken
51
37
0
10 Apr 2020
Structure-preserving neural networks
Structure-preserving neural networks
Quercus Hernandez
Alberto Badías
D. González
Francisco Chinesta
Elías Cueto
PINN
129
71
0
09 Apr 2020
Federated Multi-view Matrix Factorization for Personalized
  Recommendations
Federated Multi-view Matrix Factorization for Personalized Recommendations
Adrian Flanagan
Were Oyomno
A. Grigorievskiy
K. E. Tan
Suleiman A. Khan
Muhammad Ammad-ud-din
FedML
75
71
0
08 Apr 2020
Weighted Aggregating Stochastic Gradient Descent for Parallel Deep
  Learning
Weighted Aggregating Stochastic Gradient Descent for Parallel Deep Learning
Pengzhan Guo
Zeyang Ye
Keli Xiao
Wei Zhu
50
14
0
07 Apr 2020
Adaptive Partial Scanning Transmission Electron Microscopy with
  Reinforcement Learning
Adaptive Partial Scanning Transmission Electron Microscopy with Reinforcement Learning
Jeffrey M. Ede
110
13
0
06 Apr 2020
On the convergence of physics informed neural networks for linear
  second-order elliptic and parabolic type PDEs
On the convergence of physics informed neural networks for linear second-order elliptic and parabolic type PDEs
Yeonjong Shin
Jérome Darbon
George Karniadakis
PINN
73
79
0
03 Apr 2020
FeederGAN: Synthetic Feeder Generation via Deep Graph Adversarial Nets
FeederGAN: Synthetic Feeder Generation via Deep Graph Adversarial Nets
Ming Liang
Yao Meng
Jiyu Wang
D. Lubkeman
N. Lu
GAN
48
23
0
03 Apr 2020
SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection
SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection
Habtamu Fanta
Zhiwen Shao
Lizhuang Ma
36
39
0
30 Mar 2020
Non-Adversarial Video Synthesis with Learned Priors
Non-Adversarial Video Synthesis with Learned Priors
Abhishek Aich
Akash Gupta
Yikang Shen
Rakib Hyder
M. Salman Asif
Amit K. Roy-Chowdhury
VGenGAN
160
18
0
21 Mar 2020
Mass Estimation of Galaxy Clusters with Deep Learning I:
  Sunyaev-Zel'dovich Effect
Mass Estimation of Galaxy Clusters with Deep Learning I: Sunyaev-Zel'dovich Effect
N. Gupta
C. Reichardt
74
14
0
13 Mar 2020
Hyper-Parameter Optimization: A Review of Algorithms and Applications
Hyper-Parameter Optimization: A Review of Algorithms and Applications
Tong Yu
Hong Zhu
AAML
99
541
0
12 Mar 2020
Improving the Backpropagation Algorithm with Consequentialism Weight
  Updates over Mini-Batches
Improving the Backpropagation Algorithm with Consequentialism Weight Updates over Mini-Batches
Naeem Paeedeh
Kamaledin Ghiasi-Shirazi
ODL
62
8
0
11 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models
Explore and Exploit with Heterotic Line Bundle Models
Magdalena Larfors
Robin Schneider
81
38
0
10 Mar 2020
Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of
  Partitioned Edge Learning
Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of Partitioned Edge Learning
Dingzhu Wen
M. Bennis
Kaibin Huang
75
49
0
10 Mar 2020
Warwick Electron Microscopy Datasets
Warwick Electron Microscopy Datasets
Jeffrey M. Ede
105
14
0
02 Mar 2020
Do optimization methods in deep learning applications matter?
Do optimization methods in deep learning applications matter?
Buse Melis Özyildirim
Mariam Kiran
52
11
0
28 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Shirli Di-Castro Shashua
Shie Mannor
OffRL
71
12
0
17 Feb 2020
Controlled time series generation for automotive software-in-the-loop
  testing using GANs
Controlled time series generation for automotive software-in-the-loop testing using GANs
Dhasarathy Parthasarathy
Karl Bäckström
Jens Henriksson
S. Einarsdóttir
31
13
0
16 Feb 2020
CSM-NN: Current Source Model Based Logic Circuit Simulation -- A Neural
  Network Approach
CSM-NN: Current Source Model Based Logic Circuit Simulation -- A Neural Network Approach
M. Abrishami
Massoud Pedram
Shahin Nazarian
24
7
0
13 Feb 2020
LaProp: Separating Momentum and Adaptivity in Adam
LaProp: Separating Momentum and Adaptivity in Adam
Liu Ziyin
Zhikang T.Wang
Masahito Ueda
ODL
70
18
0
12 Feb 2020
On Layer Normalization in the Transformer Architecture
On Layer Normalization in the Transformer Architecture
Ruibin Xiong
Yunchang Yang
Di He
Kai Zheng
Shuxin Zheng
Chen Xing
Huishuai Zhang
Yanyan Lan
Liwei Wang
Tie-Yan Liu
AI4CE
160
1,006
0
12 Feb 2020
D2D-Enabled Data Sharing for Distributed Machine Learning at Wireless
  Network Edge
D2D-Enabled Data Sharing for Distributed Machine Learning at Wireless Network Edge
Xiaoran Cai
Xiaopeng Mo
Junyang Chen
Jie Xu
43
26
0
28 Jan 2020
Design of Capacity-Approaching Low-Density Parity-Check Codes using
  Recurrent Neural Networks
Design of Capacity-Approaching Low-Density Parity-Check Codes using Recurrent Neural Networks
Eleni Nisioti
N. Thomos
30
22
0
05 Jan 2020
A Comprehensive Survey of Multilingual Neural Machine Translation
A Comprehensive Survey of Multilingual Neural Machine Translation
Raj Dabre
Chenhui Chu
Anoop Kunchukuttan
LRM
116
33
0
04 Jan 2020
Distributed Stochastic Algorithms for High-rate Streaming Principal
  Component Analysis
Distributed Stochastic Algorithms for High-rate Streaming Principal Component Analysis
Haroon Raja
W. Bajwa
88
11
0
04 Jan 2020
Deep Learning-Based Intrusion Detection System for Advanced Metering
  Infrastructure
Deep Learning-Based Intrusion Detection System for Advanced Metering Infrastructure
Zakaria El Mrabet
Mehdi Ezzari
Hassan El Ghazi
B. A. E. Majd
23
14
0
31 Dec 2019
Parallel cross-validation: a scalable fitting method for Gaussian
  process models
Parallel cross-validation: a scalable fitting method for Gaussian process models
Florian Gerber
D. Nychka
19
9
0
31 Dec 2019
Pipelined Training with Stale Weights of Deep Convolutional Neural
  Networks
Pipelined Training with Stale Weights of Deep Convolutional Neural Networks
Lifu Zhang
T. Abdelrahman
55
0
0
29 Dec 2019
SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks
  with Multi-Part Loss Functions
SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss Functions
A. Heydari
Craig Thompson
A. Mehmood
64
64
0
27 Dec 2019
Second-order Information in First-order Optimization Methods
Second-order Information in First-order Optimization Methods
Yuzheng Hu
Licong Lin
Shange Tang
ODL
53
2
0
20 Dec 2019
Optimization for deep learning: theory and algorithms
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
137
169
0
19 Dec 2019
Comparison of Neuronal Attention Models
Comparison of Neuronal Attention Models
Mohamed Karim Belaid
40
1
0
07 Dec 2019
Physically Interpretable Neural Networks for the Geosciences:
  Applications to Earth System Variability
Physically Interpretable Neural Networks for the Geosciences: Applications to Earth System Variability
B. Toms
E. Barnes
I. Ebert‐Uphoff
AI4CE
103
216
0
04 Dec 2019
Region segmentation via deep learning and convex optimization
Region segmentation via deep learning and convex optimization
Matthias Sonntag
V. Morgenshtern
3DPC
28
1
0
28 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control
  problem with state constraints
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
72
65
0
26 Nov 2019
Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems
Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems
Jaynta Mandi
Emir Demirović
Peter Stuckey
Tias Guns
76
146
0
22 Nov 2019
Understanding the Disharmony between Weight Normalization Family and
  Weight Decay: $ε-$shifted $L_2$ Regularizer
Understanding the Disharmony between Weight Normalization Family and Weight Decay: ε−ε-ε−shifted L2L_2L2​ Regularizer
Li Xiang
Chen Shuo
Xia Yan
Yang Jian
59
2
0
14 Nov 2019
Variable Star Classification Using Multi-View Metric Learning
Variable Star Classification Using Multi-View Metric Learning
K. Johnston
S. Caballero-Nieves
V. Petit
A. Peter
Rana Haber
50
3
0
13 Nov 2019
Short-term forecasting of solar irradiance without local telemetry: a
  generalized model using satellite data
Short-term forecasting of solar irradiance without local telemetry: a generalized model using satellite data
J. Lago
K. D. Brabandere
F. Ridder
B. de Schutter
37
59
0
12 Nov 2019
Regularized Deep Networks in Intelligent Transportation Systems: A
  Taxonomy and a Case Study
Regularized Deep Networks in Intelligent Transportation Systems: A Taxonomy and a Case Study
Mohammad Mahdi Bejani
M. Ghatee
OOD
41
13
0
08 Nov 2019
An Efficient and Effective Second-Order Training Algorithm for
  LSTM-based Adaptive Learning
An Efficient and Effective Second-Order Training Algorithm for LSTM-based Adaptive Learning
Nuri Mert Vural
Salih Ergüt
Suleyman S. Kozat
41
13
0
22 Oct 2019
Topological Navigation Graph Framework
Topological Navigation Graph Framework
P. Daniušis
Shubham Juneja
Lukas Valatka
Linas Petkevičius
67
1
0
15 Oct 2019
Characterizing Deep Learning Training Workloads on Alibaba-PAI
Characterizing Deep Learning Training Workloads on Alibaba-PAI
Mengdi Wang
Chen Meng
Guoping Long
Chuan Wu
Jun Yang
Wei Lin
Yangqing Jia
73
56
0
14 Oct 2019
Previous
123...1011121314
Next