ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.04747
  4. Cited By
An overview of gradient descent optimization algorithms
v1v2 (latest)

An overview of gradient descent optimization algorithms

15 September 2016
Sebastian Ruder
    ODL
ArXiv (abs)PDFHTML

Papers citing "An overview of gradient descent optimization algorithms"

50 / 698 papers shown
Title
SplitAVG: A heterogeneity-aware federated deep learning method for
  medical imaging
SplitAVG: A heterogeneity-aware federated deep learning method for medical imaging
Miao Zhang
Liangqiong Qu
Praveer Singh
Jayashree Kalpathy-Cramer
D. Rubin
OODFedML
127
63
0
06 Jul 2021
A comparison of LSTM and GRU networks for learning symbolic sequences
A comparison of LSTM and GRU networks for learning symbolic sequences
Roberto Cahuantzi
Xinye Chen
S. Güttel
96
144
0
05 Jul 2021
Photozilla: A Large-Scale Photography Dataset and Visual Embedding for
  20 Photography Styles
Photozilla: A Large-Scale Photography Dataset and Visual Embedding for 20 Photography Styles
Trisha Singhal
Junhua Liu
L. Blessing
Kwan Hui Lim
92
3
0
21 Jun 2021
Memory Augmented Optimizers for Deep Learning
Memory Augmented Optimizers for Deep Learning
Paul-Aymeric McRae
Prasanna Parthasarathi
Mahmoud Assran
Sarath Chandar
ODL
82
3
0
20 Jun 2021
HUMAP: Hierarchical Uniform Manifold Approximation and Projection
HUMAP: Hierarchical Uniform Manifold Approximation and Projection
Wilson E. Marcílio-Jr
D. M. Eler
F. Paulovich
Rafael M. Martins
56
10
0
14 Jun 2021
Category Theory in Machine Learning
Category Theory in Machine Learning
Dan Shiebler
Bruno Gavranović
Paul W. Wilson
60
31
0
13 Jun 2021
XBNet : An Extremely Boosted Neural Network
XBNet : An Extremely Boosted Neural Network
Tushar Sarkar
LMTD
31
22
0
09 Jun 2021
Adam in Private: Secure and Fast Training of Deep Neural Networks with
  Adaptive Moment Estimation
Adam in Private: Secure and Fast Training of Deep Neural Networks with Adaptive Moment Estimation
Nuttapong Attrapadung
Koki Hamada
Dai Ikarashi
Ryo Kikuchi
Takahiro Matsuda
Ibuki Mishina
Hiraku Morita
Jacob C. N. Schuldt
62
27
0
04 Jun 2021
Neural Network Training Using $\ell_1$-Regularization and Bi-fidelity
  Data
Neural Network Training Using ℓ1\ell_1ℓ1​-Regularization and Bi-fidelity Data
Subhayan De
Alireza Doostan
71
25
0
27 May 2021
2nd-order Updates with 1st-order Complexity
2nd-order Updates with 1st-order Complexity
M. Zimmer
25
0
0
24 May 2021
Understanding and Improvement of Adversarial Training for Network
  Embedding from an Optimization Perspective
Understanding and Improvement of Adversarial Training for Network Embedding from an Optimization Perspective
Lun Du
Xu Chen
Fei Gao
Qiang Fu
Kunqing Xie
Shi Han
Dongmei Zhang
99
12
0
17 May 2021
Layerwise Optimization by Gradient Decomposition for Continual Learning
Layerwise Optimization by Gradient Decomposition for Continual Learning
Shixiang Tang
Dapeng Chen
Jinguo Zhu
Shijie Yu
Wanli Ouyang
CLL
79
65
0
17 May 2021
Unsupervised MRI Reconstruction via Zero-Shot Learned Adversarial
  Transformers
Unsupervised MRI Reconstruction via Zero-Shot Learned Adversarial Transformers
Yilmaz Korkmaz
S. Dar
Mahmut Yurt
Muzaffer Özbey
Tolga Çukur
ViTMedIm
110
194
0
15 May 2021
ROSEFusion: Random Optimization for Online Dense Reconstruction under
  Fast Camera Motion
ROSEFusion: Random Optimization for Online Dense Reconstruction under Fast Camera Motion
JIazhao Zhang
Chenyang Zhu
Lintao Zheng
Kai Xu
102
46
0
12 May 2021
The impact of the additional features on the performance of regression
  analysis: a case study on regression analysis of music signal
The impact of the additional features on the performance of regression analysis: a case study on regression analysis of music signal
V. N. Aditya
Rupaj Kumar
23
0
0
11 May 2021
Implicit Regularization in Deep Tensor Factorization
Implicit Regularization in Deep Tensor Factorization
P. Milanesi
Hachem Kadri
Stéphane Ayache
Thierry Artières
85
9
0
04 May 2021
Citadel: Protecting Data Privacy and Model Confidentiality for
  Collaborative Learning with SGX
Citadel: Protecting Data Privacy and Model Confidentiality for Collaborative Learning with SGX
Chengliang Zhang
Junzhe Xia
Baichen Yang
Huancheng Puyang
Wei Wang
Ruichuan Chen
Istemi Ekin Akkus
Paarijaat Aditya
Feng Yan
FedML
91
39
0
04 May 2021
Dynamical prediction of two meteorological factors using the deep neural
  network and the long short-term memory $(2)$
Dynamical prediction of two meteorological factors using the deep neural network and the long short-term memory (2)(2)(2)
Ki-Hong Shin
Jae-Won Jung
Ki-Ho Chang
Dong-In Lee
C. You
Kyungsik Kim
51
2
0
28 Apr 2021
Efficient training of physics-informed neural networks via importance
  sampling
Efficient training of physics-informed neural networks via importance sampling
M. A. Nabian
R. J. Gladstone
Hadi Meidani
DiffMPINN
135
239
0
26 Apr 2021
Parallel Physics-Informed Neural Networks via Domain Decomposition
Parallel Physics-Informed Neural Networks via Domain Decomposition
K. Shukla
Ameya Dilip Jagtap
George Karniadakis
PINN
179
289
0
20 Apr 2021
Quantum Architecture Search via Deep Reinforcement Learning
Quantum Architecture Search via Deep Reinforcement Learning
En-Jui Kuo
Yao-Lung L. Fang
Samuel Yen-Chi Chen
AI4CE
90
90
0
15 Apr 2021
Demystifying BERT: Implications for Accelerator Design
Demystifying BERT: Implications for Accelerator Design
Suchita Pati
Shaizeen Aga
Nuwan Jayasena
Matthew D. Sinclair
LLMAG
88
17
0
14 Apr 2021
A proof of convergence for stochastic gradient descent in the training
  of artificial neural networks with ReLU activation for constant target
  functions
A proof of convergence for stochastic gradient descent in the training of artificial neural networks with ReLU activation for constant target functions
Arnulf Jentzen
Adrian Riekert
MLT
86
13
0
01 Apr 2021
Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end
  speech recognition
Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition
Cong-Thanh Do
R. Doddipatla
Thomas Hain
54
6
0
29 Mar 2021
Improving Online Forums Summarization via Hierarchical Unified Deep
  Neural Network
Improving Online Forums Summarization via Hierarchical Unified Deep Neural Network
Sansiri Tarnpradab
Fereshteh Jafariakinabad
K. Hua
49
5
0
25 Mar 2021
Resonant Scanning Design and Control for Fast Spatial Sampling
Resonant Scanning Design and Control for Fast Spatial Sampling
Zhanghao Sun
Ronald Quan
O. Solgaard
60
2
0
24 Mar 2021
On Imitation Learning of Linear Control Policies: Enforcing Stability
  and Robustness Constraints via LMI Conditions
On Imitation Learning of Linear Control Policies: Enforcing Stability and Robustness Constraints via LMI Conditions
Aaron J. Havens
Bin Hu
64
15
0
24 Mar 2021
Deep Learning for fully automatic detection, segmentation, and Gleason
  Grade estimation of prostate cancer in multiparametric Magnetic Resonance
  Images
Deep Learning for fully automatic detection, segmentation, and Gleason Grade estimation of prostate cancer in multiparametric Magnetic Resonance Images
Oscar J. Pellicer-Valero
José L. Jiménez
V. González-Pérez
J. C. Ramón-Borja
I. García
María Barrios Benito
P. P. Gómez
J. Rubio-Briones
M. J. Rupérez
J. D. Martín-Guerrero
27
76
0
23 Mar 2021
CLIP: Cheap Lipschitz Training of Neural Networks
CLIP: Cheap Lipschitz Training of Neural Networks
Leon Bungert
René Raab
Tim Roith
Leo Schwinn
Daniel Tenbrinck
59
33
0
23 Mar 2021
Differentiable Agent-Based Simulation for Gradient-Guided
  Simulation-Based Optimization
Differentiable Agent-Based Simulation for Gradient-Guided Simulation-Based Optimization
Philipp Andelfinger
141
14
0
23 Mar 2021
Spatio-Temporal Neural Network for Fitting and Forecasting COVID-19
Spatio-Temporal Neural Network for Fitting and Forecasting COVID-19
Yi-Shuai Niu
Wentao Ding
Junpeng Hu
Wenxu Xu
S. Canu
45
2
0
22 Mar 2021
Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in
  Tiny Subspaces
Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces
Tao Li
Lei Tan
Qinghua Tao
Yipeng Liu
Xiaolin Huang
88
10
0
20 Mar 2021
Learning without gradient descent encoded by the dynamics of a
  neurobiological model
Learning without gradient descent encoded by the dynamics of a neurobiological model
V. George
V. Morar
Weiwei Yang
Jonathan Larson
B. Tower
Shweti Mahajan
Arkin Gupta
Christopher M. White
Gabriel A. Silva
26
1
0
16 Mar 2021
Memristive Stochastic Computing for Deep Learning Parameter Optimization
Memristive Stochastic Computing for Deep Learning Parameter Optimization
Corey Lammie
Jason K. Eshraghian
Wei D. Lu
M. R. Azghadi
BDL
36
21
0
11 Mar 2021
Quantum machine learning with differential privacy
Quantum machine learning with differential privacy
William Watkins
Samuel Yen-Chi Chen
Shinjae Yoo
95
49
0
10 Mar 2021
Machine Biometrics -- Towards Identifying Machines in a Smart City
  Environment
Machine Biometrics -- Towards Identifying Machines in a Smart City Environment
George K. Sidiropoulos
G. Papakostas
30
2
0
25 Feb 2021
Convergence rates for gradient descent in the training of
  overparameterized artificial neural networks with biases
Convergence rates for gradient descent in the training of overparameterized artificial neural networks with biases
Arnulf Jentzen
T. Kröger
ODL
73
7
0
23 Feb 2021
A Novel Framework for Neural Architecture Search in the Hill Climbing
  Domain
A Novel Framework for Neural Architecture Search in the Hill Climbing Domain
Mudit Verma
Pradyumn Sinha
Karan Goyal
Apoorva Verma
Seba Susan
119
7
0
22 Feb 2021
A proof of convergence for gradient descent in the training of
  artificial neural networks for constant target functions
A proof of convergence for gradient descent in the training of artificial neural networks for constant target functions
Patrick Cheridito
Arnulf Jentzen
Adrian Riekert
Florian Rossmannek
72
25
0
19 Feb 2021
Stochastic Spatio-Temporal Optimization for Control and Co-Design of
  Systems in Robotics and Applied Physics
Stochastic Spatio-Temporal Optimization for Control and Co-Design of Systems in Robotics and Applied Physics
Ethan N. Evans
Andrew P. Kendall
Evangelos A. Theodorou
AI4CE
57
11
0
18 Feb 2021
Momentum Residual Neural Networks
Momentum Residual Neural Networks
Michael E. Sander
Pierre Ablin
Mathieu Blondel
Gabriel Peyré
90
58
0
15 Feb 2021
The Role of Momentum Parameters in the Optimal Convergence of Adaptive
  Polyak's Heavy-ball Methods
The Role of Momentum Parameters in the Optimal Convergence of Adaptive Polyak's Heavy-ball Methods
Wei Tao
Sheng Long
Gao-wei Wu
Qing Tao
43
14
0
15 Feb 2021
Self-Supervised Multisensor Change Detection
Self-Supervised Multisensor Change Detection
Sudipan Saha
Patrick Ebel
Xiaoxiang Zhu
SSL
80
80
0
12 Feb 2021
COVID-19 identification from volumetric chest CT scans using a
  progressively resized 3D-CNN incorporating segmentation, augmentation, and
  class-rebalancing
COVID-19 identification from volumetric chest CT scans using a progressively resized 3D-CNN incorporating segmentation, augmentation, and class-rebalancing
Md. Kamrul Hasan
Md. Tasnim Jawad
Kazi N. Hasan
Sajal Basak Partha
Md. Masum Al Masba
Shumit Saha
77
21
0
11 Feb 2021
Privacy-Preserving Graph Convolutional Networks for Text Classification
Privacy-Preserving Graph Convolutional Networks for Text Classification
Timour Igamberdiev
Ivan Habernal
GNN
85
33
0
10 Feb 2021
IWA: Integrated Gradient based White-box Attacks for Fooling Deep Neural
  Networks
IWA: Integrated Gradient based White-box Attacks for Fooling Deep Neural Networks
Yixiang Wang
Jiqiang Liu
Xiaolin Chang
J. Misic
Vojislav B. Mišić
AAML
69
12
0
03 Feb 2021
Synthetic Dataset Generation of Driver Telematics
Synthetic Dataset Generation of Driver Telematics
Banghee So
J. Boucher
Emiliano A. Valdez
74
25
0
30 Jan 2021
Choice modelling in the age of machine learning -- discussion paper
Choice modelling in the age of machine learning -- discussion paper
Sander van Cranenburgh
S. Wang
A. Vij
Francisco Câmara Pereira
J. Walker
78
97
0
28 Jan 2021
Reverse Derivative Ascent: A Categorical Approach to Learning Boolean
  Circuits
Reverse Derivative Ascent: A Categorical Approach to Learning Boolean Circuits
Paul W. Wilson
Fabio Zanasi
99
15
0
26 Jan 2021
Optimizing Convergence for Iterative Learning of ARIMA for Stationary
  Time Series
Optimizing Convergence for Iterative Learning of ARIMA for Stationary Time Series
K. Styp-Rekowski
Florian Schmidt
O. Kao
AI4TS
30
0
0
25 Jan 2021
Previous
123...789...121314
Next