ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.04747
  4. Cited By
An overview of gradient descent optimization algorithms
v1v2 (latest)

An overview of gradient descent optimization algorithms

15 September 2016
Sebastian Ruder
    ODL
ArXiv (abs)PDFHTML

Papers citing "An overview of gradient descent optimization algorithms"

50 / 697 papers shown
Title
Quantum Langevin Dynamics for Optimization
Quantum Langevin Dynamics for Optimization
Zherui Chen
Yuchen Lu
Hao Wang
Yizhou Liu
Tongyang Li
AI4CE
146
11
0
27 Nov 2023
Filtered Partial Differential Equations: a robust surrogate constraint
  in physics-informed deep learning framework
Filtered Partial Differential Equations: a robust surrogate constraint in physics-informed deep learning framework
Dashan Zhang
Yuntian Chen
Shiyi Chen
AI4CE
82
2
0
07 Nov 2023
DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision
  Model and Feature Mixing
DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing
Gaoshuang Huang
Yang Zhou
Xiaofei Hu
Chenglong Zhang
Luying Zhao
Wenjian Gan
Mingbo Hou
48
3
0
01 Nov 2023
Enhancing Deep Neural Network Training Efficiency and Performance
  through Linear Prediction
Enhancing Deep Neural Network Training Efficiency and Performance through Linear Prediction
Hejie Ying
Mengmeng Song
Yaohong Tang
S. Xiao
Zimin Xiao
73
10
0
17 Oct 2023
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
Kai Lv
Hang Yan
Qipeng Guo
Haijun Lv
Xipeng Qiu
ODL
90
23
0
16 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General
  Sequential Decision Scenarios
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
139
15
0
12 Oct 2023
Unsupervised Representations Improve Supervised Learning in Speech
  Emotion Recognition
Unsupervised Representations Improve Supervised Learning in Speech Emotion Recognition
Amirali Soltani Tehrani
Niloufar Faridani
Ramin Toosi
SSL
40
3
0
22 Sep 2023
Neural Network Exemplar Parallelization with Go
Neural Network Exemplar Parallelization with Go
Georg Wiesinger
Erich Schikuta
MoE
30
0
0
15 Sep 2023
Stochastic Gradient Descent-like relaxation is equivalent to Metropolis
  dynamics in discrete optimization and inference problems
Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems
Maria Chiara Angelini
A. Cavaliere
Raffaele Marino
F. Ricci-Tersenghi
124
5
0
11 Sep 2023
Proof of Deep Learning: Approaches, Challenges, and Future Directions
Proof of Deep Learning: Approaches, Challenges, and Future Directions
Mahmoud Salhab
Khaleel W. Mershad
71
1
0
31 Aug 2023
Multilayer Multiset Neuronal Networks -- MMNNs
Multilayer Multiset Neuronal Networks -- MMNNs
Alexandre Benatti
L. D. F. Costa
57
1
0
28 Aug 2023
Stable Adam Optimization for 16-bit Neural Networks Training
Juyoung Yun
25
1
0
30 Jul 2023
Cross-dimensional transfer learning in medical image segmentation with
  deep learning
Cross-dimensional transfer learning in medical image segmentation with deep learning
Hicham Messaoudi
Ahror Belaid
Douraied BEN SALEM
Pierre-Henri Conze
MedIm
91
27
0
29 Jul 2023
Convergence of Adam for Non-convex Objectives: Relaxed Hyperparameters
  and Non-ergodic Case
Convergence of Adam for Non-convex Objectives: Relaxed Hyperparameters and Non-ergodic Case
Meixuan He
Yuqing Liang
Jinlan Liu
Dongpo Xu
87
9
0
20 Jul 2023
Sig-Splines: universal approximation and convex calibration of time
  series generative models
Sig-Splines: universal approximation and convex calibration of time series generative models
Magnus Wiese
Phillip Murray
R. Korn
AI4TS
151
1
0
19 Jul 2023
Learning Differentiable Logic Programs for Abstract Visual Reasoning
Learning Differentiable Logic Programs for Abstract Visual Reasoning
Hikaru Shindo
Viktor Pfanschilling
Devendra Singh Dhami
Kristian Kersting
NAI
87
9
0
03 Jul 2023
The Deep Arbitrary Polynomial Chaos Neural Network or how Deep
  Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos
  Theory
The Deep Arbitrary Polynomial Chaos Neural Network or how Deep Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos Theory
S. Oladyshkin
T. Praditia
Ilja Kroker
F. Mohammadi
Wolfgang Nowak
S. Otte
AI4CE
50
5
0
26 Jun 2023
Comparing Deep Learning Models for the Task of Volatility Prediction
  Using Multivariate Data
Comparing Deep Learning Models for the Task of Volatility Prediction Using Multivariate Data
Wenbo Ge
Pooia Lalbakhsh
Leigh Isai
Artem Lenskiy
Hanna Suominen
OOD
34
3
0
20 Jun 2023
Schema-learning and rebinding as mechanisms of in-context learning and
  emergence
Schema-learning and rebinding as mechanisms of in-context learning and emergence
Siva K. Swaminathan
Antoine Dedieu
Rajkumar Vasudeva Raju
Murray Shanahan
Miguel Lazaro-Gredilla
Dileep George
101
14
0
16 Jun 2023
In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities
  Detection
In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities Detection
H. Nguyen
Thinh B. Lam
Quan D.D. Tran
M. T. Nguyen
Dat T. Chung
V. Q. Dinh
71
8
0
12 Jun 2023
Nonparametric Iterative Machine Teaching
Nonparametric Iterative Machine Teaching
Chen Zhang
Xiaofeng Cao
Weiyang Liu
Ivor Tsang
James T. Kwok
101
8
0
05 Jun 2023
ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative
  Neural Radiance Fields
ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields
Kanghyeok Ko
Minhyeok Lee
81
2
0
05 Jun 2023
Neuronal Cell Type Classification using Deep Learning
Neuronal Cell Type Classification using Deep Learning
Ofek Ophir
Orit Shefi
Ofir Lindenbaum
81
3
0
01 Jun 2023
Bayesian inference and neural estimation of acoustic wave propagation
Bayesian inference and neural estimation of acoustic wave propagation
Yongchao Huang
Yuhang He
Hong Ge
65
0
0
28 May 2023
The Evolution of Distributed Systems for Graph Neural Networks and their
  Origin in Graph Processing and Deep Learning: A Survey
The Evolution of Distributed Systems for Graph Neural Networks and their Origin in Graph Processing and Deep Learning: A Survey
Jana Vatter
R. Mayer
Hans-Arno Jacobsen
GNNAI4TSAI4CE
98
29
0
23 May 2023
GraVAC: Adaptive Compression for Communication-Efficient Distributed DL
  Training
GraVAC: Adaptive Compression for Communication-Efficient Distributed DL Training
S. Tyagi
Martin Swany
81
5
0
20 May 2023
Dragon-Alpha&cu32: A Java-based Tensor Computing Framework With its
  High-Performance CUDA Library
Dragon-Alpha&cu32: A Java-based Tensor Computing Framework With its High-Performance CUDA Library
Zhiyi Zhang
Pengfei Zhang
Qi Wang
52
1
0
15 May 2023
Online Learning Under A Separable Stochastic Approximation Framework
Online Learning Under A Separable Stochastic Approximation Framework
Min Gan
Xiang-Xiang Su
Guang-yong Chen
Jing Chen
66
0
0
12 May 2023
Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare
  Species
Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species
Tayfun Karaderi
T. Burghardt
R. Morard
D. Schmidt
93
1
0
11 May 2023
LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial
  Expression Recognition
LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Fuyan Ma
Bin Sun
Shutao Li
ViT
72
21
0
05 May 2023
Universal Adversarial Backdoor Attacks to Fool Vertical Federated
  Learning in Cloud-Edge Collaboration
Universal Adversarial Backdoor Attacks to Fool Vertical Federated Learning in Cloud-Edge Collaboration
Peng Chen
Xin Du
Zhihui Lu
Hongfeng Chai
FedMLAAML
98
11
0
22 Apr 2023
Benchmarking Low-Shot Robustness to Natural Distribution Shifts
Benchmarking Low-Shot Robustness to Natural Distribution Shifts
Aaditya K. Singh
Kartik Sarangmath
Prithvijit Chattopadhyay
Judy Hoffman
OOD
90
1
0
21 Apr 2023
Bayesian neural networks via MCMC: a Python-based tutorial
Bayesian neural networks via MCMC: a Python-based tutorial
Rohitash Chandra
Royce Chen
Joshua Simmons
BDL
120
11
0
02 Apr 2023
Random Weights Networks Work as Loss Prior Constraint for Image
  Restoration
Random Weights Networks Work as Loss Prior Constraint for Image Restoration
Man Zhou
Naishan Zheng
Jie Huang
Xiangyu Rui
Chunle Guo
Deyu Meng
Chongyi Li
Liang Feng
118
0
0
29 Mar 2023
Searching for long faint astronomical high energy transients: a data
  driven approach
Searching for long faint astronomical high energy transients: a data driven approach
Riccardo Crupi
G. Dilillo
Kester Ward
E. Bissaldi
F. Fiore
A. Vacchi
56
6
0
28 Mar 2023
Architecturing Binarized Neural Networks for Traffic Sign Recognition
Architecturing Binarized Neural Networks for Traffic Sign Recognition
Andreea Postovan
Madalina Erascu
39
4
0
27 Mar 2023
Reimagining Application User Interface (UI) Design using Deep Learning
  Methods: Challenges and Opportunities
Reimagining Application User Interface (UI) Design using Deep Learning Methods: Challenges and Opportunities
Subtain Malik
M. T. Saeed
Marya Jabeen Zia
S. Rasool
Liaquat A. Khan
Mian Ilyas Ahmed
AI4TSAI4CE
36
3
0
23 Mar 2023
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D
  Recognition
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
Deepti Hegde
Jeya Maria Jose Valanarasu
Vishal M. Patel
CLIP
122
68
0
20 Mar 2023
SSGD: A smartphone screen glass dataset for defect detection
SSGD: A smartphone screen glass dataset for defect detection
Haonan Han
Rui Yang
Shuyan Li
R. Hu
Xiu Li
93
13
0
12 Mar 2023
Error mitigation of entangled states using brainbox quantum autoencoders
Error mitigation of entangled states using brainbox quantum autoencoders
Joséphine Pazem
M. Ansari
70
4
0
02 Mar 2023
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning
  Rate and Momentum for Training Deep Neural Networks
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks
Hao Sun
Li Shen
Qihuang Zhong
Liang Ding
Shi-Yong Chen
Jingwei Sun
Jing Li
Guangzhong Sun
Dacheng Tao
98
34
0
01 Mar 2023
Edge-Based Detection and Localization of Adversarial Oscillatory Load
  Attacks Orchestrated By Compromised EV Charging Stations
Edge-Based Detection and Localization of Adversarial Oscillatory Load Attacks Orchestrated By Compromised EV Charging Stations
Khaled Sarieddine
M. Sayed
Sadegh Torabi
Ribal Atallah
C. Assi
28
18
0
24 Feb 2023
Neural networks for learning personality traits from natural language
Neural networks for learning personality traits from natural language
Giorgia Adorni
GNN
34
0
0
23 Feb 2023
Evolving Deep Neural Network by Customized Moth Flame Optimization
  Algorithm for Underwater Targets Recognition
Evolving Deep Neural Network by Customized Moth Flame Optimization Algorithm for Underwater Targets Recognition
M. Khishe
M. Mohammadi
Tarik Ahmed Rashid
Hoger Mahmud
Seyedali Mirjalili
50
4
0
16 Feb 2023
Verifying Generalization in Deep Learning
Verifying Generalization in Deep Learning
Guy Amir
Osher Maayan
Tom Zelazny
Guy Katz
Michael Schapira
AAMLAI4CE
81
15
0
11 Feb 2023
Text recognition on images using pre-trained CNN
Text recognition on images using pre-trained CNN
Afgani Fajar Rizky
N. Yudistira
Edy Santoso
VLM
76
4
0
10 Feb 2023
PINN Training using Biobjective Optimization: The Trade-off between Data
  Loss and Residual Loss
PINN Training using Biobjective Optimization: The Trade-off between Data Loss and Residual Loss
Fabian Heldmann
Sarah Treibert
Matthias Ehrhardt
K. Klamroth
79
22
0
03 Feb 2023
Eloss in the way: A Sensitive Input Quality Metrics for Intelligent
  Driving
Eloss in the way: A Sensitive Input Quality Metrics for Intelligent Driving
Hao-Ting Yang
Shiyan Zhang
Zhuo Yang
Xinyu Zhang
48
0
0
02 Feb 2023
Adapting Step-size: A Unified Perspective to Analyze and Improve
  Gradient-based Methods for Adversarial Attacks
Adapting Step-size: A Unified Perspective to Analyze and Improve Gradient-based Methods for Adversarial Attacks
Wei Tao
Lei Bao
Long Sheng
Gao-wei Wu
Qing Tao
AAML
61
1
0
27 Jan 2023
Multi-limb Split Learning for Tumor Classification on Vertically
  Distributed Data
Multi-limb Split Learning for Tumor Classification on Vertically Distributed Data
Omar S. Ads
Mayar M. Alfares
Mohammed Abdel-Megeed Salem
67
10
0
27 Jan 2023
Previous
123456...121314
Next