ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.04747
  4. Cited By
An overview of gradient descent optimization algorithms

An overview of gradient descent optimization algorithms

15 September 2016
Sebastian Ruder
    ODL
ArXivPDFHTML

Papers citing "An overview of gradient descent optimization algorithms"

50 / 991 papers shown
Title
Humble your Overconfident Networks: Unlearning Overfitting via Sequential Monte Carlo Tempered Deep Ensembles
Humble your Overconfident Networks: Unlearning Overfitting via Sequential Monte Carlo Tempered Deep Ensembles
Andrew Millard
Zheng Zhao
Joshua Murphy
Simon Maskell
UQCV
BDL
2
0
0
16 May 2025
ICE-Pruning: An Iterative Cost-Efficient Pruning Pipeline for Deep Neural Networks
ICE-Pruning: An Iterative Cost-Efficient Pruning Pipeline for Deep Neural Networks
Wenhao Hu
Paul Henderson
José Cano
32
0
0
12 May 2025
Injecting Knowledge Graphs into Large Language Models
Injecting Knowledge Graphs into Large Language Models
Erica Coppolillo
29
0
0
12 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
48
0
0
07 May 2025
Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning
Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning
Jiacheng Wang
Le Liang
Hao Ye
Chongtao Guo
Shi Jin
47
0
0
06 May 2025
am-ELO: A Stable Framework for Arena-based LLM Evaluation
am-ELO: A Stable Framework for Arena-based LLM Evaluation
Zirui Liu
Jiatong Li
Yan Zhuang
Qiang Liu
Shuanghong Shen
Jie Ouyang
Mingyue Cheng
Shijin Wang
44
1
0
06 May 2025
Cooperative Bayesian and variance networks disentangle aleatoric and epistemic uncertainties
Cooperative Bayesian and variance networks disentangle aleatoric and epistemic uncertainties
Jiaxiang Yi
Miguel A. Bessa
UD
PER
UQCV
46
0
0
05 May 2025
Sharp higher order convergence rates for the Adam optimizer
Sharp higher order convergence rates for the Adam optimizer
Steffen Dereich
Arnulf Jentzen
Adrian Riekert
ODL
61
0
0
28 Apr 2025
Temperature Estimation in Induction Motors using Machine Learning
Temperature Estimation in Induction Motors using Machine Learning
Dinan Li
Panagiotis Kakosimos
26
2
0
25 Apr 2025
The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks
The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks
Nikita Gabdullin
20
0
0
24 Apr 2025
Gradient-Optimized Fuzzy Classifier: A Benchmark Study Against State-of-the-Art Models
Gradient-Optimized Fuzzy Classifier: A Benchmark Study Against State-of-the-Art Models
Magnus Sieverding
Nathan Steffen
Kelly Cohen
20
0
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
56
0
0
22 Apr 2025
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
Ashkan Shakarami
Yousef Yeganeh
Azade Farshad
Lorenzo Nicolè
Stefano Ghidoni
Nassir Navab
57
0
0
21 Apr 2025
DMPCN: Dynamic Modulated Predictive Coding Network with Hybrid Feedback Representations
DMPCN: Dynamic Modulated Predictive Coding Network with Hybrid Feedback Representations
A S M Sharifuzzaman Sagar
Yu Chen
Jun Hoong Chan
36
0
0
20 Apr 2025
PC-DeepNet: A GNSS Positioning Error Minimization Framework Using Permutation-Invariant Deep Neural Network
PC-DeepNet: A GNSS Positioning Error Minimization Framework Using Permutation-Invariant Deep Neural Network
M. Humayun Kabir
Md. Ali Hasan
Md. Shafiqul Islam
Kyeongjun Ko
Wonjae Shin
39
0
0
18 Apr 2025
Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods
Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods
Ruinan Jin
Difei Cheng
Hong Qiao
Xin Shi
Shaodong Liu
Bo Zhang
43
0
0
17 Apr 2025
Fine-Grained Rib Fracture Diagnosis with Hyperbolic Embeddings: A Detailed Annotation Framework and Multi-Label Classification Model
Fine-Grained Rib Fracture Diagnosis with Hyperbolic Embeddings: A Detailed Annotation Framework and Multi-Label Classification Model
Shripad Pate
Aiman Farooq
Suvrankar Datta
Musadiq Aadil Sheikh
Atin Kumar
Deepak Mishra
31
0
0
15 Apr 2025
FATE: A Prompt-Tuning-Based Semi-Supervised Learning Framework for Extremely Limited Labeled Data
FATE: A Prompt-Tuning-Based Semi-Supervised Learning Framework for Extremely Limited Labeled Data
Hezhao Liu
Yang Lu
Mengke Li
Yiqun Zhang
Shreyank N Gowda
Chen Gong
Hanzi Wang
34
0
0
14 Apr 2025
DUE: A Deep Learning Framework and Library for Modeling Unknown Equations
DUE: A Deep Learning Framework and Library for Modeling Unknown Equations
Junfeng Chen
Kailiang Wu
D. Xiu
27
0
0
14 Apr 2025
Enhancing knowledge retention for continual learning with domain-specific adapters and features gating
Enhancing knowledge retention for continual learning with domain-specific adapters and features gating
Mohamed Abbas Hedjazi
O. Hadjerci
Adel Hafiane
CLL
34
0
0
11 Apr 2025
LLM-based Automated Grading with Human-in-the-Loop
LLM-based Automated Grading with Human-in-the-Loop
Hang Li
Yucheng Chu
Kaiqi Yang
Yasemin Copur-Gencturk
Jiliang Tang
AI4Ed
ELM
59
0
0
07 Apr 2025
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Anja Surina
Amin Mansouri
Lars Quaedvlieg
Amal Seddas
Maryna Viazovska
Emmanuel Abbe
Çağlar Gülçehre
38
1
0
07 Apr 2025
Loss Functions in Deep Learning: A Comprehensive Review
Loss Functions in Deep Learning: A Comprehensive Review
Omar Elharrouss
Yasir Mahmood
Yassine Bechqito
Mohamed Adel Serhani
E. Badidi
Jamal Riffi
Hamid Tairi
38
0
0
05 Apr 2025
Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion
Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion
Yuxi Mi
Zhizhou Zhong
Y. Huang
Qiuyang Yuan
Xuan Zhao
Jianqing Xu
Shouhong Ding
Shaoming Wang
Rizen Guo
Shuigeng Zhou
DiffM
57
0
0
01 Apr 2025
Towards Efficient Training of Graph Neural Networks: A Multiscale Approach
Towards Efficient Training of Graph Neural Networks: A Multiscale Approach
Eshed Gal
Moshe Eliasof
Carola-Bibiane Schönlieb
Eldad Haber
Eran Treister
GNN
AI4CE
67
1
0
25 Mar 2025
Adaptive Machine Learning for Resource-Constrained Environments
Adaptive Machine Learning for Resource-Constrained Environments
Sebastián A. Cajas Ordóñez
Jaydeep Samanta
Andrés L. Suárez-Cetrulo
Ricardo Simón Carbajo
36
0
0
24 Mar 2025
GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT
GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT
Inpyo Hong
Youngwan Jo
Hyojeong Lee
Sunghyun Ahn
Sanghyun Park
MQ
65
0
0
24 Mar 2025
Offline Model-Based Optimization: Comprehensive Review
Offline Model-Based Optimization: Comprehensive Review
Minsu Kim
Jiayao Gu
Ye Yuan
Taeyoung Yun
Ziqiang Liu
Yoshua Bengio
Can Chen
OffRL
67
2
0
21 Mar 2025
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Shani Gamrian
Hila Barel
Feiran Li
Masakazu Yoshimura
Daisuke Iso
56
0
0
17 Mar 2025
Value Gradients with Action Adaptive Search Trees in Continuous (PO)MDPs
Value Gradients with Action Adaptive Search Trees in Continuous (PO)MDPs
Idan Lev-Yehudi
Michael Novitsky
Moran Barenboim
Ron Benchetrit
Vadim Indelman
50
0
0
15 Mar 2025
Clothes-Changing Person Re-identification Based On Skeleton Dynamics
Asaf Joseph
Shmuel Peleg
45
1
0
13 Mar 2025
Hamiltonian Neural Networks for Robust Out-of-Time Credit Scoring
Hamiltonian Neural Networks for Robust Out-of-Time Credit Scoring
Javier Marín
86
0
0
13 Mar 2025
From Centralized to Decentralized Federated Learning: Theoretical Insights, Privacy Preservation, and Robustness Challenges
Qiongxiu Li
Wenrui Yu
Yufei Xia
Jun Pang
FedML
60
1
0
10 Mar 2025
SHAP-Integrated Convolutional Diagnostic Networks for Feature-Selective Medical Analysis
Yan Hu
Ahmad Chaddad
51
0
0
10 Mar 2025
SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic
Yuqing Yang
Wei Wang
Yifei Liu
Linfeng Dong
Hao Wu
Mingxin Zhang
Zhihang Zhong
Xiao-Fu Sun
54
1
0
09 Mar 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
54
0
0
09 Mar 2025
Machine Learned Force Fields: Fundamentals, its reach, and challenges
Carlos A. Vital
Román J. Armenta-Rico
Huziel E. Sauceda
AI4CE
34
0
0
07 Mar 2025
Wanda++: Pruning Large Language Models via Regional Gradients
Wanda++: Pruning Large Language Models via Regional Gradients
Yifan Yang
Kai Zhen
Bhavana Ganesh
Aram Galstyan
Goeric Huybrechts
...
S. Bodapati
Nathan Susanj
Zheng Zhang
Jack FitzGerald
Abhishek Kumar
61
0
0
06 Mar 2025
Non-convergence to the optimal risk for Adam and stochastic gradient descent optimization in the training of deep neural networks
Thang Do
Arnulf Jentzen
Adrian Riekert
58
1
0
03 Mar 2025
MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition
Ryosuke Kawamura
Hideaki Hayashi
Noriko Takemura
Hajime Nagahara
CVBM
3DH
65
4
0
28 Feb 2025
A Survey of Link Prediction in Temporal Networks
A Survey of Link Prediction in Temporal Networks
Jiafeng Xiong
Ahmad Zareie
Rizos Sakellariou
AI4TS
AI4CE
42
1
0
28 Feb 2025
Online Prototypes and Class-Wise Hypergradients for Online Continual Learning with Pre-Trained Models
Online Prototypes and Class-Wise Hypergradients for Online Continual Learning with Pre-Trained Models
Nicolas Michel
Maorong Wang
Jiangpeng He
Toshihiko Yamasaki
CLL
59
0
0
26 Feb 2025
Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints
Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints
Mihaela C. Stoian
Eleonora Giunchiglia
92
2
0
25 Feb 2025
Swarm Characteristics Classification Using Neural Networks
Swarm Characteristics Classification Using Neural Networks
Donald W. Peltier
Isaac Kaminer
Abram H. Clark
Marko Orescanin
40
1
0
20 Feb 2025
SEW: Self-calibration Enhanced Whole Slide Pathology Image Analysis
SEW: Self-calibration Enhanced Whole Slide Pathology Image Analysis
Haoming Luo
Xiaotian Yu
Shengxuming Zhang
Jiabin Xia
Yang Jian
...
Liang Xue
Xiuming Zhang
Jing Zhang
Jing Zhang
Zunlei Feng
76
0
0
17 Feb 2025
Preconditioned Inexact Stochastic ADMM for Deep Model
Shenglong Zhou
Ouya Wang
Ziyan Luo
Yongxu Zhu
Geoffrey Ye Li
46
0
0
15 Feb 2025
Multi-level Supervised Contrastive Learning
Multi-level Supervised Contrastive Learning
Naghmeh Ghanooni
Barbod Pajoum
Harshit Rawal
Sophie Fellenz
Vo Nguyen Le Duy
Marius Kloft
86
0
0
04 Feb 2025
Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
Huawei Lin
Jun Woo Chung
Yingjie Lao
Weijie Zhao
51
0
0
03 Feb 2025
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
Xuan Chen
Yuzhou Nie
Wenbo Guo
Xiangyu Zhang
112
10
0
28 Jan 2025
Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts
Wenju Sun
Qingyong Li
Wen Wang
Yangli-ao Geng
Boyang Li
44
3
0
28 Jan 2025
1234...181920
Next