Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.04747
Cited By
An overview of gradient descent optimization algorithms
15 September 2016
Sebastian Ruder
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An overview of gradient descent optimization algorithms"
50 / 991 papers shown
Title
Humble your Overconfident Networks: Unlearning Overfitting via Sequential Monte Carlo Tempered Deep Ensembles
Andrew Millard
Zheng Zhao
Joshua Murphy
Simon Maskell
UQCV
BDL
2
0
0
16 May 2025
ICE-Pruning: An Iterative Cost-Efficient Pruning Pipeline for Deep Neural Networks
Wenhao Hu
Paul Henderson
José Cano
32
0
0
12 May 2025
Injecting Knowledge Graphs into Large Language Models
Erica Coppolillo
29
0
0
12 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
48
0
0
07 May 2025
Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning
Jiacheng Wang
Le Liang
Hao Ye
Chongtao Guo
Shi Jin
47
0
0
06 May 2025
am-ELO: A Stable Framework for Arena-based LLM Evaluation
Zirui Liu
Jiatong Li
Yan Zhuang
Qiang Liu
Shuanghong Shen
Jie Ouyang
Mingyue Cheng
Shijin Wang
44
1
0
06 May 2025
Cooperative Bayesian and variance networks disentangle aleatoric and epistemic uncertainties
Jiaxiang Yi
Miguel A. Bessa
UD
PER
UQCV
46
0
0
05 May 2025
Sharp higher order convergence rates for the Adam optimizer
Steffen Dereich
Arnulf Jentzen
Adrian Riekert
ODL
61
0
0
28 Apr 2025
Temperature Estimation in Induction Motors using Machine Learning
Dinan Li
Panagiotis Kakosimos
26
2
0
25 Apr 2025
The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks
Nikita Gabdullin
20
0
0
24 Apr 2025
Gradient-Optimized Fuzzy Classifier: A Benchmark Study Against State-of-the-Art Models
Magnus Sieverding
Nathan Steffen
Kelly Cohen
20
0
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
56
0
0
22 Apr 2025
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
Ashkan Shakarami
Yousef Yeganeh
Azade Farshad
Lorenzo Nicolè
Stefano Ghidoni
Nassir Navab
57
0
0
21 Apr 2025
DMPCN: Dynamic Modulated Predictive Coding Network with Hybrid Feedback Representations
A S M Sharifuzzaman Sagar
Yu Chen
Jun Hoong Chan
36
0
0
20 Apr 2025
PC-DeepNet: A GNSS Positioning Error Minimization Framework Using Permutation-Invariant Deep Neural Network
M. Humayun Kabir
Md. Ali Hasan
Md. Shafiqul Islam
Kyeongjun Ko
Wonjae Shin
39
0
0
18 Apr 2025
Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods
Ruinan Jin
Difei Cheng
Hong Qiao
Xin Shi
Shaodong Liu
Bo Zhang
43
0
0
17 Apr 2025
Fine-Grained Rib Fracture Diagnosis with Hyperbolic Embeddings: A Detailed Annotation Framework and Multi-Label Classification Model
Shripad Pate
Aiman Farooq
Suvrankar Datta
Musadiq Aadil Sheikh
Atin Kumar
Deepak Mishra
31
0
0
15 Apr 2025
FATE: A Prompt-Tuning-Based Semi-Supervised Learning Framework for Extremely Limited Labeled Data
Hezhao Liu
Yang Lu
Mengke Li
Yiqun Zhang
Shreyank N Gowda
Chen Gong
Hanzi Wang
34
0
0
14 Apr 2025
DUE: A Deep Learning Framework and Library for Modeling Unknown Equations
Junfeng Chen
Kailiang Wu
D. Xiu
27
0
0
14 Apr 2025
Enhancing knowledge retention for continual learning with domain-specific adapters and features gating
Mohamed Abbas Hedjazi
O. Hadjerci
Adel Hafiane
CLL
34
0
0
11 Apr 2025
LLM-based Automated Grading with Human-in-the-Loop
Hang Li
Yucheng Chu
Kaiqi Yang
Yasemin Copur-Gencturk
Jiliang Tang
AI4Ed
ELM
59
0
0
07 Apr 2025
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Anja Surina
Amin Mansouri
Lars Quaedvlieg
Amal Seddas
Maryna Viazovska
Emmanuel Abbe
Çağlar Gülçehre
38
1
0
07 Apr 2025
Loss Functions in Deep Learning: A Comprehensive Review
Omar Elharrouss
Yasir Mahmood
Yassine Bechqito
Mohamed Adel Serhani
E. Badidi
Jamal Riffi
Hamid Tairi
38
0
0
05 Apr 2025
Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion
Yuxi Mi
Zhizhou Zhong
Y. Huang
Qiuyang Yuan
Xuan Zhao
Jianqing Xu
Shouhong Ding
Shaoming Wang
Rizen Guo
Shuigeng Zhou
DiffM
57
0
0
01 Apr 2025
Towards Efficient Training of Graph Neural Networks: A Multiscale Approach
Eshed Gal
Moshe Eliasof
Carola-Bibiane Schönlieb
Eldad Haber
Eran Treister
GNN
AI4CE
67
1
0
25 Mar 2025
Adaptive Machine Learning for Resource-Constrained Environments
Sebastián A. Cajas Ordóñez
Jaydeep Samanta
Andrés L. Suárez-Cetrulo
Ricardo Simón Carbajo
36
0
0
24 Mar 2025
GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT
Inpyo Hong
Youngwan Jo
Hyojeong Lee
Sunghyun Ahn
Sanghyun Park
MQ
65
0
0
24 Mar 2025
Offline Model-Based Optimization: Comprehensive Review
Minsu Kim
Jiayao Gu
Ye Yuan
Taeyoung Yun
Ziqiang Liu
Yoshua Bengio
Can Chen
OffRL
67
2
0
21 Mar 2025
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Shani Gamrian
Hila Barel
Feiran Li
Masakazu Yoshimura
Daisuke Iso
56
0
0
17 Mar 2025
Value Gradients with Action Adaptive Search Trees in Continuous (PO)MDPs
Idan Lev-Yehudi
Michael Novitsky
Moran Barenboim
Ron Benchetrit
Vadim Indelman
50
0
0
15 Mar 2025
Clothes-Changing Person Re-identification Based On Skeleton Dynamics
Asaf Joseph
Shmuel Peleg
45
1
0
13 Mar 2025
Hamiltonian Neural Networks for Robust Out-of-Time Credit Scoring
Javier Marín
86
0
0
13 Mar 2025
From Centralized to Decentralized Federated Learning: Theoretical Insights, Privacy Preservation, and Robustness Challenges
Qiongxiu Li
Wenrui Yu
Yufei Xia
Jun Pang
FedML
60
1
0
10 Mar 2025
SHAP-Integrated Convolutional Diagnostic Networks for Feature-Selective Medical Analysis
Yan Hu
Ahmad Chaddad
51
0
0
10 Mar 2025
SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic
Yuqing Yang
Wei Wang
Yifei Liu
Linfeng Dong
Hao Wu
Mingxin Zhang
Zhihang Zhong
Xiao-Fu Sun
54
1
0
09 Mar 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
54
0
0
09 Mar 2025
Machine Learned Force Fields: Fundamentals, its reach, and challenges
Carlos A. Vital
Román J. Armenta-Rico
Huziel E. Sauceda
AI4CE
34
0
0
07 Mar 2025
Wanda++: Pruning Large Language Models via Regional Gradients
Yifan Yang
Kai Zhen
Bhavana Ganesh
Aram Galstyan
Goeric Huybrechts
...
S. Bodapati
Nathan Susanj
Zheng Zhang
Jack FitzGerald
Abhishek Kumar
61
0
0
06 Mar 2025
Non-convergence to the optimal risk for Adam and stochastic gradient descent optimization in the training of deep neural networks
Thang Do
Arnulf Jentzen
Adrian Riekert
58
1
0
03 Mar 2025
MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition
Ryosuke Kawamura
Hideaki Hayashi
Noriko Takemura
Hajime Nagahara
CVBM
3DH
65
4
0
28 Feb 2025
A Survey of Link Prediction in Temporal Networks
Jiafeng Xiong
Ahmad Zareie
Rizos Sakellariou
AI4TS
AI4CE
42
1
0
28 Feb 2025
Online Prototypes and Class-Wise Hypergradients for Online Continual Learning with Pre-Trained Models
Nicolas Michel
Maorong Wang
Jiangpeng He
Toshihiko Yamasaki
CLL
59
0
0
26 Feb 2025
Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints
Mihaela C. Stoian
Eleonora Giunchiglia
92
2
0
25 Feb 2025
Swarm Characteristics Classification Using Neural Networks
Donald W. Peltier
Isaac Kaminer
Abram H. Clark
Marko Orescanin
40
1
0
20 Feb 2025
SEW: Self-calibration Enhanced Whole Slide Pathology Image Analysis
Haoming Luo
Xiaotian Yu
Shengxuming Zhang
Jiabin Xia
Yang Jian
...
Liang Xue
Xiuming Zhang
Jing Zhang
Jing Zhang
Zunlei Feng
76
0
0
17 Feb 2025
Preconditioned Inexact Stochastic ADMM for Deep Model
Shenglong Zhou
Ouya Wang
Ziyan Luo
Yongxu Zhu
Geoffrey Ye Li
46
0
0
15 Feb 2025
Multi-level Supervised Contrastive Learning
Naghmeh Ghanooni
Barbod Pajoum
Harshit Rawal
Sophie Fellenz
Vo Nguyen Le Duy
Marius Kloft
86
0
0
04 Feb 2025
Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
Huawei Lin
Jun Woo Chung
Yingjie Lao
Weijie Zhao
51
0
0
03 Feb 2025
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
Xuan Chen
Yuzhou Nie
Wenbo Guo
Xiangyu Zhang
112
10
0
28 Jan 2025
Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts
Wenju Sun
Qingyong Li
Wen Wang
Yangli-ao Geng
Boyang Li
44
3
0
28 Jan 2025
1
2
3
4
...
18
19
20
Next