Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.04747
Cited By
An overview of gradient descent optimization algorithms
15 September 2016
Sebastian Ruder
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An overview of gradient descent optimization algorithms"
50 / 996 papers shown
Title
A Review of Differentiable Simulators
Rhys Newbury
Jack Collins
Kerry He
Jiahe Pan
Ingmar Posner
David Howard
Akansel Cosgun
AI4CE
49
9
0
08 Jul 2024
Entropy-Informed Weighting Channel Normalizing Flow
Wei Chen
Shian Du
Shigui Li
Delu Zeng
John Paisley
37
0
0
06 Jul 2024
Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural Networks
Amit Peleg
Matthias Hein
39
0
0
04 Jul 2024
Venomancer: Towards Imperceptible and Target-on-Demand Backdoor Attacks in Federated Learning
Son Nguyen
Thinh Nguyen
Khoa D. Doan
Kok-Seng Wong
FedML
AAML
32
0
0
03 Jul 2024
AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition
Fadi Boutros
Vitomir Štruc
Naser Damer
49
2
0
01 Jul 2024
BADM: Batch ADMM for Deep Learning
Ouya Wang
Shenglong Zhou
Geoffrey Ye Li
ODL
50
1
0
30 Jun 2024
Multi-agent Cooperative Games Using Belief Map Assisted Training
Qinwei Huang
Chen Luo
Alex B. Wu
Simon Khan
Hai Helen Li
Qinru Qiu
36
0
0
27 Jun 2024
Semi-adaptive Synergetic Two-way Pseudoinverse Learning System
Binghong Liu
Ziqi Zhao
Shupan Li
Ke Wang
24
0
0
27 Jun 2024
DataStates-LLM: Lazy Asynchronous Checkpointing for Large Language Models
Avinash Maurya
Robert Underwood
M. Rafique
Franck Cappello
Bogdan Nicolae
21
14
0
15 Jun 2024
Gradient-based Learning in State-based Potential Games for Self-Learning Production Systems
Steve Yuwono
Marlon Löppenberg
Dorothea Schwung
Andreas Schwung
34
2
0
14 Jun 2024
MEMO-QCD: Quantum Density Estimation through Memetic Optimisation for Quantum Circuit Design
Juan E. Ardila-García
Vladimir Vargas-Calderón
Fabio A. González
Diego H. Useche
Herbert Vinck-Posada
43
1
0
12 Jun 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Shiji Song
Yuan Yao
Gao Huang
32
14
0
08 Jun 2024
Batch-in-Batch: a new adversarial training framework for initial perturbation and sample selection
Yinting Wu
Pai Peng
Bo Cai
Le Li
.
AAML
39
0
0
06 Jun 2024
Kirigami: large convolutional kernels improve deep learning-based RNA secondary structure prediction
Marc Harary
Chengxin Zhang
29
0
0
04 Jun 2024
Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning
Leonardo Iurada
Marco Ciccone
Tatiana Tommasi
36
3
0
03 Jun 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li
Jiacheng Zhang
Jichang Li
Ge Li
Si Liu
Liang Lin
Guanbin Li
ObjD
VLM
50
13
0
01 Jun 2024
A Survey of Latent Factor Models in Recommender Systems
Hind I. Alshbanat
Hafida Benhidour
Said Kerrache
LRM
BDL
43
1
0
28 May 2024
AdaFisher: Adaptive Second Order Optimization via Fisher Information
Damien Martins Gomes
Yanlei Zhang
Eugene Belilovsky
Guy Wolf
Mahdi S. Hosseini
ODL
78
2
0
26 May 2024
GeoAdaLer: Geometric Insights into Adaptive Stochastic Gradient Descent Algorithms
Chinedu Eleh
Masuzyo Mwanza
Ekene S. Aguegboh
Hans-Werner van Wyk
21
0
0
25 May 2024
Federated Learning for Non-factorizable Models using Deep Generative Prior Approximations
Conor Hassan
Joshua J Bon
Elizaveta Semenova
Antonietta Mira
Kerrie Mengersen
26
0
0
25 May 2024
SimPO: Simple Preference Optimization with a Reference-Free Reward
Yu Meng
Mengzhou Xia
Danqi Chen
68
358
0
23 May 2024
Automatic Differentiation is Essential in Training Neural Networks for Solving Differential Equations
Chuqi Chen
Yahong Yang
Yang Xiang
Wenrui Hao
26
2
0
23 May 2024
Visualizing, Rethinking, and Mining the Loss Landscape of Deep Neural Networks
Xin-Chun Li
Lan Li
De-Chuan Zhan
41
2
0
21 May 2024
Review of deep learning models for crypto price prediction: implementation and evaluation
Jingyang Wu
Xinyi Zhang
Fangyixuan Huang
Haochen Zhou
Rohtiash Chandra
37
3
0
19 May 2024
Nonparametric Teaching of Implicit Neural Representations
Chen Zhang
Steven Tin Sui Luo
Jason Chun Lok Li
Yik-Chung Wu
Ngai Wong
46
2
0
17 May 2024
Deep Multi-Task Learning for Malware Image Classification
A. Bensaoud
Jugal Kalita
27
33
0
09 May 2024
Custom Gradient Estimators are Straight-Through Estimators in Disguise
Matt Schoenbauer
Daniele Moro
Lukasz Lew
Andrew G. Howard
MQ
30
3
0
08 May 2024
Collage: Light-Weight Low-Precision Strategy for LLM Training
Tao Yu
Gaurav Gupta
Karthick Gopalswamy
Amith R. Mamidala
Hao Zhou
Jeffrey Huynh
Youngsuk Park
Ron Diamant
Anoop Deoras
Jun Huan
MQ
59
3
0
06 May 2024
Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data
Leonhard Hennicke
C. Adriano
Holger Giese
Jan Mathias Koehler
Lukas Schott
DiffM
55
2
0
06 May 2024
Better YOLO with Attention-Augmented Network and Enhanced Generalization Performance for Safety Helmet Detection
Shuqi Shen
Junjie Yang
38
2
0
04 May 2024
A Full Adagrad algorithm with O(Nd) operations
Antoine Godichon-Baggioni
Wei Lu
Bruno Portier
ODL
54
0
0
03 May 2024
Recovering Labels from Local Updates in Federated Learning
Huancheng Chen
H. Vikalo
FedML
AAML
35
4
0
02 May 2024
WHALE-FL: Wireless and Heterogeneity Aware Latency Efficient Federated Learning over Mobile Devices via Adaptive Subnetwork Scheduling
Huai-an Su
Jiaxiang Geng
Liang Li
Xiaoqi Qin
Yanzhao Hou
Xin Fu
Miao Pan
Miao Pan
40
1
0
01 May 2024
BUFF: Boosted Decision Tree based Ultra-Fast Flow matching
Cheng Jiang
Sitian Qian
Huilin Qu
27
1
0
28 Apr 2024
pFedAFM: Adaptive Feature Mixture for Batch-Level Personalization in Heterogeneous Federated Learning
Liping Yi
Han Yu
Chao Ren
Heng-Ming Zhang
Gang Wang
Xiaoguang Liu
Xiaoxiao Li
35
2
0
27 Apr 2024
ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing
Zhongze Wang
Haitao Zhao
Jingchao Peng
Lujian Yao
Kaijie Zhao
34
8
0
27 Apr 2024
An automatic mixing speech enhancement system for multi-track audio
Xiaojing Liu
Angeliki Mourgela
Hongwei Ai
Joshua D. Reiss
19
1
0
27 Apr 2024
Incorporating Gradients to Rules: Towards Lightweight, Adaptive Provenance-based Intrusion Detection
Lingzhi Wang
Xiangmin Shen
Weijian Li
Zhenyuan Li
R. Sekar
Han Liu
Yan Chen
AAML
28
1
0
23 Apr 2024
Collaborative Visual Place Recognition through Federated Learning
Mattia Dutto
Gabriele Berton
Debora Caldarola
Eros Fani
Gabriele Trivigno
Carlo Masone
FedML
32
1
0
20 Apr 2024
Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming
Jie Wang
Zhihai Wang
Xijun Li
Yufei Kuang
Zhihao Shi
Fangzhou Zhu
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
56
7
0
19 Apr 2024
Implementation and Evaluation of a Gradient Descent-Trained Defensible Blackboard Architecture System
Jordan Milbrath
Jonathan Rivard
Jeremy Straub
17
1
0
17 Apr 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
57
0
0
16 Apr 2024
Contrastive Mean-Shift Learning for Generalized Category Discovery
Sua Choi
Dahyun Kang
Minsu Cho
29
10
0
15 Apr 2024
Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector
Bach Ha
Birgit Schalter
Laura White
J. Köhler
ObjD
AI4CE
32
2
0
09 Apr 2024
Unified Entropy Optimization for Open-Set Test-Time Adaptation
Zhengqing Gao
Xu-Yao Zhang
Cheng-Lin Liu
TTA
37
5
0
09 Apr 2024
Dynamical stability and chaos in artificial neural network trajectories along training
Kaloyan Danovski
Miguel C. Soriano
Lucas Lacasa
43
7
0
08 Apr 2024
Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Hao Ma
Melanie Zeilinger
Michael Muehlebach
62
0
0
08 Apr 2024
Exploiting Preference Elicitation in Interactive and User-centered Algorithmic Recourse: An Initial Exploration
Seyedehdelaram Esfahani
Giovanni De Toni
Bruno Lepri
Andrea Passerini
Katya Tentori
Massimo Zancanaro
27
5
0
08 Apr 2024
Optimizing Quantum Convolutional Neural Network Architectures for Arbitrary Data Dimension
Changwon Lee
Israel F. Araujo
Dongha Kim
Junghan Lee
Siheon Park
Ju-Young Ryu
Daniel K. Park
32
1
0
28 Mar 2024
MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering
Guoxing Sun
Rishabh Dabral
Pascal Fua
Christian Theobalt
Marc Habermann
3DH
56
4
0
27 Mar 2024
Previous
1
2
3
4
5
...
18
19
20
Next