Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.04747
Cited By
An overview of gradient descent optimization algorithms
15 September 2016
Sebastian Ruder
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An overview of gradient descent optimization algorithms"
50 / 1,006 papers shown
Title
Node Classification via Semantic-Structural Attention-Enhanced Graph Convolutional Networks
Hongyin Zhu
GNN
43
0
0
24 Mar 2024
Depth Estimation fusing Image and Radar Measurements with Uncertain Directions
Masaya Kotani
Takeru Oba
Norimichi Ukita
MDE
26
0
0
23 Mar 2024
An In-Depth Analysis of Data Reduction Methods for Sustainable Deep Learning
Víctor Toscano-Durán
Javier Perera-Lago
Eduardo Paluzo-Hidalgo
Rocio Gonzalez-Diaz
Miguel A. Gutiérrez-Naranjo
Matteo Rucco
32
1
0
22 Mar 2024
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection
Jiaming Li
Xiangru Lin
Wei Zhang
Xiao Tan
Yingying Li
Junyu Han
Errui Ding
Jingdong Wang
Guanbin Li
42
8
0
22 Mar 2024
Optimal Flow Matching: Learning Straight Trajectories in Just One Step
Nikita Kornilov
Petr Mokrov
Alexander Gasnikov
Alexander Korotin
34
11
0
19 Mar 2024
The Power of Few: Accelerating and Enhancing Data Reweighting with Coreset Selection
Mohammad Jafari
Yimeng Zhang
Yihua Zhang
Sijia Liu
41
2
0
18 Mar 2024
Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM
Linyu Tang
Lei Zhang
AAML
35
3
0
18 Mar 2024
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation
Minh-Triet Tran
Winston Bounsavy
Khoa T. Vo
Anh Nguyen
Tri Minh Nguyen
Ngan Le
ViT
32
2
0
18 Mar 2024
YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images
Chun-Tse Chien
Ruikang Ju
Kuang-Yi Chou
Jen-Shiun Chiang
MedIm
32
38
0
17 Mar 2024
CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations
Yuwei Zhang
Yan Wu
Yanming Liu
Xinyue Peng
51
5
0
17 Mar 2024
A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques
Xuetong Li
Yuan Gao
Hong Chang
Danyang Huang
Yingying Ma
...
Ke Xu
Jing Zhou
Xuening Zhu
Yingqiu Zhu
Hansheng Wang
44
7
0
17 Mar 2024
Batch-oriented Element-wise Approximate Activation for Privacy-Preserving Neural Networks
Peng Zhang
Ao Duan
Xianglu Zou
Yuhong Liu
21
0
0
16 Mar 2024
Few-Shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt
Chenxi Liu
Zhenyi Wang
Tianyi Xiong
Ruibo Chen
Yihan Wu
Junfeng Guo
Heng-Chiao Huang
CLL
50
8
0
14 Mar 2024
On the Convergence of Locally Adaptive and Scalable Diffusion-Based Sampling Methods for Deep Bayesian Neural Network Posteriors
Tim Rensmeyer
Oliver Niggemann
UQCV
BDL
OOD
MedIm
38
0
0
13 Mar 2024
SoK: Reducing the Vulnerability of Fine-tuned Language Models to Membership Inference Attacks
Guy Amit
Abigail Goldsteen
Ariel Farkash
AAML
19
6
0
13 Mar 2024
Experimental Comparison of Ensemble Methods and Time-to-Event Analysis Models Through Integrated Brier Score and Concordance Index
Camila Fernandez
Chung Shue Chen
Pierre Gaillard
Alonso Silva
27
1
0
12 Mar 2024
Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models
Esmaeil Seraj
Walter Talamonti
30
0
0
10 Mar 2024
Evidence, Definitions and Algorithms regarding the Existence of Cohesive-Convergence Groups in Neural Network Optimization
Thien An L. Nguyen
14
0
0
08 Mar 2024
LLM4Decompile: Decompiling Binary Code with Large Language Models
Hanzhuo Tan
Qi Luo
Jing Li
Yuqun Zhang
SyDa
ELM
65
19
0
08 Mar 2024
Privacy in Cloud Computing through Immersion-based Coding
H. Hayati
N. van de Wouw
C. Murguia
27
1
0
07 Mar 2024
GRAWA: Gradient-based Weighted Averaging for Distributed Training of Deep Learning Models
Tolga Dimlioglu
A. Choromańska
47
3
0
07 Mar 2024
OCD-FL: A Novel Communication-Efficient Peer Selection-based Decentralized Federated Learning
Nizar Masmoudi
Wael Jaafar
26
2
0
06 Mar 2024
Gradient-based Discrete Sampling with Automatic Cyclical Scheduling
Patrick Pynadath
Riddhiman Bhattacharya
Arun Hariharan
Ruqi Zhang
41
4
0
27 Feb 2024
Revisiting Convergence of AdaGrad with Relaxed Assumptions
Yusu Hong
Junhong Lin
28
12
0
21 Feb 2024
Offline Training of Language Model Agents with Functions as Learnable Weights
Shaokun Zhang
Jieyu Zhang
Jiale Liu
Linxin Song
Chi Wang
Ranjay Krishna
Qingyun Wu
LLMAG
LM&Ro
AIFin
43
12
0
17 Feb 2024
Fusing Individualized Treatment Rules Using Secondary Outcomes
Daiqi Gao
Yuanjia Wang
Donglin Zeng
27
0
0
13 Feb 2024
Preconditioners for the Stochastic Training of Implicit Neural Representations
Shin-Fang Chng
Hemanth Saratchandran
Simon Lucey
26
0
0
13 Feb 2024
Peeking Behind the Curtains of Residual Learning
Tunhou Zhang
Feng Yan
Hai Helen Li
Yiran Chen
19
0
0
13 Feb 2024
Proof-of-concept: Using ChatGPT to Translate and Modernize an Earth System Model from Fortran to Python/JAX
Anthony Zhou
Linnia Hawkins
Pierre Gentine
16
1
0
13 Feb 2024
Bayesian Deep Learning Via Expectation Maximization and Turbo Deep Approximate Message Passing
Wei Xu
An Liu
Yiting Zhang
Vincent Lau
BDL
25
1
0
12 Feb 2024
Data Distribution-based Curriculum Learning
Shonal Chaudhry
Anuraganand Sharma
21
1
0
12 Feb 2024
Non-convergence to global minimizers for Adam and stochastic gradient descent optimization and constructions of local minimizers in the training of artificial neural networks
Arnulf Jentzen
Adrian Riekert
41
4
0
07 Feb 2024
Two Trades is not Baffled: Condensing Graph via Crafting Rational Gradient Matching
Tianle Zhang
Yuchen Zhang
Kun Wang
Kai Wang
Beining Yang
Kaipeng Zhang
Wenqi Shao
Ping Liu
Qiufeng Wang
Yang You
DD
73
13
0
07 Feb 2024
Closing the Gap Between SGP4 and High-Precision Propagation via Differentiable Programming
Giacomo Acciarini
Atilim Gunecs Baydin
Dario Izzo
13
4
0
07 Feb 2024
How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data
Mihaela C. Stoian
Salijona Dyrmishi
Maxime Cordy
Thomas Lukasiewicz
Eleonora Giunchiglia
29
15
0
07 Feb 2024
Densely Multiplied Physics Informed Neural Networks
Feilong Jiang
Xiaonan Hou
Min Xia
PINN
19
2
0
06 Feb 2024
Improving Pediatric Low-Grade Neuroepithelial Tumors Molecular Subtype Identification Using a Novel AUROC Loss Function for Convolutional Neural Networks
Khashayar Namdar
Matthias W. Wagner
C. Hawkins
U. Tabori
B. Ertl-Wagner
Farzad Khalvati
14
2
0
05 Feb 2024
Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation
Shuyao Wang
Yongduo Sui
Jiancan Wu
Zhi Zheng
Hui Xiong
15
16
0
05 Feb 2024
pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning
Liping Yi
Han Yu
Chao Ren
Heng-Ming Zhang
Gang Wang
Xiaoguang Liu
Xiaoxiao Li
MoE
31
8
0
02 Feb 2024
Comparing Spectral Bias and Robustness For Two-Layer Neural Networks: SGD vs Adaptive Random Fourier Features
Aku Kammonen
Lisi Liang
Anamika Pandey
Raúl Tempone
31
2
0
01 Feb 2024
Effective Multi-Stage Training Model For Edge Computing Devices In Intrusion Detection
Thua Huynh Trong
Thanh Nguyen Hoang
24
3
0
31 Jan 2024
Optimal Potential Shaping on SE(3) via Neural ODEs on Lie Groups
Yannik P. Wotte
Federico Califano
Stefano Stramigioli
AI4CE
29
1
0
25 Jan 2024
Inadequacy of common stochastic neural networks for reliable clinical decision support
Adrian Lindenmeyer
Malte Blattmann
S. Franke
Thomas Neumuth
Daniel Schneider
BDL
35
1
0
24 Jan 2024
Dynamic Layer Tying for Parameter-Efficient Transformers
Tamir David Hay
Lior Wolf
33
3
0
23 Jan 2024
Improving Local Training in Federated Learning via Temperature Scaling
Kichang Lee
Songkuk Kim
Jeonggil Ko
FedML
35
1
0
18 Jan 2024
GD doesn't make the cut: Three ways that non-differentiability affects neural network training
Siddharth Krishna Kumar
AAML
26
2
0
16 Jan 2024
Accelerated Sampling of Rare Events using a Neural Network Bias Potential
Xinru Hua
R. Ahmad
Jose Blanchet
Wei Cai
AI4CE
90
3
0
13 Jan 2024
Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech
Yu Xi
Baochen Yang
Hao Li
Jiaqi Guo
Kai Yu
39
4
0
12 Jan 2024
Ordering-Flexible Multi-Robot Coordination for MovingTarget Convoying Using Long-TermTask Execution
Bin-Bin Hu
Yanxin Zhou
Henglai Wei
Yan Wang
Chen Lv
24
5
0
12 Jan 2024
Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process
Zhenan Fan
Bissan Ghaddar
Xinglu Wang
Linzi Xing
Yong Zhang
Zirui Zhou
AI4CE
53
11
0
06 Jan 2024
Previous
1
2
3
4
5
6
...
19
20
21
Next