1609.04747
Cited By
An overview of gradient descent optimization algorithms
15 September 2016
Sebastian Ruder
ODL
Papers citing
"An overview of gradient descent optimization algorithms"
50 / 1,015 papers shown
Title
Learning Differentiable Logic Programs for Abstract Visual Reasoning
Hikaru Shindo
Viktor Pfanschilling
Devendra Singh Dhami
Kristian Kersting
NAI
34
6
0
03 Jul 2023
CoPL: Contextual Prompt Learning for Vision-Language Understanding
Koustava Goswami
Srikrishna Karanam
Prateksha Udhayanan
K. J. Joseph
Balaji Vasan Srinivasan
VLM
26
8
0
03 Jul 2023
Structured Network Pruning by Measuring Filter-wise Interactions
Wenting Tang
Xingxing Wei
Bo-wen Li
25
0
0
03 Jul 2023
Classifying World War II Era Ciphers with Machine Learning
Brooke Dalton
Mark Stamp
39
0
0
02 Jul 2023
Resetting the Optimizer in Deep RL: An Empirical Study
Kavosh Asadi
Rasool Fakoor
Shoham Sabach
ODL
26
23
0
30 Jun 2023
Koopman operator learning using invertible neural networks
Yuhuang Meng
Jian-Kai Huang
Yue Qiu
13
12
0
30 Jun 2023
Weight Compander: A Simple Weight Reparameterization for Regularization
Rinor Cakaj
Jens Mehnert
B. Yang
27
1
0
29 Jun 2023
The Deep Arbitrary Polynomial Chaos Neural Network or how Deep Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos Theory
S. Oladyshkin
T. Praditia
Ilja Kroker
F. Mohammadi
Wolfgang Nowak
S. Otte
AI4CE
24
5
0
26 Jun 2023
Comparing Deep Learning Models for the Task of Volatility Prediction Using Multivariate Data
Wenbo Ge
Pooia Lalbakhsh
Leigh Isai
Artem Lenskiy
Hanna Suominen
OOD
23
3
0
20 Jun 2023
Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent
J. Lin
Javier Antorán
Shreyas Padhy
David Janz
José Miguel Hernández-Lobato
Alexander Terenin
29
23
0
20 Jun 2023
Shape Guided Gradient Voting for Domain Generalization
Jiaqi Xu
Yuwang Wang
Xuejin Chen
26
0
0
19 Jun 2023
Full Parameter Fine-tuning for Large Language Models with Limited Resources
Kai Lv
Yuqing Yang
Tengxiao Liu
Qi-jie Gao
Qipeng Guo
Xipeng Qiu
58
128
0
16 Jun 2023
Schema-learning and rebinding as mechanisms of in-context learning and emergence
Siva K. Swaminathan
Antoine Dedieu
Rajkumar Vasudeva Raju
Murray Shanahan
Miguel Lazaro-Gredilla
Dileep George
41
10
0
16 Jun 2023
Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Ramnath Kumar
Kushal Majmundar
Dheeraj M. Nagaraj
A. Suggala
ODL
37
6
0
15 Jun 2023
Searching for the Fakes: Efficient Neural Architecture Search for General Face Forgery Detection
Xiao Jin
Xinwen Mu
Jing Xu
CVBM
18
0
0
15 Jun 2023
Time-to-Collision-Aware Lane-Change Strategy Based on Potential Field and Cubic Polynomial for Autonomous Vehicles
Pengfei Lin
Ehsan Javanmardi
Ye Tao
Vishal Chauhan
Jin Nakazato
Manabu Tsukada
20
5
0
12 Jun 2023
In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities Detection
H. Nguyen
Thinh B. Lam
Quan D.D. Tran
M. T. Nguyen
Dat T. Chung
V. Q. Dinh
34
8
0
12 Jun 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Yuhang Ran
Yi-Chen Li
Fuxiang Zhang
Zongzhang Zhang
Yang Yu
OffRL
32
23
0
11 Jun 2023
EMO: Episodic Memory Optimization for Few-Shot Meta-Learning
Yingjun Du
Jiayi Shen
Xiantong Zhen
Cees G. M. Snoek
27
1
0
08 Jun 2023
Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning
Libin Zhu
Chaoyue Liu
Adityanarayanan Radhakrishnan
M. Belkin
38
14
0
07 Jun 2023
Nonparametric Iterative Machine Teaching
Chen Zhang
Xiaofeng Cao
Weiyang Liu
Ivor Tsang
James T. Kwok
21
8
0
05 Jun 2023
ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields
Kanghyeok Ko
Minhyeok Lee
39
2
0
05 Jun 2023
Jammer classification with Federated Learning
Peng Wu
Helena Calatrava
Tales Imbiriba
Pau Closas
28
8
0
05 Jun 2023
Biologically-Motivated Learning Model for Instructed Visual Processing
R. Abel
S. Ullman
30
0
0
04 Jun 2023
TIES-Merging: Resolving Interference When Merging Models
Prateek Yadav
Derek Tam
Leshem Choshen
Colin Raffel
Joey Tianyi Zhou
MoMe
65
261
0
02 Jun 2023
Neuronal Cell Type Classification using Deep Learning
Ofek Ophir
Orit Shefi
Ofir Lindenbaum
24
2
0
01 Jun 2023
Bayesian inference and neural estimation of acoustic wave propagation
Yongchao Huang
Yuhang He
Hong Ge
34
0
0
28 May 2023
DeepSI: Interactive Deep Learning for Semantic Interaction
Yali Bian
Chris North
HAI
15
15
0
26 May 2023
A Tale of Two Approximations: Tightening Over-Approximation for DNN Robustness Verification via Under-Approximation
Zhiyi Xue
Si Liu
Zhaodi Zhang
Yiting Wu
Hao Fei
AAML
23
2
0
26 May 2023
Batch Model Consolidation: A Multi-Task Model Consolidation Framework
Iordanis Fostiropoulos
Jiaye Zhu
Laurent Itti
MoMe
CLL
32
3
0
25 May 2023
NODDLE: Node2vec based deep learning model for link prediction
Kazi Zainab Khanam
Aditya Singhal
Vijay K. Mago
11
3
0
25 May 2023
Condensed Prototype Replay for Class Incremental Learning
Jiangtao Kong
Zhenyu Zong
Dinesh Manocha
Huajie Shao
29
2
0
25 May 2023
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Enis Simsar
A. Tezcan
...
Hadrien Reynaud
Sarthak Pati
Christian Bluethgen
M. K. Özdemir
Bjoern H. Menze
DiffM
MedIm
50
16
0
25 May 2023
TransWorldNG: Traffic Simulation via Foundation Model
Dingsu Wang
Xuhong Wang
Liang Chen
Shengyue Yao
Mi Jing
Honghai Li
Li Li
Shiqiang Bao
Feiyue Wang
Yilun Lin
24
13
0
25 May 2023
The Evolution of Distributed Systems for Graph Neural Networks and their Origin in Graph Processing and Deep Learning: A Survey
Jana Vatter
R. Mayer
Hans-Arno Jacobsen
GNN
AI4TS
AI4CE
48
23
0
23 May 2023
GraVAC: Adaptive Compression for Communication-Efficient Distributed DL Training
S. Tyagi
Martin Swany
35
4
0
20 May 2023
Brain-inspired learning in artificial neural networks: a review
Samuel Schmidgall
Jascha Achterberg
Thomas Miconi
Louis Kirsch
Rojin Ziaei
S. P. Hajiseyedrazi
Jason K. Eshraghian
41
52
0
18 May 2023
Contrastive Label Enhancement
Yifei Wang
Yi Zhou
Jihua Zhu
Xinyuan Liu
Wen-biao Yan
Zhiqiang Tian
21
5
0
16 May 2023
Dragon-Alpha&cu32: A Java-based Tensor Computing Framework With its High-Performance CUDA Library
Zhiyi Zhang
Pengfei Zhang
Qi Wang
20
1
0
15 May 2023
Online Learning Under A Separable Stochastic Approximation Framework
Min Gan
Xiang-Xiang Su
Guang-yong Chen
Jing Chen
28
0
0
12 May 2023
Meta-Optimization for Higher Model Generalizability in Single-Image Depth Prediction
Cho-Ying Wu
Yiqi Zhong
Junying Wang
Ulrich Neumann
MDE
43
5
0
12 May 2023
Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species
Tayfun Karaderi
T. Burghardt
R. Morard
D. Schmidt
40
1
0
11 May 2023
Towards Invisible Backdoor Attacks in the Frequency Domain against Deep Neural Networks
Xinrui Liu
Yajie Wang
Yu-an Tan
Kefan Qiu
Yuan-zhang Li
AAML
9
1
0
10 May 2023
Stealthy Low-frequency Backdoor Attack against Deep Neural Networks
Xinrui Liu
Yu-an Tan
Yajie Wang
Kefan Qiu
Yuan-zhang Li
AAML
6
1
0
10 May 2023
BARA: Efficient Incentive Mechanism with Online Reward Budget Allocation in Cross-Silo Federated Learning
Yunchao Yang
Yipeng Zhou
Miao Hu
Di Wu
Quan Z. Sheng
FedML
31
7
0
09 May 2023
TaLU: A Hybrid Activation Function Combining Tanh and Rectified Linear Unit to Enhance Neural Networks
M. Hasan
Md. Ali Hossain
Azmain Yakin Srizon
Abu Sayeed
17
0
0
08 May 2023
LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Fuyan Ma
Bin Sun
Shutao Li
ViT
27
20
0
05 May 2023
Backdoor Learning on Sequence to Sequence Models
Lichang Chen
Minhao Cheng
Heng-Chiao Huang
SILM
54
18
0
03 May 2023
Part Aware Contrastive Learning for Self-Supervised Action Recognition
Yilei Hua
Wenhan Wu
Ce Zheng
Aidong Lu
Mengyuan Liu
Chong Chen
Shiqian Wu
SSL
105
35
0
01 May 2023
ViewFormer: View Set Attention for Multi-view 3D Shape Understanding
Hongyu Sun
Yongcai Wang
Peng Wang
Xudong Cai
Deying Li
26
2
0
29 Apr 2023