An overview of gradient descent optimization algorithms

15 September 2016

Papers citing "An overview of gradient descent optimization algorithms"

50 / 697 papers shown

Quantum Langevin Dynamics for Optimization Zherui Chen Yuchen Lu Hao Wang Yizhou Liu Tongyang Li AI4CE 146 11 0 27 Nov 2023
Filtered Partial Differential Equations: a robust surrogate constraint in physics-informed deep learning framework Dashan Zhang Yuntian Chen Shiyi Chen AI4CE 82 2 0 07 Nov 2023
DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing Gaoshuang Huang Yang Zhou Xiaofei Hu Chenglong Zhang Luying Zhao Wenjian Gan Mingbo Hou 48 3 0 01 Nov 2023
Enhancing Deep Neural Network Training Efficiency and Performance through Linear Prediction Hejie Ying Mengmeng Song Yaohong Tang S. Xiao Zimin Xiao 73 10 0 17 Oct 2023
AdaLomo: Low-memory Optimization with Adaptive Learning Rate Kai Lv Hang Yan Qipeng Guo Haijun Lv Xipeng Qiu ODL 90 23 0 16 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios Yazhe Niu Yuan Pu Zhenjie Yang Xueyan Li Tong Zhou Jiyuan Ren Shuai Hu Hongsheng Li Yu Liu 139 15 0 12 Oct 2023
Unsupervised Representations Improve Supervised Learning in Speech Emotion Recognition Amirali Soltani Tehrani Niloufar Faridani Ramin Toosi SSL 40 3 0 22 Sep 2023
Neural Network Exemplar Parallelization with Go Georg Wiesinger Erich Schikuta MoE 30 0 0 15 Sep 2023
Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems Maria Chiara Angelini A. Cavaliere Raffaele Marino F. Ricci-Tersenghi 124 5 0 11 Sep 2023
Proof of Deep Learning: Approaches, Challenges, and Future Directions Mahmoud Salhab Khaleel W. Mershad 71 1 0 31 Aug 2023
Multilayer Multiset Neuronal Networks -- MMNNs Alexandre Benatti L. D. F. Costa 57 1 0 28 Aug 2023
Stable Adam Optimization for 16-bit Neural Networks Training Juyoung Yun 25 1 0 30 Jul 2023
Cross-dimensional transfer learning in medical image segmentation with deep learning Hicham Messaoudi Ahror Belaid Douraied BEN SALEM Pierre-Henri Conze MedIm 91 27 0 29 Jul 2023
Convergence of Adam for Non-convex Objectives: Relaxed Hyperparameters and Non-ergodic Case Meixuan He Yuqing Liang Jinlan Liu Dongpo Xu 87 9 0 20 Jul 2023
Sig-Splines: universal approximation and convex calibration of time series generative models Magnus Wiese Phillip Murray R. Korn AI4TS 151 1 0 19 Jul 2023
Learning Differentiable Logic Programs for Abstract Visual Reasoning Hikaru Shindo Viktor Pfanschilling Devendra Singh Dhami Kristian Kersting NAI 87 9 0 03 Jul 2023
The Deep Arbitrary Polynomial Chaos Neural Network or how Deep Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos Theory S. Oladyshkin T. Praditia Ilja Kroker F. Mohammadi Wolfgang Nowak S. Otte AI4CE 50 5 0 26 Jun 2023
Comparing Deep Learning Models for the Task of Volatility Prediction Using Multivariate Data Wenbo Ge Pooia Lalbakhsh Leigh Isai Artem Lenskiy Hanna Suominen OOD 34 3 0 20 Jun 2023
Schema-learning and rebinding as mechanisms of in-context learning and emergence Siva K. Swaminathan Antoine Dedieu Rajkumar Vasudeva Raju Murray Shanahan Miguel Lazaro-Gredilla Dileep George 101 14 0 16 Jun 2023
In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities Detection H. Nguyen Thinh B. Lam Quan D.D. Tran M. T. Nguyen Dat T. Chung V. Q. Dinh 71 8 0 12 Jun 2023
Nonparametric Iterative Machine Teaching Chen Zhang Xiaofeng Cao Weiyang Liu Ivor Tsang James T. Kwok 101 8 0 05 Jun 2023
ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields Kanghyeok Ko Minhyeok Lee 81 2 0 05 Jun 2023
Neuronal Cell Type Classification using Deep Learning Ofek Ophir Orit Shefi Ofir Lindenbaum 81 3 0 01 Jun 2023
Bayesian inference and neural estimation of acoustic wave propagation Yongchao Huang Yuhang He Hong Ge 65 0 0 28 May 2023
The Evolution of Distributed Systems for Graph Neural Networks and their Origin in Graph Processing and Deep Learning: A Survey Jana Vatter R. Mayer Hans-Arno Jacobsen GNN AI4TS AI4CE 98 29 0 23 May 2023
GraVAC: Adaptive Compression for Communication-Efficient Distributed DL Training S. Tyagi Martin Swany 81 5 0 20 May 2023
Dragon-Alpha&cu32: A Java-based Tensor Computing Framework With its High-Performance CUDA Library Zhiyi Zhang Pengfei Zhang Qi Wang 52 1 0 15 May 2023
Online Learning Under A Separable Stochastic Approximation Framework Min Gan Xiang-Xiang Su Guang-yong Chen Jing Chen 66 0 0 12 May 2023
Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species Tayfun Karaderi T. Burghardt R. Morard D. Schmidt 93 1 0 11 May 2023
LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition Fuyan Ma Bin Sun Shutao Li ViT 72 21 0 05 May 2023
Universal Adversarial Backdoor Attacks to Fool Vertical Federated Learning in Cloud-Edge Collaboration Peng Chen Xin Du Zhihui Lu Hongfeng Chai FedML AAML 98 11 0 22 Apr 2023
Benchmarking Low-Shot Robustness to Natural Distribution Shifts Aaditya K. Singh Kartik Sarangmath Prithvijit Chattopadhyay Judy Hoffman OOD 90 1 0 21 Apr 2023
Bayesian neural networks via MCMC: a Python-based tutorial Rohitash Chandra Royce Chen Joshua Simmons BDL 120 11 0 02 Apr 2023
Random Weights Networks Work as Loss Prior Constraint for Image Restoration Man Zhou Naishan Zheng Jie Huang Xiangyu Rui Chunle Guo Deyu Meng Chongyi Li Liang Feng 118 0 0 29 Mar 2023
Searching for long faint astronomical high energy transients: a data driven approach Riccardo Crupi G. Dilillo Kester Ward E. Bissaldi F. Fiore A. Vacchi 56 6 0 28 Mar 2023
Architecturing Binarized Neural Networks for Traffic Sign Recognition Andreea Postovan Madalina Erascu 39 4 0 27 Mar 2023
Reimagining Application User Interface (UI) Design using Deep Learning Methods: Challenges and Opportunities Subtain Malik M. T. Saeed Marya Jabeen Zia S. Rasool Liaquat A. Khan Mian Ilyas Ahmed AI4TS AI4CE 36 3 0 23 Mar 2023
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition Deepti Hegde Jeya Maria Jose Valanarasu Vishal M. Patel CLIP 122 68 0 20 Mar 2023
SSGD: A smartphone screen glass dataset for defect detection Haonan Han Rui Yang Shuyan Li R. Hu Xiu Li 93 13 0 12 Mar 2023
Error mitigation of entangled states using brainbox quantum autoencoders Joséphine Pazem M. Ansari 70 4 0 02 Mar 2023
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks Hao Sun Li Shen Qihuang Zhong Liang Ding Shi-Yong Chen Jingwei Sun Jing Li Guangzhong Sun Dacheng Tao 98 34 0 01 Mar 2023
Edge-Based Detection and Localization of Adversarial Oscillatory Load Attacks Orchestrated By Compromised EV Charging Stations Khaled Sarieddine M. Sayed Sadegh Torabi Ribal Atallah C. Assi 28 18 0 24 Feb 2023
Neural networks for learning personality traits from natural language Giorgia Adorni GNN 34 0 0 23 Feb 2023
Evolving Deep Neural Network by Customized Moth Flame Optimization Algorithm for Underwater Targets Recognition M. Khishe M. Mohammadi Tarik Ahmed Rashid Hoger Mahmud Seyedali Mirjalili 50 4 0 16 Feb 2023
Verifying Generalization in Deep Learning Guy Amir Osher Maayan Tom Zelazny Guy Katz Michael Schapira AAML AI4CE 81 15 0 11 Feb 2023
Text recognition on images using pre-trained CNN Afgani Fajar Rizky N. Yudistira Edy Santoso VLM 76 4 0 10 Feb 2023
PINN Training using Biobjective Optimization: The Trade-off between Data Loss and Residual Loss Fabian Heldmann Sarah Treibert Matthias Ehrhardt K. Klamroth 79 22 0 03 Feb 2023
Eloss in the way: A Sensitive Input Quality Metrics for Intelligent Driving Hao-Ting Yang Shiyan Zhang Zhuo Yang Xinyu Zhang 48 0 0 02 Feb 2023
Adapting Step-size: A Unified Perspective to Analyze and Improve Gradient-based Methods for Adversarial Attacks Wei Tao Lei Bao Long Sheng Gao-wei Wu Qing Tao AAML 61 1 0 27 Jan 2023
Multi-limb Split Learning for Tumor Classification on Vertically Distributed Data Omar S. Ads Mayar M. Alfares Mohammed Abdel-Megeed Salem 67 10 0 27 Jan 2023