An overview of gradient descent optimization algorithms

15 September 2016

Papers citing "An overview of gradient descent optimization algorithms"

50 / 697 papers shown

Title
FU-net: Multi-class Image Segmentation Using Feedback Weighted U-net M. Jafari Ruizhe Li Yue Xing Dorothee Auer S. Francis J. Garibaldi Xin Chen SSeg 50 14 0 28 Apr 2020
A Review of Privacy-preserving Federated Learning for the Internet-of-Things Christopher Briggs Zhong Fan Péter András 135 15 0 24 Apr 2020
Self-Organized Operational Neural Networks with Generative Neurons S. Kiranyaz Junaid Malik Habib Ben Abdallah T. Ince Alexandros Iosifidis Moncef Gabbouj 55 81 0 24 Apr 2020
Supervised Contrastive Learning Prannay Khosla Piotr Teterwak Chen Wang Aaron Sarna Yonglong Tian Phillip Isola Aaron Maschinot Ce Liu Dilip Krishnan SSL 222 4,615 0 23 Apr 2020
Automated diagnosis of COVID-19 with limited posteroanterior chest X-ray images using fine-tuned deep neural networks Narinder Singh Punn Sonali Agarwal 114 193 0 23 Apr 2020
Heterogeneous CPU+GPU Stochastic Gradient Descent Algorithms Yujing Ma Florin Rusu 33 3 0 19 Apr 2020
Stochastic batch size for adaptive regularization in deep network optimization Kensuke Nakamura Stefano Soatto Byung-Woo Hong ODL 51 6 0 14 Apr 2020
Machine-Learning Dessins d'Enfants: Explorations via Modular and Seiberg-Witten Curves Yang-Hui He Edward Hirst Toby Peterken 51 37 0 10 Apr 2020
Structure-preserving neural networks Quercus Hernandez Alberto Badías D. González Francisco Chinesta Elías Cueto PINN 129 71 0 09 Apr 2020
Federated Multi-view Matrix Factorization for Personalized Recommendations Adrian Flanagan Were Oyomno A. Grigorievskiy K. E. Tan Suleiman A. Khan Muhammad Ammad-ud-din FedML 75 71 0 08 Apr 2020
Weighted Aggregating Stochastic Gradient Descent for Parallel Deep Learning Pengzhan Guo Zeyang Ye Keli Xiao Wei Zhu 50 14 0 07 Apr 2020
Adaptive Partial Scanning Transmission Electron Microscopy with Reinforcement Learning Jeffrey M. Ede 110 13 0 06 Apr 2020
On the convergence of physics informed neural networks for linear second-order elliptic and parabolic type PDEs Yeonjong Shin Jérome Darbon George Karniadakis PINN 73 79 0 03 Apr 2020
FeederGAN: Synthetic Feeder Generation via Deep Graph Adversarial Nets Ming Liang Yao Meng Jiyu Wang D. Lubkeman N. Lu GAN 48 23 0 03 Apr 2020
SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection Habtamu Fanta Zhiwen Shao Lizhuang Ma 36 39 0 30 Mar 2020
Non-Adversarial Video Synthesis with Learned Priors Abhishek Aich Akash Gupta Yikang Shen Rakib Hyder M. Salman Asif Amit K. Roy-Chowdhury VGen GAN 160 18 0 21 Mar 2020
Mass Estimation of Galaxy Clusters with Deep Learning I: Sunyaev-Zel'dovich Effect N. Gupta C. Reichardt 74 14 0 13 Mar 2020
Hyper-Parameter Optimization: A Review of Algorithms and Applications Tong Yu Hong Zhu AAML 99 541 0 12 Mar 2020
Improving the Backpropagation Algorithm with Consequentialism Weight Updates over Mini-Batches Naeem Paeedeh Kamaledin Ghiasi-Shirazi ODL 62 8 0 11 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models Magdalena Larfors Robin Schneider 81 38 0 10 Mar 2020
Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of Partitioned Edge Learning Dingzhu Wen M. Bennis Kaibin Huang 75 49 0 10 Mar 2020
Warwick Electron Microscopy Datasets Jeffrey M. Ede 105 14 0 02 Mar 2020
Do optimization methods in deep learning applications matter? Buse Melis Özyildirim Mariam Kiran 52 11 0 28 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking Shirli Di-Castro Shashua Shie Mannor OffRL 71 12 0 17 Feb 2020
Controlled time series generation for automotive software-in-the-loop testing using GANs Dhasarathy Parthasarathy Karl Bäckström Jens Henriksson S. Einarsdóttir 31 13 0 16 Feb 2020
CSM-NN: Current Source Model Based Logic Circuit Simulation -- A Neural Network Approach M. Abrishami Massoud Pedram Shahin Nazarian 24 7 0 13 Feb 2020
LaProp: Separating Momentum and Adaptivity in Adam Liu Ziyin Zhikang T. Wang Masahito Ueda ODL 70 18 0 12 Feb 2020
On Layer Normalization in the Transformer Architecture Ruibin Xiong Yunchang Yang Di He Kai Zheng Shuxin Zheng Chen Xing Huishuai Zhang Yanyan Lan Liwei Wang Tie-Yan Liu AI4CE 160 1,006 0 12 Feb 2020
D2D-Enabled Data Sharing for Distributed Machine Learning at Wireless Network Edge Xiaoran Cai Xiaopeng Mo Junyang Chen Jie Xu 43 26 0 28 Jan 2020
Design of Capacity-Approaching Low-Density Parity-Check Codes using Recurrent Neural Networks Eleni Nisioti N. Thomos 30 22 0 05 Jan 2020
A Comprehensive Survey of Multilingual Neural Machine Translation Raj Dabre Chenhui Chu Anoop Kunchukuttan LRM 116 33 0 04 Jan 2020
Distributed Stochastic Algorithms for High-rate Streaming Principal Component Analysis Haroon Raja W. Bajwa 88 11 0 04 Jan 2020
Deep Learning-Based Intrusion Detection System for Advanced Metering Infrastructure Zakaria El Mrabet Mehdi Ezzari Hassan El Ghazi B. A. E. Majd 23 14 0 31 Dec 2019
Parallel cross-validation: a scalable fitting method for Gaussian process models Florian Gerber D. Nychka 19 9 0 31 Dec 2019
Pipelined Training with Stale Weights of Deep Convolutional Neural Networks Lifu Zhang T. Abdelrahman 55 0 0 29 Dec 2019
SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss Functions A. Heydari Craig Thompson A. Mehmood 64 64 0 27 Dec 2019
Second-order Information in First-order Optimization Methods Yuzheng Hu Licong Lin Shange Tang ODL 53 2 0 20 Dec 2019
Optimization for deep learning: theory and algorithms Ruoyu Sun ODL 137 169 0 19 Dec 2019
Comparison of Neuronal Attention Models Mohamed Karim Belaid 40 1 0 07 Dec 2019
Physically Interpretable Neural Networks for the Geosciences: Applications to Earth System Variability B. Toms E. Barnes I. Ebert‐Uphoff AI4CE 103 216 0 04 Dec 2019
Region segmentation via deep learning and convex optimization Matthias Sonntag V. Morgenshtern 3DPC 28 1 0 28 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints Jingliang Duan Zhengyu Liu Shengbo Eben Li Qi Sun Zhenzhong Jia B. Cheng 72 65 0 26 Nov 2019
Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems Jayanta Mandi Emir Demirović Peter Stuckey Tias Guns 76 146 0 22 Nov 2019
Understanding the Disharmony between Weight Normalization Family and Weight Decay: $\epsilon$-shifted $L_2$ Regularizer Li Xiang Chen Shuo Xia Yan Yang Jian 59 2 0 14 Nov 2019
Variable Star Classification Using Multi-View Metric Learning K. Johnston S. Caballero-Nieves V. Petit A. Peter Rana Haber 50 3 0 13 Nov 2019
Short-term forecasting of solar irradiance without local telemetry: a generalized model using satellite data J. Lago K. D. Brabandere F. Ridder B. de Schutter 37 59 0 12 Nov 2019
Regularized Deep Networks in Intelligent Transportation Systems: A Taxonomy and a Case Study Mohammad Mahdi Bejani M. Ghatee OOD 41 13 0 08 Nov 2019
An Efficient and Effective Second-Order Training Algorithm for LSTM-based Adaptive Learning Nuri Mert Vural Salih Ergüt Suleyman S. Kozat 41 13 0 22 Oct 2019
Topological Navigation Graph Framework P. Daniušis Shubham Juneja Lukas Valatka Linas Petkevičius 67 1 0 15 Oct 2019
Characterizing Deep Learning Training Workloads on Alibaba-PAI Mengdi Wang Chen Meng Guoping Long Chuan Wu Jun Yang Wei Lin Yangqing Jia 73 56 0 14 Oct 2019