
An overview of gradient descent optimization algorithms

15 September 2016

Papers citing "An overview of gradient descent optimization algorithms"

47 / 697 papers shown

Title
AirLab: Autograd Image Registration Laboratory Robin Sandkühler C. Jud Simon Andermatt Philippe C. Cattin 63 53 0 26 Jun 2018
Deep Learning based Estimation of Weaving Target Maneuvers Vitaly Shalumov Itzik Klein 24 0 0 13 Jun 2018
A Deep Neural Network Surrogate for High-Dimensional Random Partial Differential Equations M. A. Nabian Hadi Meidani AI4CE 76 102 0 08 Jun 2018
A Machine Learning Framework for Stock Selection XingYu Fu JinHong Du Yifeng Guo MingWen Liu Tao Dong XiuWen Duan AIFin 63 31 0 05 Jun 2018
Solving the Kolmogorov PDE by means of deep learning C. Beck S. Becker Philipp Grohs Nor Jaafari Arnulf Jentzen 83 96 0 01 Jun 2018
Dynamic learning rate using Mutual Information Shrihari Vasudevan 30 6 0 18 May 2018
Opinion Fraud Detection via Neural Autoencoder Decision Forest Manqing Dong Lina Yao Xianzhi Wang B. Benatallah Chaoran Huang Xiaodong Ning 44 57 0 09 May 2018
An improvement of the convergence proof of the ADAM-Optimizer Sebastian Bock Josef Goppold M. Weiß 73 143 0 27 Apr 2018
High-dimension Tensor Completion via Gradient-based Optimization Under Tensor-train Format Longhao Yuan Qibin Zhao Lihua Gui Jianting Cao ViT 70 57 0 05 Apr 2018
Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks Xiaoyuan Liang Xunsheng Du Guiling Wang Zhu Han 67 418 0 29 Mar 2018
What Do We Understand About Convolutional Networks? Isma Hadji Richard P. Wildes FAtt 66 99 0 23 Mar 2018
A high-bias, low-variance introduction to Machine Learning for physicists Pankaj Mehta Marin Bukov Ching-Hao Wang A. G. Day C. Richardson Charles K. Fisher D. Schwab AI4CE 133 885 0 23 Mar 2018
Lower error bounds for the stochastic gradient descent optimization algorithm: Sharp convergence rates for slowly and fast decaying learning rates Arnulf Jentzen Philippe von Wurstemberger 101 31 0 22 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning Maurice Yang Mahmoud Faraj Assem Hussein V. Gaudet CVBM 64 12 0 15 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey Chaoyun Zhang P. Patras Hamed Haddadi 134 1,320 0 12 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md. Zahangir Alom T. Taha C. Yakopcic Stefan Westberg P. Sidike Mst Shamima Nasrin B. Van Essen A. Awwal V. Asari VLM 133 883 0 03 Mar 2018
Slow and Stale Gradients Can Win the Race: Error-Runtime Trade-offs in Distributed SGD Sanghamitra Dutta Gauri Joshi Soumyadip Ghosh Parijat Dube P. Nagpurkar 82 198 0 03 Mar 2018
Anticipation in Human-Robot Cooperation: A Recurrent Neural Network Approach for Multiple Action Sequences Prediction Paul Schydlo M. Raković L. Jamone J. Santos-Victor 123 64 0 28 Feb 2018
$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space Qi Meng Shuxin Zheng Huishuai Zhang Wei Chen Zhi-Ming Ma Tie-Yan Liu 133 39 0 11 Feb 2018
Deep learning in radiology: an overview of the concepts and a survey of the state of the art Maciej A. Mazurowski Mateusz Buda Ashirbani Saha Mustafa R. Bashir MedIm AI4CE 62 443 0 10 Feb 2018
Fast Point Spread Function Modeling with Deep Learning J. Herbel T. Kacprzak A. Amara Alexandre Réfrégier Aurelien Lucchi 79 44 0 23 Jan 2018
Universal Language Model Fine-tuning for Text Classification Jeremy Howard Sebastian Ruder VLM 83 276 0 18 Jan 2018
MXNET-MPI: Embedding MPI parallelism in Parameter Server Task Model for scaling Deep Learning Amith R. Mamidala Georgios Kollias C. Ward F. Artico 78 20 0 11 Jan 2018
Convergence Analysis of Gradient Descent Algorithms with Proportional Updates Igor Gitman D. Dilipkumar Ben Parr 32 5 0 09 Jan 2018
Recent Advances in Recurrent Neural Networks Hojjat Salehinejad Sharan Sankar Joseph Barfett E. Colak S. Valaee AI4TS 153 587 0 29 Dec 2017
Deep supervised learning using local errors Hesham Mostafa V. Ramesh Gert Cauwenberghs 75 115 0 17 Nov 2017
Sequential Keystroke Behavioral Biometrics for Mobile User Identification via Multi-view Deep Learning Lichao Sun Yuqi Wang Bokai Cao Philip S. Yu W. Srisa-an Alex Leow HAI 68 46 0 07 Nov 2017
Estimating Historical Hourly Traffic Volumes via Machine Learning and Vehicle Probe Data: A Maryland Case Study Przemysław Sekuła Nikola Marković Zachary Vander Laan K. Sadabadi 40 65 0 02 Nov 2017
A Novel Stochastic Stratified Average Gradient Method: Convergence Rate and Its Complexity Aixiang Chen Bingchuan Chen Xiaolong Chai Rui-Ling Bian Hengguang Li 77 22 0 21 Oct 2017
Clickbait Detection in Tweets Using Self-attentive Network Yiwei Zhou 70 53 0 15 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial Mingzhe Chen Ursula Challita Walid Saad Changchuan Yin Mérouane Debbah 102 209 0 09 Oct 2017
Machine learning approximation algorithms for high-dimensional fully nonlinear partial differential equations and second-order backward stochastic differential equations C. Beck Weinan E Arnulf Jentzen 87 333 0 18 Sep 2017
A Tutorial on Deep Learning for Music Information Retrieval Keunwoo Choi Gyorgy Fazekas Kyunghyun Cho Mark Sandler VLM 169 91 0 13 Sep 2017
Neural Translation of Musical Style Iman Malik Carl Henrik Ek 83 38 0 11 Aug 2017
Argument Labeling of Explicit Discourse Relations using LSTM Neural Networks Sohail Hooda Leila Kosseim 18 9 0 11 Aug 2017
Forecasting day-ahead electricity prices in Europe: the importance of considering market integration J. Lago F. Ridder Peter Vrancx B. de Schutter 41 203 0 01 Aug 2017
Analysis and Optimization of Convolutional Neural Network Architectures Martin Thoma 101 73 0 31 Jul 2017
Tensor-Based Backpropagation in Neural Networks with Non-Sequential Input Hirsh R. Agarwal Andrew Huang 24 0 0 13 Jul 2017
Convergence Analysis of Optimization Algorithms Hyoungseok Kim Jihoon Kang Woo-Myoung Park SukHyun Ko Yoon-Ho Choi Daesung Yu YoungSook Song JungWon Choi 31 8 0 06 Jul 2017
Variants of RMSProp and Adagrad with Logarithmic Regret Bounds Mahesh Chandra Mukkamala Matthias Hein ODL 84 258 0 17 Jun 2017
Train longer, generalize better: closing the generalization gap in large batch training of neural networks Elad Hoffer Itay Hubara Daniel Soudry ODL 204 803 0 24 May 2017
Deep Learning Based Regression and Multi-class Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction Youjun Xu Jianfeng Pei L. Lai 64 192 0 16 Apr 2017
Efficient Parallel Translating Embedding For Knowledge Graphs Denghui Zhang Manling Li Yantao Jia Yuanzhuo Wang Xueqi Cheng 73 18 0 30 Mar 2017
Deep Robust Kalman Filter Shirli Di-Castro Shashua Shie Mannor BDL 79 28 0 07 Mar 2017
Neural Machine Translation and Sequence-to-sequence Models: A Tutorial Graham Neubig AIMat 104 173 0 05 Mar 2017
A State Space Approach for Piecewise-Linear Recurrent Neural Networks for Reconstructing Nonlinear Dynamics from Neural Measurements Daniel Durstewitz 239 55 0 23 Dec 2016
Cyclical Learning Rates for Training Neural Networks L. Smith ODL 246 2,548 0 03 Jun 2015