SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown

Title
The Evolved Transformer David R. So Chen Liang Quoc V. Le ViT 38 460 0 30 Jan 2019
Semantic Redundancies in Image-Classification Datasets: The 10% You Don't Need Vighnesh Birodkar H. Mobahi Samy Bengio 21 82 0 29 Jan 2019
Pay Less Attention with Lightweight and Dynamic Convolutions Felix Wu Angela Fan Alexei Baevski Yann N. Dauphin Michael Auli 11 604 0 29 Jan 2019
Using Pre-Training Can Improve Model Robustness and Uncertainty Dan Hendrycks Kimin Lee Mantas Mazeika NoLa 34 721 0 28 Jan 2019
Deep Learning on Small Datasets without Pre-Training using Cosine Loss Björn Barz Joachim Denzler 27 129 0 25 Jan 2019
Pricing options and computing implied volatilities using neural networks Shuaiqiang Liu C. Oosterlee S. Bohté 19 119 0 25 Jan 2019
Simultaneous lesion and neuroanatomy segmentation in Multiple Sclerosis using deep neural networks Richard McKinley Rik Wepfer F. Aschwanden L. Grunder Raphaela Muri ... M. Reyes A. Salmen A. Chan F. Wagner Roland Wiest 24 15 0 22 Jan 2019
Backbone Can Not be Trained at Once: Rolling Back to Pre-trained Network for Person Re-Identification Youngmin Ro Jongwon Choi D. Jo Byeongho Heo Jongin Lim J. Choi 27 16 0 18 Jan 2019
EAT-NAS: Elastic Architecture Transfer for Accelerating Large-scale Neural Architecture Search Jiemin Fang Yukang Chen Xinbang Zhang Qian Zhang Chang Huang Gaofeng Meng Wenyu Liu Xinggang Wang 36 24 0 17 Jan 2019
Deep learning-based electroencephalography analysis: a systematic review Yannick Roy Hubert J. Banville Isabela Albuquerque Alexandre Gramfort T. Falk J. Faubert 25 937 0 16 Jan 2019
URNet : User-Resizable Residual Networks with Conditional Gating Module Sang-ho Lee Simyung Chang Nojun Kwak 21 11 0 15 Jan 2019
Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering Victor Zhong Caiming Xiong N. Keskar R. Socher 27 63 0 03 Jan 2019
Actor Conditioned Attention Maps for Video Action Detection Oytun Ulutan S. Rallapalli Mudhakar Srivatsa Carlos Torres B. S. Manjunath 19 42 0 30 Dec 2018
AVRA: Automatic Visual Ratings of Atrophy from MRI images using Recurrent Convolutional Neural Networks G. Mårtensson D. Ferreira L. Cavallin J.-Sebastian Muehlboeck L. Wahlund Chunliang Wang E. Westman 38 20 0 23 Dec 2018
Meta Architecture Search Albert Eaton Shaw Wei Wei Weiyang Liu Le Song Bo Dai BDL 23 35 0 22 Dec 2018
Rethinking Layer-wise Feature Amounts in Convolutional Neural Network Architectures Martin Mundt Sagnik Majumder Tobias Weis Visvanathan Ramesh FAtt 11 0 0 14 Dec 2018
Learning representations of molecules and materials with atomistic neural networks Kristof T. Schütt A. Tkatchenko K. Müller NAI 30 13 0 11 Dec 2018
Deep Anomaly Detection with Outlier Exposure Dan Hendrycks Mantas Mazeika Thomas G. Dietterich OODD 31 1,457 0 11 Dec 2018
Hyperbolic Deep Learning for Chinese Natural Language Understanding Marko Valentin Micic Hugo Chu 11 7 0 11 Dec 2018
SlowFast Networks for Video Recognition Christoph Feichtenhofer Haoqi Fan Jitendra Malik Kaiming He 116 3,222 0 10 Dec 2018
ShuffleNASNets: Efficient CNN models through modified Efficient Neural Architecture Search Kevin Laube A. Zell UQCV 22 10 0 07 Dec 2018
Bag of Tricks for Image Classification with Convolutional Neural Networks Tong He Zhi-Li Zhang Hang Zhang Zhongyue Zhang Junyuan Xie Mu Li 224 1,400 0 04 Dec 2018
Transferring Knowledge across Learning Processes Sebastian Flennerhag Pablo G. Moreno Neil D. Lawrence Andreas C. Damianou 21 64 0 03 Dec 2018
Attention-based Adaptive Selection of Operations for Image Restoration in the Presence of Unknown Combined Distortions Masanori Suganuma Xing Liu Takayuki Okatani 85 82 0 03 Dec 2018
Snapshot Distillation: Teacher-Student Optimization in One Generation Chenglin Yang Lingxi Xie Chi Su Alan Yuille 10 193 0 01 Dec 2018
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network Sachin Mehta Mohammad Rastegari Linda G. Shapiro Hannaneh Hajishirzi VLM 29 393 0 28 Nov 2018
Attention-Based Deep Neural Networks for Detection of Cancerous and Precancerous Esophagus Tissue on Histopathological Slides Naofumi Tomita B. Abdollahi Jason W. Wei Bing Ren A. Suriawinata Saeed Hassanpour MedIm 26 167 0 20 Nov 2018
Do Normalization Layers in a Deep ConvNet Really Need to Be Distinct? Ping Luo Zhanglin Peng Jiamin Ren Ruimao Zhang FAtt OOD 14 7 0 19 Nov 2018
Beyond Attributes: Adversarial Erasing Embedding Network for Zero-shot Learning Xiaobo Jin Kaizhu Huang Jianyu Miao 24 0 0 19 Nov 2018
Deep Frank-Wolfe For Neural Network Optimization Leonard Berrada Andrew Zisserman M. P. Kumar ODL 21 40 0 19 Nov 2018
Learning data augmentation policies using augmented random search Mingyang Geng Kele Xu Bo Ding Huaimin Wang Lei Zhang 27 9 0 12 Nov 2018
Measuring the Effects of Data Parallelism on Neural Network Training Christopher J. Shallue Jaehoon Lee J. Antognini J. Mamou J. Ketterling Yao Wang 49 408 0 08 Nov 2018
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization H. T. Kung Bradley McDanel Shanghang Zhang 35 133 0 07 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs Gaurav Maheshwari Priyansh Trivedi Denis Lukovnikov Nilesh Chakraborty Asja Fischer Jens Lehmann GNN 15 72 0 02 Nov 2018
Analysing Dropout and Compounding Errors in Neural Language Models James OÑeill Danushka Bollegala 28 1 0 02 Nov 2018
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization Anish Acharya Rahul Goel A. Metallinou Inderjit Dhillon 25 58 0 01 Nov 2018
A Bayesian Perspective of Convolutional Neural Networks through a Deconvolutional Generative Model Yujia Wang Nhat Ho David J. Miller Anima Anandkumar Michael I. Jordan Richard G. Baraniuk BDL GAN 29 8 0 01 Nov 2018
Automated Machine Learning: From Principles to Practices Quanming Yao Mengshuo Wang Hugo Jair Escalante Huan Zhao Qiang Yang 25 258 0 31 Oct 2018
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation Akhilesh Deepak Gotmare N. Keskar Caiming Xiong R. Socher ODL 19 275 0 29 Oct 2018
DropFilter: Dropout for Convolutions Zhengsu Chen 9 4 0 23 Oct 2018
Analysis of Atomistic Representations Using Weighted Skip-Connections K. Nicoli Pan Kessel M. Gastegger Kristof T. Schütt 33 0 0 23 Oct 2018
How to train your MAML Antreas Antoniou Harrison Edwards Amos Storkey 23 769 0 22 Oct 2018
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks Xiaodong Cui Wei Zhang Zoltán Tüske M. Picheny ODL 16 89 0 16 Oct 2018
Domain Confusion with Self Ensembling for Unsupervised Adaptation Jiawei Wang Zhaoshui He Chengjian Feng Zhouping Zhu Q. Lin Jun Lv Shengli Xie 12 3 0 10 Oct 2018
NSGA-Net: Neural Architecture Search using Multi-Objective Genetic Algorithm Zhichao Lu Ian Whalen Vishnu Boddeti Yashesh D. Dhebar Kalyanmoy Deb E. Goodman W. Banzhaf 34 81 0 08 Oct 2018
Adaptive Input Representations for Neural Language Modeling Alexei Baevski Michael Auli 32 388 0 28 Sep 2018
Hierarchy-based Image Embeddings for Semantic Image Retrieval Björn Barz Joachim Denzler SSL 14 96 0 26 Sep 2018
Geometric Operator Convolutional Neural Network Yangling Ma Yixin Luo Zhouwang Yang 19 4 0 04 Sep 2018
Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation Xinge Zhu Hui Zhou Ceyuan Yang Jianping Shi Dahua Lin 22 104 0 04 Sep 2018
Towards Understanding Regularization in Batch Normalization Ping Luo Xinjiang Wang Wenqi Shao Zhanglin Peng MLT AI4CE 23 179 0 04 Sep 2018