1608.03983
Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts
13 August 2016
I. Loshchilov
Frank Hutter
ODL
Papers citing
"SGDR: Stochastic Gradient Descent with Warm Restarts"
50 / 4,280 papers shown
Title
The Evolved Transformer
David R. So
Chen Liang
Quoc V. Le
ViT
38
460
0
30 Jan 2019
Semantic Redundancies in Image-Classification Datasets: The 10% You Don't Need
Vighnesh Birodkar
H. Mobahi
Samy Bengio
21
82
0
29 Jan 2019
Pay Less Attention with Lightweight and Dynamic Convolutions
Felix Wu
Angela Fan
Alexei Baevski
Yann N. Dauphin
Michael Auli
11
604
0
29 Jan 2019
Using Pre-Training Can Improve Model Robustness and Uncertainty
Dan Hendrycks
Kimin Lee
Mantas Mazeika
NoLa
34
721
0
28 Jan 2019
Deep Learning on Small Datasets without Pre-Training using Cosine Loss
Björn Barz
Joachim Denzler
27
129
0
25 Jan 2019
Pricing options and computing implied volatilities using neural networks
Shuaiqiang Liu
C. Oosterlee
S. Bohté
19
119
0
25 Jan 2019
Simultaneous lesion and neuroanatomy segmentation in Multiple Sclerosis using deep neural networks
Richard McKinley
Rik Wepfer
F. Aschwanden
L. Grunder
Raphaela Muri
...
M. Reyes
A. Salmen
A. Chan
F. Wagner
Roland Wiest
24
15
0
22 Jan 2019
Backbone Can Not be Trained at Once: Rolling Back to Pre-trained Network for Person Re-Identification
Youngmin Ro
Jongwon Choi
D. Jo
Byeongho Heo
Jongin Lim
J. Choi
27
16
0
18 Jan 2019
EAT-NAS: Elastic Architecture Transfer for Accelerating Large-scale Neural Architecture Search
Jiemin Fang
Yukang Chen
Xinbang Zhang
Qian Zhang
Chang Huang
Gaofeng Meng
Wenyu Liu
Xinggang Wang
36
24
0
17 Jan 2019
Deep learning-based electroencephalography analysis: a systematic review
Yannick Roy
Hubert J. Banville
Isabela Albuquerque
Alexandre Gramfort
T. Falk
J. Faubert
25
937
0
16 Jan 2019
URNet : User-Resizable Residual Networks with Conditional Gating Module
Sang-ho Lee
Simyung Chang
Nojun Kwak
21
11
0
15 Jan 2019
Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
Victor Zhong
Caiming Xiong
N. Keskar
R. Socher
27
63
0
03 Jan 2019
Actor Conditioned Attention Maps for Video Action Detection
Oytun Ulutan
S. Rallapalli
Mudhakar Srivatsa
Carlos Torres
B. S. Manjunath
19
42
0
30 Dec 2018
AVRA: Automatic Visual Ratings of Atrophy from MRI images using Recurrent Convolutional Neural Networks
G. Mårtensson
D. Ferreira
L. Cavallin
J.-Sebastian Muehlboeck
L. Wahlund
Chunliang Wang
E. Westman
38
20
0
23 Dec 2018
Meta Architecture Search
Albert Eaton Shaw
Wei Wei
Weiyang Liu
Le Song
Bo Dai
BDL
23
35
0
22 Dec 2018
Rethinking Layer-wise Feature Amounts in Convolutional Neural Network Architectures
Martin Mundt
Sagnik Majumder
Tobias Weis
Visvanathan Ramesh
FAtt
11
0
0
14 Dec 2018
Learning representations of molecules and materials with atomistic neural networks
Kristof T. Schütt
A. Tkatchenko
K. Müller
NAI
30
13
0
11 Dec 2018
Deep Anomaly Detection with Outlier Exposure
Dan Hendrycks
Mantas Mazeika
Thomas G. Dietterich
OODD
31
1,457
0
11 Dec 2018
Hyperbolic Deep Learning for Chinese Natural Language Understanding
Marko Valentin Micic
Hugo Chu
11
7
0
11 Dec 2018
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
116
3,222
0
10 Dec 2018
ShuffleNASNets: Efficient CNN models through modified Efficient Neural Architecture Search
Kevin Laube
A. Zell
UQCV
22
10
0
07 Dec 2018
Bag of Tricks for Image Classification with Convolutional Neural Networks
Tong He
Zhi-Li Zhang
Hang Zhang
Zhongyue Zhang
Junyuan Xie
Mu Li
224
1,400
0
04 Dec 2018
Transferring Knowledge across Learning Processes
Sebastian Flennerhag
Pablo G. Moreno
Neil D. Lawrence
Andreas C. Damianou
21
64
0
03 Dec 2018
Attention-based Adaptive Selection of Operations for Image Restoration in the Presence of Unknown Combined Distortions
Masanori Suganuma
Xing Liu
Takayuki Okatani
85
82
0
03 Dec 2018
Snapshot Distillation: Teacher-Student Optimization in One Generation
Chenglin Yang
Lingxi Xie
Chi Su
Alan Yuille
10
193
0
01 Dec 2018
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network
Sachin Mehta
Mohammad Rastegari
Linda G. Shapiro
Hannaneh Hajishirzi
VLM
29
393
0
28 Nov 2018
Attention-Based Deep Neural Networks for Detection of Cancerous and Precancerous Esophagus Tissue on Histopathological Slides
Naofumi Tomita
B. Abdollahi
Jason W. Wei
Bing Ren
A. Suriawinata
Saeed Hassanpour
MedIm
26
167
0
20 Nov 2018
Do Normalization Layers in a Deep ConvNet Really Need to Be Distinct?
Ping Luo
Zhanglin Peng
Jiamin Ren
Ruimao Zhang
FAtt
OOD
14
7
0
19 Nov 2018
Beyond Attributes: Adversarial Erasing Embedding Network for Zero-shot Learning
Xiaobo Jin
Kaizhu Huang
Jianyu Miao
24
0
0
19 Nov 2018
Deep Frank-Wolfe For Neural Network Optimization
Leonard Berrada
Andrew Zisserman
M. P. Kumar
ODL
21
40
0
19 Nov 2018
Learning data augmentation policies using augmented random search
Mingyang Geng
Kele Xu
Bo Ding
Huaimin Wang
Lei Zhang
27
9
0
12 Nov 2018
Measuring the Effects of Data Parallelism on Neural Network Training
Christopher J. Shallue
Jaehoon Lee
J. Antognini
Jascha Sohl-Dickstein
Roy Frostig
George E. Dahl
49
408
0
08 Nov 2018
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization
H. T. Kung
Bradley McDanel
Shanghang Zhang
35
133
0
07 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
15
72
0
02 Nov 2018
Analysing Dropout and Compounding Errors in Neural Language Models
James O'Neill
Danushka Bollegala
28
1
0
02 Nov 2018
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization
Anish Acharya
Rahul Goel
A. Metallinou
Inderjit Dhillon
25
58
0
01 Nov 2018
A Bayesian Perspective of Convolutional Neural Networks through a Deconvolutional Generative Model
Yujia Wang
Nhat Ho
David J. Miller
Anima Anandkumar
Michael I. Jordan
Richard G. Baraniuk
BDL
GAN
29
8
0
01 Nov 2018
Automated Machine Learning: From Principles to Practices
Quanming Yao
Mengshuo Wang
Hugo Jair Escalante
Huan Zhao
Qiang Yang
25
258
0
31 Oct 2018
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Akhilesh Deepak Gotmare
N. Keskar
Caiming Xiong
R. Socher
ODL
19
275
0
29 Oct 2018
DropFilter: Dropout for Convolutions
Zhengsu Chen
9
4
0
23 Oct 2018
Analysis of Atomistic Representations Using Weighted Skip-Connections
K. Nicoli
Pan Kessel
M. Gastegger
Kristof T. Schütt
33
0
0
23 Oct 2018
How to train your MAML
Antreas Antoniou
Harrison Edwards
Amos Storkey
23
769
0
22 Oct 2018
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks
Xiaodong Cui
Wei Zhang
Zoltán Tüske
M. Picheny
ODL
16
89
0
16 Oct 2018
Domain Confusion with Self Ensembling for Unsupervised Adaptation
Jiawei Wang
Zhaoshui He
Chengjian Feng
Zhouping Zhu
Q. Lin
Jun Lv
Shengli Xie
12
3
0
10 Oct 2018
NSGA-Net: Neural Architecture Search using Multi-Objective Genetic Algorithm
Zhichao Lu
Ian Whalen
Vishnu Boddeti
Yashesh D. Dhebar
Kalyanmoy Deb
E. Goodman
W. Banzhaf
34
81
0
08 Oct 2018
Adaptive Input Representations for Neural Language Modeling
Alexei Baevski
Michael Auli
32
388
0
28 Sep 2018
Hierarchy-based Image Embeddings for Semantic Image Retrieval
Björn Barz
Joachim Denzler
SSL
14
96
0
26 Sep 2018
Geometric Operator Convolutional Neural Network
Yangling Ma
Yixin Luo
Zhouwang Yang
19
4
0
04 Sep 2018
Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation
Xinge Zhu
Hui Zhou
Ceyuan Yang
Jianping Shi
Dahua Lin
22
104
0
04 Sep 2018
Towards Understanding Regularization in Batch Normalization
Ping Luo
Xinjiang Wang
Wenqi Shao
Zhanglin Peng
MLT
AI4CE
23
179
0
04 Sep 2018