Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1608.03983
Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts
13 August 2016
I. Loshchilov
Frank Hutter
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SGDR: Stochastic Gradient Descent with Warm Restarts"
50 / 4,280 papers shown
Title
PFDet: 2nd Place Solution to Open Images Challenge 2018 Object Detection Track
Takuya Akiba
Tommi Kerola
Yusuke Niitani
Toru Ogawa
Shotaro Sano
Shuji Suzuki
14
20
0
04 Sep 2018
A Survey of Modern Object Detection Literature using Deep Learning
K. Chahal
Kuntal Dey
ObjD
22
35
0
22 Aug 2018
Neural Architecture Optimization
Renqian Luo
Fei Tian
Tao Qin
Enhong Chen
Tie-Yan Liu
3DV
37
649
0
22 Aug 2018
Don't Use Large Mini-Batches, Use Local SGD
Tao R. Lin
Sebastian U. Stich
Kumar Kshitij Patel
Martin Jaggi
57
429
0
22 Aug 2018
BlockQNN: Efficient Block-wise Neural Network Architecture Generation
Zhaobai Zhong
Zichen Yang
Boyang Deng
Junjie Yan
Wei Wu
Jing Shao
Cheng-Lin Liu
14
113
0
16 Aug 2018
Backprop Evolution
Maximilian Alber
Irwan Bello
Barret Zoph
Pieter-Jan Kindermans
Prajit Ramachandran
Quoc V. Le
21
9
0
08 Aug 2018
DeepTAM: Deep Tracking and Mapping
Huizhong Zhou
Benjamin Ummenhofer
Thomas Brox
3DV
38
227
0
06 Aug 2018
Classification of Dermoscopy Images using Deep Learning
Dinesh Reddy Narapureddy
17
6
0
05 Aug 2018
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
47
220
0
28 Jul 2018
Towards Automated Deep Learning: Efficient Joint Neural Architecture and Hyperparameter Search
Arber Zela
Aaron Klein
Stefan Falkner
Frank Hutter
35
159
0
18 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
20
96
0
10 Jul 2018
Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations
Dan Hendrycks
Thomas G. Dietterich
OOD
22
197
0
04 Jul 2018
DARTS: Differentiable Architecture Search
Hanxiao Liu
Karen Simonyan
Yiming Yang
112
4,310
0
24 Jun 2018
DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures
Jin-Dong Dong
A. Cheng
Da-Cheng Juan
Wei Wei
Min Sun
25
181
0
21 Jun 2018
Banach Wasserstein GAN
J. Adler
Sebastian Lunz
GAN
13
217
0
18 Jun 2018
There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average
Ben Athiwaratkun
Marc Finzi
Pavel Izmailov
A. Wilson
208
243
0
14 Jun 2018
Efficient Full-Matrix Adaptive Regularization
Naman Agarwal
Brian Bullins
Xinyi Chen
Elad Hazan
Karan Singh
Cyril Zhang
Yi Zhang
18
21
0
08 Jun 2018
Path-Level Network Transformation for Efficient Architecture Search
Han Cai
Jiacheng Yang
Weinan Zhang
Song Han
Yong Yu
27
210
0
07 Jun 2018
Deep Fluids: A Generative Network for Parameterized Fluid Simulations
Byungsoo Kim
Vinicius Azevedo
N. Thürey
Theodore Kim
Markus Gross
B. Solenthaler
GAN
20
385
0
06 Jun 2018
Stochastic Gradient Descent with Hyperbolic-Tangent Decay on Classification
B. Hsueh
Wei Li
I-Chen Wu
13
22
0
05 Jun 2018
AutoAugment: Learning Augmentation Policies from Data
E. D. Cubuk
Barret Zoph
Dandelion Mané
Vijay Vasudevan
Quoc V. Le
63
1,758
0
24 May 2018
Input and Weight Space Smoothing for Semi-supervised Learning
Safa Cicek
Stefano Soatto
22
6
0
23 May 2018
Neural Architecture Search using Deep Neural Networks and Monte Carlo Tree Search
Linnan Wang
Yiyang Zhao
Yuu Jinnai
Yuandong Tian
Rodrigo Fonseca
BDL
36
50
0
18 May 2018
Born Again Neural Networks
Tommaso Furlanello
Zachary Chase Lipton
Michael Tschannen
Laurent Itti
Anima Anandkumar
36
1,026
0
12 May 2018
Intracranial Error Detection via Deep Learning
M. Völker
Jiří Hammer
R. Schirrmeister
Joos Behncke
L. Fiederer
A. Schulze-Bonhage
Petr Marusič
Wolfram Burgard
T. Ball
14
10
0
04 May 2018
SdcNet: A Computation-Efficient CNN for Object Recognition
Yunlong Ma
Chunyan Wang
21
3
0
03 May 2018
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
T. Elsken
J. H. Metzen
Frank Hutter
131
499
0
24 Apr 2018
Pelee: A Real-Time Object Detection System on Mobile Devices
R. Wang
Xiang Li
Charles X. Ling
ObjD
22
454
0
18 Apr 2018
Understanding Actors and Evaluating Personae with Gaussian Embeddings
Hannah Kim
Denys Katerenchuk
Daniel Billet
Jun Huan
Haesun Park
Boyang Albert Li
21
4
0
06 Apr 2018
On the Intrinsic Dimensionality of Image Representations
Sixue Gong
Vishnu Boddeti
Anil K. Jain
13
71
0
26 Mar 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
25
133
0
22 Mar 2018
Deep Co-Training for Semi-Supervised Image Recognition
Siyuan Qiao
Wei Shen
Zhishuai Zhang
Bo Wang
Alan Yuille
10
445
0
15 Mar 2018
Averaging Weights Leads to Wider Optima and Better Generalization
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
FedML
MoMe
60
1,621
0
14 Mar 2018
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Yichi Zhang
Zhijian Ou
27
0
0
01 Mar 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
704
0
26 Feb 2018
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
35
71
0
23 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
21
70
0
21 Feb 2018
Uncertainty Estimates and Multi-Hypotheses Networks for Optical Flow
Eddy Ilg
Özgün Çiçek
Silvio Galesso
Aaron Klein
Osama Makansi
Frank Hutter
Thomas Brox
UQCV
40
220
0
20 Feb 2018
Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise
Dan Hendrycks
Mantas Mazeika
Duncan Wilson
Kevin Gimpel
NoLa
70
547
0
14 Feb 2018
Efficient Neural Architecture Search via Parameter Sharing
Hieu H. Pham
M. Guan
Barret Zoph
Quoc V. Le
J. Dean
53
2,747
0
09 Feb 2018
Dynamic Graph CNN for Learning on Point Clouds
Yue Wang
Yongbin Sun
Ziwei Liu
Sanjay E. Sarma
M. Bronstein
Justin Solomon
GNN
3DPC
158
6,039
0
24 Jan 2018
Universal Language Model Fine-tuning for Text Classification
Jeremy Howard
Sebastian Ruder
VLM
29
274
0
18 Jan 2018
Improving Generalization Performance by Switching from Adam to SGD
N. Keskar
R. Socher
ODL
41
521
0
20 Dec 2017
Progressive Neural Architecture Search
Chenxi Liu
Barret Zoph
Maxim Neumann
Jonathon Shlens
Wei Hua
Li-Jia Li
Li Fei-Fei
Alan Yuille
Jonathan Huang
Kevin Patrick Murphy
11
1,981
0
02 Dec 2017
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
11
734
0
27 Nov 2017
CondenseNet: An Efficient DenseNet using Learned Group Convolutions
Gao Huang
Shichen Liu
Laurens van der Maaten
Kilian Q. Weinberger
50
796
0
25 Nov 2017
AOGNets: Compositional Grammatical Architectures for Deep Learning
Xilai Li
Xi Song
Tianfu Wu
37
25
0
15 Nov 2017
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
65
2,087
0
14 Nov 2017
Simple And Efficient Architecture Search for Convolutional Neural Networks
T. Elsken
J. H. Metzen
Frank Hutter
36
230
0
13 Nov 2017
Scale out for large minibatch SGD: Residual network training on ImageNet-1K with improved accuracy and reduced time to train
V. Codreanu
Damian Podareanu
V. Saletore
39
55
0
12 Nov 2017
Previous
1
2
3
...
84
85
86
Next