1608.03983
Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts
13 August 2016
I. Loshchilov
Frank Hutter
ODL
Papers citing
"SGDR: Stochastic Gradient Descent with Warm Restarts"
50 / 4,280 papers shown
Title
An Exponential Learning Rate Schedule for Deep Learning
Zhiyuan Li
Sanjeev Arora
12
212
0
16 Oct 2019
Aerial Images Processing for Car Detection using Convolutional Neural Networks: Comparison between Faster R-CNN and YoloV3
Adel Ammar
Anis Koubaa
Mohanned Ahmed
Abdulrahman Saad
Bilel Benjdira
27
90
0
16 Oct 2019
Learning Generalisable Omni-Scale Representations for Person Re-Identification
Kaiyang Zhou
Yongxin Yang
Andrea Cavallaro
Tao Xiang
30
217
0
15 Oct 2019
Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach
Haichuan Yang
Shupeng Gui
Yuhao Zhu
Ji Liu
MQ
28
5
0
14 Oct 2019
One-Shot Neural Architecture Search via Self-Evaluated Template Network
Xuanyi Dong
Yezhou Yang
ViT
16
184
0
13 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
28
661
0
12 Oct 2019
Demon: Improved Neural Network Training with Momentum Decay
John Chen
Cameron R. Wolfe
Zhaoqi Li
Anastasios Kyrillidis
ODL
24
15
0
11 Oct 2019
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation
Hang Gao
Xizhou Zhu
Steve Lin
Jifeng Dai
21
64
0
07 Oct 2019
Covariance-free Partial Least Squares: An Incremental Dimensionality Reduction Method
Artur Jordão
M. Lie
V. H. C. Melo
William Robson Schwartz
23
3
0
05 Oct 2019
SELF: Learning to Filter Noisy Labels with Self-Ensembling
Philipp Kratzer
Marc Toussaint
Thi Phuong Nhung Ngo
T. Nguyen
Jim Mainprice
Thomas Brox
NoLa
42
310
0
04 Oct 2019
Distillation ≈ Early Stopping? Harvesting Dark Knowledge Utilizing Anisotropic Information Retrieval For Overparameterized Neural Network
Bin Dong
Jikai Hou
Yiping Lu
Zhihua Zhang
28
41
0
02 Oct 2019
Emotion Recognition with Spatial Attention and Temporal Softmax Pooling
Masih Aminbeidokhti
Marco Pedersoli
Patrick Cardinal
Eric Granger
CVBM
14
19
0
02 Oct 2019
Distilling Effective Supervision from Severe Label Noise
Zizhao Zhang
Han Zhang
Sercan O. Arik
Honglak Lee
Tomas Pfister
NoLa
14
2
0
01 Oct 2019
Graph convolutional networks for learning with few clean and many noisy labels
Ahmet Iscen
Giorgos Tolias
Yannis Avrithis
Ondřej Chum
Cordelia Schmid
SSL
22
19
0
01 Oct 2019
Automated design of error-resilient and hardware-efficient deep neural networks
Christoph Schorn
T. Elsken
Sebastian Vogel
Armin Runge
A. Guntoro
G. Ascheid
AAML
17
32
0
30 Sep 2019
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Alexandros Stergiou
R. Poppe
3DH
20
19
0
30 Sep 2019
Gated Task Interaction Framework for Multi-task Sequence Tagging
Isaac. K. E. Ampomah
S. McClean
Zhiwei Lin
G. Hawe
12
1
0
29 Sep 2019
Pruning from Scratch
Yulong Wang
Xiaolu Zhang
Lingxi Xie
Jun Zhou
Hang Su
Bo Zhang
Xiaolin Hu
25
192
0
27 Sep 2019
ES-MAML: Simple Hessian-Free Meta Learning
Xingyou Song
Wenbo Gao
Yuxiang Yang
K. Choromanski
Aldo Pacchiano
Yunhao Tang
25
119
0
25 Sep 2019
Reducing Transformer Depth on Demand with Structured Dropout
Angela Fan
Edouard Grave
Armand Joulin
48
585
0
25 Sep 2019
Gated Channel Transformation for Visual Recognition
Zongxin Yang
Linchao Zhu
Yu Wu
Yezhou Yang
ViT
22
204
0
25 Sep 2019
Balanced One-shot Neural Architecture Optimization
Renqian Luo
Tao Qin
Enhong Chen
BDL
22
14
0
24 Sep 2019
A Simple yet Effective Baseline for Robust Deep Learning with Noisy Labels
Yucen Luo
Jun Zhu
Tomas Pfister
NoLa
26
6
0
20 Sep 2019
Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising
Haokui Zhang
Ying Li
Hao Chen
Chunhua Shen
AI4CE
20
57
0
18 Sep 2019
Empirical study towards understanding line search approximations for training neural networks
Younghwan Chae
D. Wilke
27
11
0
15 Sep 2019
Ouroboros: On Accelerating Training of Transformer-Based Language Models
Qian Yang
Zhouyuan Huo
Wenlin Wang
Heng-Chiao Huang
Lawrence Carin
25
9
0
14 Sep 2019
Dependency-Aware Named Entity Recognition with Relative and Global Attentions
Gustavo Aguilar
Thamar Solorio
15
9
0
11 Sep 2019
Learning Enhanced Resolution-wise features for Human Pose Estimation
Kun Zhang
Peng He
Ping Yao
Ge Chen
Chuanguang Yang
Min Du
Huimin Li
Li Fu
Tianyao Zheng
3DV
3DH
33
12
0
11 Sep 2019
Neural Architecture Search in Embedding Space
Chunmiao Liu
25
0
0
09 Sep 2019
A Baseline for Few-Shot Image Classification
Guneet Singh Dhillon
Pratik Chaudhari
Avinash Ravichandran
Stefano Soatto
36
575
0
06 Sep 2019
Best Practices for Scientific Research on Neural Architecture Search
Marius Lindauer
Frank Hutter
14
142
0
05 Sep 2019
Rethinking the Number of Channels for the Convolutional Neural Network
Hui Zhu
Zhulin An
Chuanguang Yang
Xiaolong Hu
Kaiqiang Xu
Yongjun Xu
OOD
11
3
0
04 Sep 2019
MetaMixUp: Learning Adaptive Interpolation Policy of MixUp with Meta-Learning
Zhijun Mai
Guosheng Hu
Dexiong Chen
Fumin Shen
Heng Tao Shen
22
41
0
27 Aug 2019
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
51
1,262
0
26 Aug 2019
Gated Convolutional Networks with Hybrid Connectivity for Image Classification
Chuanguang Yang
Zhulin An
Hui Zhu
Xiaolong Hu
Boyu Diao
Kaiqiang Xu
Chao Li
Yongjun Xu
30
51
0
26 Aug 2019
Mish: A Self Regularized Non-Monotonic Activation Function
Diganta Misra
28
678
0
23 Aug 2019
Pixel-wise Segmentation of Right Ventricle of Heart
Yaman Dang
Deepak Anand
A. Sethi
21
5
0
21 Aug 2019
Restricted Recurrent Neural Networks
Enmao Diao
Jie Ding
Vahid Tarokh
29
20
0
21 Aug 2019
Adaptative Inference Cost With Convolutional Neural Mixture Models
Adria Ruiz
Jakob Verbeek
VLM
30
22
0
19 Aug 2019
Demystifying Learning Rate Policies for High Accuracy Training of Deep Neural Networks
Yanzhao Wu
Ling Liu
Juhyun Bae
Ka-Ho Chow
Arun Iyengar
C. Pu
Wenqi Wei
Lei Yu
Qi Zhang
22
70
0
18 Aug 2019
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
Zhaoyang Zhang
Jingyu Li
Wenqi Shao
Zhanglin Peng
Ruimao Zhang
Xiaogang Wang
Ping Luo
27
37
0
16 Aug 2019
PHYRE: A New Benchmark for Physical Reasoning
A. Bakhtin
Laurens van der Maaten
Justin Johnson
Laura Gustafson
Ross B. Girshick
LRM
24
122
0
15 Aug 2019
Mix & Match: training convnets with mixed image sizes for improved accuracy, speed and scale resiliency
Elad Hoffer
Berry Weinstein
Itay Hubara
Tal Ben-Nun
Torsten Hoefler
Daniel Soudry
29
20
0
12 Aug 2019
Repetitive Reprediction Deep Decipher for Semi-Supervised Learning
G. Wang
Jianxin Wu
27
30
0
09 Aug 2019
Deep Learning for Visual Recognition of Environmental Enteropathy and Celiac Disease
A. Shrivastava
K. Kant
S. Sengupta
Sung-Jun Kang
Marium N. Khan
...
S. Moore
B. Amadi
P. Kelly
Donald E. Brown
Sana Syed
11
9
0
08 Aug 2019
How Does Learning Rate Decay Help Modern Neural Networks?
Kaichao You
Mingsheng Long
Jianmin Wang
Michael I. Jordan
30
4
0
05 Aug 2019
Attentive Normalization
Xilai Li
Wei Sun
Tianfu Wu
OOD
ViT
28
31
0
04 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
16
66
0
02 Aug 2019
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization
Chunfei Ma
Joonhyang Choi
Byeongwon Lee
Seungji Yang
19
0
0
25 Jul 2019
Temporally Consistent Horizon Lines
Florian Kluger
H. Ackermann
M. Yang
Bodo Rosenhahn
AI4TS
25
16
0
23 Jul 2019