SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
An Exponential Learning Rate Schedule for Deep Learning
Zhiyuan Li
Sanjeev Arora
12
212
0
16 Oct 2019
Aerial Images Processing for Car Detection using Convolutional Neural Networks: Comparison between Faster R-CNN and YoloV3
Adel Ammar
Anis Koubaa
Mohanned Ahmed
Abdulrahman Saad
Bilel Benjdira
27
90
0
16 Oct 2019
Learning Generalisable Omni-Scale Representations for Person Re-Identification
Kaiyang Zhou
Yongxin Yang
Andrea Cavallaro
Tao Xiang
30
217
0
15 Oct 2019
Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach
Haichuan Yang
Shupeng Gui
Yuhao Zhu
Ji Liu
MQ
28
5
0
14 Oct 2019
One-Shot Neural Architecture Search via Self-Evaluated Template Network
Xuanyi Dong
Yezhou Yang
ViT
16
184
0
13 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
28
661
0
12 Oct 2019
Demon: Improved Neural Network Training with Momentum Decay
John Chen
Cameron R. Wolfe
Zhaoqi Li
Anastasios Kyrillidis
ODL
24
15
0
11 Oct 2019
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation
Hang Gao
Xizhou Zhu
Steve Lin
Jifeng Dai
21
64
0
07 Oct 2019
Covariance-free Partial Least Squares: An Incremental Dimensionality Reduction Method
Artur Jordão
M. Lie
V. H. C. Melo
William Robson Schwartz
23
3
0
05 Oct 2019
SELF: Learning to Filter Noisy Labels with Self-Ensembling
Philipp Kratzer
Marc Toussaint
Thi Phuong Nhung Ngo
T. Nguyen
Jim Mainprice
Thomas Brox
NoLa
42
310
0
04 Oct 2019
Distillation ≈ Early Stopping? Harvesting Dark Knowledge Utilizing Anisotropic Information Retrieval For Overparameterized Neural Network
Bin Dong
Jikai Hou
Yiping Lu
Zhihua Zhang
28
41
0
02 Oct 2019
Emotion Recognition with Spatial Attention and Temporal Softmax Pooling
Masih Aminbeidokhti
Jikai Hou
Yiping Lu
Zhihua Zhang
CVBM
14
19
0
02 Oct 2019
Distilling Effective Supervision from Severe Label Noise
Zizhao Zhang
Han Zhang
Sercan O. Arik
Honglak Lee
Tomas Pfister
NoLa
14
2
0
01 Oct 2019
Graph convolutional networks for learning with few clean and many noisy labels
Ahmet Iscen
Giorgos Tolias
Yannis Avrithis
Ondřej Chum
Cordelia Schmid
SSL
22
19
0
01 Oct 2019
Automated design of error-resilient and hardware-efficient deep neural networks
Christoph Schorn
T. Elsken
Sebastian Vogel
Armin Runge
A. Guntoro
G. Ascheid
AAML
17
32
0
30 Sep 2019
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Alexandros Stergiou
R. Poppe
3DH
20
19
0
30 Sep 2019
Gated Task Interaction Framework for Multi-task Sequence Tagging
Isaac K. E. Ampomah
S. McClean
Zhiwei Lin
G. Hawe
12
1
0
29 Sep 2019
Pruning from Scratch
Yulong Wang
Xiaolu Zhang
Lingxi Xie
Jun Zhou
Hang Su
Bo Zhang
Xiaolin Hu
25
192
0
27 Sep 2019
ES-MAML: Simple Hessian-Free Meta Learning
Xingyou Song
Wenbo Gao
Yuxiang Yang
K. Choromanski
Aldo Pacchiano
Yunhao Tang
25
119
0
25 Sep 2019
Reducing Transformer Depth on Demand with Structured Dropout
Angela Fan
Edouard Grave
Armand Joulin
48
585
0
25 Sep 2019
Gated Channel Transformation for Visual Recognition
Zongxin Yang
Linchao Zhu
Yu Wu
Yezhou Yang
ViT
22
204
0
25 Sep 2019
Balanced One-shot Neural Architecture Optimization
Renqian Luo
Tao Qin
Enhong Chen
BDL
22
14
0
24 Sep 2019
A Simple yet Effective Baseline for Robust Deep Learning with Noisy Labels
Yucen Luo
Jun Zhu
Tomas Pfister
NoLa
26
6
0
20 Sep 2019
Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising
Haokui Zhang
Ying Li
Hao Chen
Chunhua Shen
AI4CE
20
57
0
18 Sep 2019
Empirical study towards understanding line search approximations for training neural networks
Younghwan Chae
D. Wilke
27
11
0
15 Sep 2019
Ouroboros: On Accelerating Training of Transformer-Based Language Models
Qian Yang
Zhouyuan Huo
Wenlin Wang
Heng-Chiao Huang
Lawrence Carin
25
9
0
14 Sep 2019
Dependency-Aware Named Entity Recognition with Relative and Global Attentions
Gustavo Aguilar
Thamar Solorio
15
9
0
11 Sep 2019
Learning Enhanced Resolution-wise features for Human Pose Estimation
Kun Zhang
Peng He
Ping Yao
Ge Chen
Chuanguang Yang
Min Du
Huimin Li
Li Fu
Tianyao Zheng
3DV
3DH
33
12
0
11 Sep 2019
Neural Architecture Search in Embedding Space
Chunmiao Liu
25
0
0
09 Sep 2019
A Baseline for Few-Shot Image Classification
Guneet Singh Dhillon
Pratik Chaudhari
Avinash Ravichandran
Stefano Soatto
36
575
0
06 Sep 2019
Best Practices for Scientific Research on Neural Architecture Search
Marius Lindauer
Frank Hutter
14
142
0
05 Sep 2019
Rethinking the Number of Channels for the Convolutional Neural Network
Hui Zhu
Zhulin An
Chuanguang Yang
Xiaolong Hu
Kaiqiang Xu
Yongjun Xu
OOD
11
3
0
04 Sep 2019
MetaMixUp: Learning Adaptive Interpolation Policy of MixUp with Meta-Learning
Zhijun Mai
Guosheng Hu
Dexiong Chen
Fumin Shen
Heng Tao Shen
22
41
0
27 Aug 2019
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
51
1,262
0
26 Aug 2019
Gated Convolutional Networks with Hybrid Connectivity for Image Classification
Chuanguang Yang
Zhulin An
Hui Zhu
Xiaolong Hu
Boyu Diao
Kaiqiang Xu
Chao Li
Yongjun Xu
30
51
0
26 Aug 2019
Mish: A Self Regularized Non-Monotonic Activation Function
Diganta Misra
28
678
0
23 Aug 2019
Pixel-wise Segmentation of Right Ventricle of Heart
Yaman Dang
Deepak Anand
A. Sethi
21
5
0
21 Aug 2019
Restricted Recurrent Neural Networks
Enmao Diao
Jie Ding
Vahid Tarokh
29
20
0
21 Aug 2019
Adaptative Inference Cost With Convolutional Neural Mixture Models
Adria Ruiz
Jakob Verbeek
VLM
30
22
0
19 Aug 2019
Demystifying Learning Rate Policies for High Accuracy Training of Deep Neural Networks
Yanzhao Wu
Ling Liu
Juhyun Bae
Ka-Ho Chow
Arun Iyengar
C. Pu
Wenqi Wei
Lei Yu
Qi Zhang
22
70
0
18 Aug 2019
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
Zhaoyang Zhang
Jingyu Li
Wenqi Shao
Zhanglin Peng
Ruimao Zhang
Xiaogang Wang
Ping Luo
27
37
0
16 Aug 2019
PHYRE: A New Benchmark for Physical Reasoning
A. Bakhtin
Laurens van der Maaten
Justin Johnson
Laura Gustafson
Ross B. Girshick
LRM
24
122
0
15 Aug 2019
Mix & Match: training convnets with mixed image sizes for improved accuracy, speed and scale resiliency
Elad Hoffer
Berry Weinstein
Itay Hubara
Tal Ben-Nun
Torsten Hoefler
Daniel Soudry
29
20
0
12 Aug 2019
Repetitive Reprediction Deep Decipher for Semi-Supervised Learning
G. Wang
Jianxin Wu
27
30
0
09 Aug 2019
Deep Learning for Visual Recognition of Environmental Enteropathy and Celiac Disease
A. Shrivastava
K. Kant
S. Sengupta
Sung-Jun Kang
Marium N. Khan
...
S. Moore
B. Amadi
P. Kelly
Donald E. Brown
Sana Syed
11
9
0
08 Aug 2019
How Does Learning Rate Decay Help Modern Neural Networks?
Kaichao You
Mingsheng Long
Jianmin Wang
Michael I. Jordan
30
4
0
05 Aug 2019
Attentive Normalization
Xilai Li
Wei Sun
Tianfu Wu
OOD
ViT
28
31
0
04 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
16
66
0
02 Aug 2019
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization
Chunfei Ma
Joonhyang Choi
Byeongwon Lee
Seungji Yang
19
0
0
25 Jul 2019
Temporally Consistent Horizon Lines
Florian Kluger
H. Ackermann
M. Yang
Bodo Rosenhahn
AI4TS
25
16
0
23 Jul 2019