ResearchTrend.AI
SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
Adaptive Regularization via Residual Smoothing in Deep Learning Optimization
Jung-Kyun Cho
Junseok Kwon
Byung-Woo Hong
31
1
0
23 Jul 2019
MemNet: Memory-Efficiency Guided Neural Architecture Search with Augment-Trim learning
Peiye Liu
Bo Wu
Huadong Ma
Mingoo Seok
28
6
0
22 Jul 2019
XferNAS: Transfer Neural Architecture Search
Martin Wistuba
29
12
0
18 Jul 2019
MintNet: Building Invertible Neural Networks with Masked Convolutions
Yang Song
Chenlin Meng
Stefano Ermon
24
68
0
18 Jul 2019
A Unified Deep Framework for Joint 3D Pose Estimation and Action Recognition from a Single RGB Camera
H. Pham
H. Salmane
L. Khoudour
Alain Crouzil
Pablo Zegers
S. Velastín
21
46
0
16 Jul 2019
Towards Explaining the Regularization Effect of Initial Large Learning Rate in Training Neural Networks
Yuanzhi Li
Colin Wei
Tengyu Ma
14
292
0
10 Jul 2019
EPNAS: Efficient Progressive Neural Architecture Search
Yanqi Zhou
Peng Wang
Sercan O. Arik
Haonan Yu
Syed Zawad
Feng Yan
G. Diamos
24
5
0
07 Jul 2019
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search
Xiangxiang Chu
Bo Zhang
Ruijun Xu
24
332
0
03 Jul 2019
Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty
Dan Hendrycks
Mantas Mazeika
Saurav Kadavath
D. Song
OOD
SSL
10
936
0
28 Jun 2019
EmotionX-KU: BERT-Max based Contextual Emotion Classifier
Kisu Yang
Dongyub Lee
Taesun Whang
Seolhwa Lee
Heuiseok Lim
27
32
0
27 Jun 2019
Learning Data Augmentation Strategies for Object Detection
Barret Zoph
E. D. Cubuk
Golnaz Ghiasi
Nayeon Lee
Jonathon Shlens
Quoc V. Le
39
523
0
26 Jun 2019
Monte Carlo Gradient Estimation in Machine Learning
S. Mohamed
Mihaela Rosca
Michael Figurnov
A. Mnih
45
400
0
25 Jun 2019
Variations on the Chebyshev-Lagrange Activation Function
Yuchen Li
Frank Rudzicz
Jekaterina Novikova
24
1
0
24 Jun 2019
Densely Connected Search Space for More Flexible Neural Architecture Search
Jiemin Fang
Yuzhu Sun
Qian Zhang
Yuan Li
Wenyu Liu
Xinggang Wang
21
122
0
23 Jun 2019
XNAS: Neural Architecture Search with Expert Advice
Niv Nayman
Asaf Noy
T. Ridnik
Itamar Friedman
Rong Jin
Lihi Zelnik-Manor
18
128
0
19 Jun 2019
Towards White-box Benchmarks for Algorithm Control
André Biedenkapp
H. Bozkurt
Frank Hutter
Marius Lindauer
76
7
0
18 Jun 2019
Stand-Alone Self-Attention in Vision Models
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
38
1,199
0
13 Jun 2019
Table-Based Neural Units: Fully Quantizing Networks for Multiply-Free Inference
Michele Covell
David Marwood
S. Baluja
Nick Johnston
MQ
19
7
0
11 Jun 2019
FASTER Recurrent Networks for Efficient Video Classification
Linchao Zhu
Laura Sevilla-Lara
Du Tran
Matt Feiszli
Yi Yang
Heng Wang
49
6
0
10 Jun 2019
BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget
Jack Turner
Elliot J. Crowley
Michael F. P. O'Boyle
Amos Storkey
Gavia Gray
33
37
0
10 Jun 2019
Large-scale Landmark Retrieval/Recognition under a Noisy and Diverse Dataset
Kohei Ozaki
Shuhei Yokoo
17
35
0
10 Jun 2019
Neural Spline Flows
Conor Durkan
Artur Bekasov
Iain Murray
George Papamakarios
DRL
41
748
0
10 Jun 2019
Real or Fake? Learning to Discriminate Machine from Human Generated Text
A. Bakhtin
Sam Gross
Myle Ott
Yuntian Deng
Marc'Aurelio Ranzato
Arthur Szlam
DeLMO
16
170
0
07 Jun 2019
Video Modeling with Correlation Networks
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
24
127
0
07 Jun 2019
STN-Homography: estimate homography parameters directly
Qiang-feng Zhou
Xin Li
19
6
0
06 Jun 2019
Cubic-Spline Flows
Conor Durkan
Artur Bekasov
Iain Murray
George Papamakarios
TPM
53
57
0
05 Jun 2019
Learning Deep Image Priors for Blind Image Denoising
Xianxu Hou
Hongming Luo
Jingxin Liu
Bolei Xu
Ke Sun
Yuanhao Gong
Bozhi Liu
Guoping Qiu
33
7
0
04 Jun 2019
Hierarchical Auxiliary Learning
Jaehoon Cha
Kyeong Soo Kim
Sanghyuk Lee
22
4
0
03 Jun 2019
Robust Learning Under Label Noise With Iterative Noise-Filtering
D. Nguyen
Thi-Phuong-Nhung Ngo
Zhongyu Lou
Michael Klar
Laura Beggel
Thomas Brox
NoLa
19
16
0
01 Jun 2019
Unlabeled Data Improves Adversarial Robustness
Y. Carmon
Aditi Raghunathan
Ludwig Schmidt
Percy Liang
John C. Duchi
63
746
0
31 May 2019
Efficient Forward Architecture Search
Hanzhang Hu
John Langford
R. Caruana
Saurajit Mukherjee
Eric Horvitz
Debadeepta Dey
17
40
0
31 May 2019
Time Matters in Regularizing Deep Networks: Weight Decay and Data Augmentation Affect Early Learning Dynamics, Matter Little Near Convergence
Aditya Golatkar
Alessandro Achille
Stefano Soatto
30
95
0
30 May 2019
Toward Runtime-Throttleable Neural Networks
Jesse Hostetler
27
2
0
30 May 2019
Global Momentum Compression for Sparse Communication in Distributed Learning
Chang-Wei Shi
Shen-Yi Zhao
Yin-Peng Xie
Hao Gao
Wu-Jun Li
35
1
0
30 May 2019
RecNets: Channel-wise Recurrent Convolutional Neural Networks
George Retsinas
Athena Elafrou
G. Goumas
Petros Maragos
20
2
0
28 May 2019
Stochastic Gradient Methods with Layer-wise Adaptive Moments for Training of Deep Networks
Boris Ginsburg
P. Castonguay
Oleksii Hrinchuk
Oleksii Kuchaiev
Vitaly Lavrukhin
Ryan Leary
Jason Chun Lok Li
Huyen Nguyen
Yang Zhang
Jonathan M. Cohen
ODL
25
13
0
27 May 2019
Value Iteration Networks on Multiple Levels of Abstraction
Daniel Schleich
Tobias Klamt
Sven Behnke
14
18
0
27 May 2019
Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates
Sharan Vaswani
Aaron Mishkin
I. Laradji
Mark Schmidt
Gauthier Gidel
Simon Lacoste-Julien
ODL
44
205
0
24 May 2019
EnsembleNet: End-to-End Optimization of Multi-headed Models
Hanhan Li
Joe Yue-Hei Ng
Apostol Natsev
20
16
0
24 May 2019
Blockwise Adaptivity: Faster Training and Better Generalization in Deep Learning
Shuai Zheng
James T. Kwok
ODL
27
5
0
23 May 2019
Network Pruning via Transformable Architecture Search
Xuanyi Dong
Yi Yang
3DPC
23
241
0
23 May 2019
Countering Noisy Labels By Learning From Auxiliary Clean Labels
Tsung Wei Tsai
Chongxuan Li
Jun Zhu
SSL
10
1
0
23 May 2019
Adaptive Stochastic Natural Gradient Method for One-Shot Neural Architecture Search
Youhei Akimoto
Shinichi Shirakawa
Nozomu Yoshinari
Kento Uchida
Shota Saito
K. Nishida
27
86
0
21 May 2019
AutoDispNet: Improving Disparity Estimation With AutoML
Tonmoy Saikia
Yassine Marrakchi
Arber Zela
Frank Hutter
Thomas Brox
19
78
0
17 May 2019
Learning What and Where to Transfer
Yunhun Jang
Hankook Lee
Sung Ju Hwang
Jinwoo Shin
22
149
0
15 May 2019
Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules
Daniel Ho
Eric Liang
Ion Stoica
Pieter Abbeel
Xi Chen
35
397
0
14 May 2019
BayesNAS: A Bayesian Approach for Neural Architecture Search
Hongpeng Zhou
Minghao Yang
Jun Wang
Wei Pan
BDL
22
197
0
13 May 2019
Dynamic Routing Networks
Shaofeng Cai
Yao Shu
Wei Wang
Beng Chin Ooi
23
3
0
13 May 2019
Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints
Mengtian Li
Ersin Yumer
Deva Ramanan
22
46
0
12 May 2019
Training CNNs with Selective Allocation of Channels
Jongheon Jeong
Jinwoo Shin
CVBM
39
15
0
11 May 2019