ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.03983
  4. Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts

SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL
ArXivPDFHTML

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
Title
Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent
Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent
Bao Wang
T. Nguyen
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ODL
9
48
0
24 Feb 2020
Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating
  Unexpected Obstacle Detection for Road-driving Images
Real-time Fusion Network for RGB-D Semantic Segmentation Incorporating Unexpected Obstacle Detection for Road-driving Images
Lei Sun
Kailun Yang
Xinxin Hu
Weijian Hu
Kaiwei Wang
SSeg
23
130
0
24 Feb 2020
Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast
  Convergence
Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence
Nicolas Loizou
Sharan Vaswani
I. Laradji
Simon Lacoste-Julien
29
181
0
24 Feb 2020
Semi-Supervised Neural Architecture Search
Semi-Supervised Neural Architecture Search
Renqian Luo
Xu Tan
Rui Wang
Tao Qin
Enhong Chen
Tie-Yan Liu
13
88
0
24 Feb 2020
The Two Regimes of Deep Network Training
The Two Regimes of Deep Network Training
Guillaume Leclerc
Aleksander Madry
27
45
0
24 Feb 2020
Self-Adaptive Training: beyond Empirical Risk Minimization
Self-Adaptive Training: beyond Empirical Risk Minimization
Lang Huang
Chaoning Zhang
Hongyang R. Zhang
NoLa
29
198
0
24 Feb 2020
A New Unified Deep Learning Approach with
  Decomposition-Reconstruction-Ensemble Framework for Time Series Forecasting
A New Unified Deep Learning Approach with Decomposition-Reconstruction-Ensemble Framework for Time Series Forecasting
Guowei Zhang
Tao Ren
Yifan Yang
AI4TS
6
4
0
22 Feb 2020
Structured Sparsification with Joint Optimization of Group Convolution
  and Channel Shuffle
Structured Sparsification with Joint Optimization of Group Convolution and Channel Shuffle
Xinyu Zhang
Kai Zhao
Taihong Xiao
Mingg-Ming Cheng
Ming-Hsuan Yang
28
1
0
19 Feb 2020
Learning Architectures for Binary Networks
Learning Architectures for Binary Networks
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
25
44
0
17 Feb 2020
Class-Imbalanced Semi-Supervised Learning
Class-Imbalanced Semi-Supervised Learning
Minsung Hyun
Jisoo Jeong
Nojun Kwak
20
49
0
17 Feb 2020
BatchEnsemble: An Alternative Approach to Efficient Ensemble and
  Lifelong Learning
BatchEnsemble: An Alternative Approach to Efficient Ensemble and Lifelong Learning
Yeming Wen
Dustin Tran
Jimmy Ba
OOD
FedML
UQCV
32
483
0
17 Feb 2020
AutoLR: Layer-wise Pruning and Auto-tuning of Learning Rates in
  Fine-tuning of Deep Networks
AutoLR: Layer-wise Pruning and Auto-tuning of Learning Rates in Fine-tuning of Deep Networks
Youngmin Ro
J. Choi
22
5
0
14 Feb 2020
A Simple Framework for Contrastive Learning of Visual Representations
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
78
18,362
0
13 Feb 2020
Neuromorphologicaly-preserving Volumetric data encoding using VQ-VAE
Neuromorphologicaly-preserving Volumetric data encoding using VQ-VAE
Petru-Daniel Tudosiu
Thomas Varsavsky
Richard Shaw
M. Graham
P. Nachev
Sebastien Ourselin
Carole H. Sudre
M. Jorge Cardoso
MedIm
39
18
0
13 Feb 2020
A Second look at Exponential and Cosine Step Sizes: Simplicity,
  Adaptivity, and Performance
A Second look at Exponential and Cosine Step Sizes: Simplicity, Adaptivity, and Performance
Xiaoyun Li
Zhenxun Zhuang
Francesco Orabona
35
18
0
12 Feb 2020
Topologically Densified Distributions
Topologically Densified Distributions
Christoph Hofer
Florian Graf
Marc Niethammer
Roland Kwitt
27
15
0
12 Feb 2020
Machine-Learning-Based Diagnostics of EEG Pathology
Machine-Learning-Based Diagnostics of EEG Pathology
Lukas A. W. Gemein
R. Schirrmeister
P. Chrabaszcz
Daniel Wilson
Joschka Boedecker
A. Schulze-Bonhage
Frank Hutter
T. Ball
22
153
0
11 Feb 2020
Time-aware Large Kernel Convolutions
Time-aware Large Kernel Convolutions
Vasileios Lioutas
Yuhong Guo
AI4TS
24
29
0
08 Feb 2020
How Does BN Increase Collapsed Neural Network Filters?
How Does BN Increase Collapsed Neural Network Filters?
Sheng Zhou
Xinjiang Wang
Ping Luo
Xue Jiang
Wenjie Li
Wei Zhang
21
1
0
30 Jan 2020
H-OWAN: Multi-distorted Image Restoration with Tensor 1x1 Convolution
H-OWAN: Multi-distorted Image Restoration with Tensor 1x1 Convolution
Zihao Huang
Chao Li
Feng Duan
Qibin Zhao
30
5
0
29 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
197
207
0
23 Jan 2020
Lipreading using Temporal Convolutional Networks
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
Maja Pantic
168
239
0
23 Jan 2020
Multi-objective Neural Architecture Search via Non-stationary Policy
  Gradient
Multi-objective Neural Architecture Search via Non-stationary Policy Gradient
Zewei Chen
Fengwei Zhou
George Trimponias
Zhenguo Li
17
9
0
23 Jan 2020
Optimized Generic Feature Learning for Few-shot Classification across
  Domains
Optimized Generic Feature Learning for Few-shot Classification across Domains
Tonmoy Saikia
Thomas Brox
Cordelia Schmid
VLM
30
48
0
22 Jan 2020
FixMatch: Simplifying Semi-Supervised Learning with Consistency and
  Confidence
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Kihyuk Sohn
David Berthelot
Chun-Liang Li
Zizhao Zhang
Nicholas Carlini
E. D. Cubuk
Alexey Kurakin
Han Zhang
Colin Raffel
AAML
104
3,479
0
21 Jan 2020
Towards More Efficient and Effective Inference: The Joint Decision of
  Multi-Participants
Towards More Efficient and Effective Inference: The Joint Decision of Multi-Participants
Hui Zhu
Zhulin An
Kaiqiang Xu
Xiaolong Hu
Yongjun Xu
11
0
0
19 Jan 2020
BNAS:An Efficient Neural Architecture Search Approach Using Broad
  Scalable Architecture
BNAS:An Efficient Neural Architecture Search Approach Using Broad Scalable Architecture
Zixiang Ding
Yaran Chen
Nannan Li
Dongbin Zhao
Zhiquan Sun
C. L. Philip Chen
13
0
0
18 Jan 2020
Harmonic Convolutional Networks based on Discrete Cosine Transform
Harmonic Convolutional Networks based on Discrete Cosine Transform
Matej Ulicny
V. Krylov
Rozenn Dahyot
27
34
0
18 Jan 2020
Compounding the Performance Improvements of Assembled Techniques in a
  Convolutional Neural Network
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network
Jungkyu Lee
Taeryun Won
Tae Kwan Lee
Hyemin Lee
Geonmo Gu
K. Hong
34
57
0
17 Jan 2020
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised
  Learning
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning
Paola Cascante-Bonilla
Fuwen Tan
Yanjun Qi
Vicente Ordonez
ODL
50
23
0
16 Jan 2020
Assessing Robustness of Deep learning Methods in Dermatological Workflow
Assessing Robustness of Deep learning Methods in Dermatological Workflow
Sourav Mishra
Subhajit Chaudhury
Hideaki Imaizumi
T. Yamasaki
OOD
13
4
0
15 Jan 2020
CycleCluster: Modernising Clustering Regularisation for Deep
  Semi-Supervised Classification
CycleCluster: Modernising Clustering Regularisation for Deep Semi-Supervised Classification
P. Sellars
Angelica Aviles-Rivero
Carola Bibiane Schönlieb
14
0
0
15 Jan 2020
SEERL: Sample Efficient Ensemble Reinforcement Learning
SEERL: Sample Efficient Ensemble Reinforcement Learning
Rohan Saphal
Balaraman Ravindran
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
18
19
0
15 Jan 2020
Invertible Generative Modeling using Linear Rational Splines
Invertible Generative Modeling using Linear Rational Splines
H. M. Dolatabadi
S. Erfani
C. Leckie
40
65
0
15 Jan 2020
Noisy Machines: Understanding Noisy Neural Networks and Enhancing
  Robustness to Analog Hardware Errors Using Distillation
Noisy Machines: Understanding Noisy Neural Networks and Enhancing Robustness to Analog Hardware Errors Using Distillation
Chuteng Zhou
Prad Kadambi
Matthew Mattina
P. Whatmough
21
35
0
14 Jan 2020
Natural Image Matting via Guided Contextual Attention
Natural Image Matting via Guided Contextual Attention
Yaoyi Li
Hongtao Lu
20
163
0
13 Jan 2020
A Continuous Space Neural Language Model for Bengali Language
A Continuous Space Neural Language Model for Bengali Language
Hemayet Ahmed Chowdhury
Md. Azizul Haque Imon
Anisur Rahman
Aisha Khatun
Md. Saiful Islam
25
2
0
11 Jan 2020
Pruning Convolutional Neural Networks with Self-Supervision
Pruning Convolutional Neural Networks with Self-Supervision
Mathilde Caron
Ari S. Morcos
Piotr Bojanowski
Julien Mairal
Armand Joulin
SSL
3DPC
25
12
0
10 Jan 2020
CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus
CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus
Florian Kluger
Eric Brachmann
H. Ackermann
Carsten Rother
M. Yang
Bodo Rosenhahn
46
58
0
08 Jan 2020
Fast Neural Network Adaptation via Parameter Remapping and Architecture
  Search
Fast Neural Network Adaptation via Parameter Remapping and Architecture Search
Jiemin Fang
Yuzhu Sun
Kangjian Peng
Qian Zhang
Yuan Li
Wenyu Liu
Xinggang Wang
SSeg
14
34
0
08 Jan 2020
Deeper Insights into Weight Sharing in Neural Architecture Search
Deeper Insights into Weight Sharing in Neural Architecture Search
Yuge Zhang
Zejun Lin
Junyan Jiang
Quanlu Zhang
Yujing Wang
Hui Xue
Chen Zhang
Yaming Yang
35
49
0
06 Jan 2020
Discrimination-aware Network Pruning for Deep Model Compression
Discrimination-aware Network Pruning for Deep Model Compression
Jing Liu
Bohan Zhuang
Zhuangwei Zhuang
Yong Guo
Junzhou Huang
Jin-Hui Zhu
Mingkui Tan
CVBM
19
119
0
04 Jan 2020
NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture
  Search
NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search
Xuanyi Dong
Yi Yang
46
698
0
02 Jan 2020
AdderNet: Do We Really Need Multiplications in Deep Learning?
AdderNet: Do We Really Need Multiplications in Deep Learning?
Hanting Chen
Yunhe Wang
Chunjing Xu
Boxin Shi
Chao Xu
Qi Tian
Chang Xu
29
194
0
31 Dec 2019
NAS evaluation is frustratingly hard
NAS evaluation is frustratingly hard
Antoine Yang
P. Esperança
Fabio Maria Carlucci
16
167
0
28 Dec 2019
CProp: Adaptive Learning Rate Scaling from Past Gradient Conformity
CProp: Adaptive Learning Rate Scaling from Past Gradient Conformity
Konpat Preechakul
B. Kijsirikul
ODL
36
3
0
24 Dec 2019
Big Transfer (BiT): General Visual Representation Learning
Big Transfer (BiT): General Visual Representation Learning
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
J. Puigcerver
Jessica Yung
Sylvain Gelly
N. Houlsby
MQ
114
1,183
0
24 Dec 2019
Robustness of Brain Tumor Segmentation
Robustness of Brain Tumor Segmentation
Sabine Müller
Joachim Weickert
N. Graf
AAML
OOD
28
5
0
24 Dec 2019
Computation Reallocation for Object Detection
Computation Reallocation for Object Detection
Feng Liang
Chen Lin
Ronghao Guo
Ming Sun
Wei Wu
Junjie Yan
Wanli Ouyang
ObjD
50
35
0
24 Dec 2019
Towards Efficient Training for Neural Network Quantization
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
27
42
0
21 Dec 2019
Previous
123...787980...848586
Next