ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.03983
  4. Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts

SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL
ArXivPDFHTML

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
Title
Residual Energy-Based Models for Text
Residual Energy-Based Models for Text
A. Bakhtin
Yuntian Deng
Sam Gross
Myle Ott
MarcÁurelio Ranzato
Arthur Szlam
37
13
0
06 Apr 2020
Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics
Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics
Simon Jenni
Hailin Jin
Paolo Favaro
SSL
22
45
0
05 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine
PaStaNet: Toward Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Shiyi Wang
Haoshu Fang
Ze Ma
Mingyang Chen
Cewu Lu
28
151
0
02 Apr 2020
Long Short-Term Relation Networks for Video Action Detection
Long Short-Term Relation Networks for Video Action Detection
Dong Li
Ting Yao
Zhaofan Qiu
Houqiang Li
Tao Mei
12
22
0
31 Mar 2020
RetinaTrack: Online Single Stage Joint Detection and Tracking
RetinaTrack: Online Single Stage Joint Detection and Tracking
Zhichao Lu
V. Rathod
Ronny Votel
Jonathan Huang
VOT
41
188
0
30 Mar 2020
Learning Memory-guided Normality for Anomaly Detection
Learning Memory-guided Normality for Anomaly Detection
Hyunjong Park
Jongyoun Noh
Bumsub Ham
23
625
0
30 Mar 2020
Disturbance-immune Weight Sharing for Neural Architecture Search
Disturbance-immune Weight Sharing for Neural Architecture Search
Shuaicheng Niu
Jiaxiang Wu
Yifan Zhang
Yong Guo
P. Zhao
Junzhou Huang
Mingkui Tan
20
27
0
29 Mar 2020
NPENAS: Neural Predictor Guided Evolution for Neural Architecture Search
NPENAS: Neural Predictor Guided Evolution for Neural Architecture Search
Chen Wei
Chuang Niu
Yiping Tang
Yue Wang
Haihong Hu
Jimin Liang
30
7
0
28 Mar 2020
An Investigation into the Stochasticity of Batch Whitening
An Investigation into the Stochasticity of Batch Whitening
Lei Huang
Lei Zhao
Yi Zhou
Fan Zhu
Li Liu
Ling Shao
12
18
0
27 Mar 2020
Spatiotemporal Adaptive Neural Network for Long-term Forecasting of
  Financial Time Series
Spatiotemporal Adaptive Neural Network for Long-term Forecasting of Financial Time Series
Philippe Chatigny
Jean-Marc Patenaude
Shengrui Wang
AI4TS
19
5
0
27 Mar 2020
Negative Margin Matters: Understanding Margin in Few-shot Classification
Negative Margin Matters: Understanding Margin in Few-shot Classification
Bin Liu
Yue Cao
Yutong Lin
Qi Li
Zheng-Wei Zhang
Mingsheng Long
Han Hu
37
318
0
26 Mar 2020
Are Labels Necessary for Neural Architecture Search?
Are Labels Necessary for Neural Architecture Search?
Chenxi Liu
Piotr Dollár
Kaiming He
Ross B. Girshick
Alan Yuille
Saining Xie
24
75
0
26 Mar 2020
Auto-Ensemble: An Adaptive Learning Rate Scheduling based Deep Learning
  Model Ensembling
Auto-Ensemble: An Adaptive Learning Rate Scheduling based Deep Learning Model Ensembling
Jun Yang
Fei Wang
20
32
0
25 Mar 2020
Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval
Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval
Shuhei Yokoo
Kohei Ozaki
E. Simo-Serra
S. Iizuka
22
31
0
25 Mar 2020
BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage
  Models
BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Jiahui Yu
Pengchong Jin
Hanxiao Liu
Gabriel Bender
Pieter-Jan Kindermans
Mingxing Tan
Thomas Huang
Xiaodan Song
Ruoming Pang
Quoc V. Le
29
302
0
24 Mar 2020
Model-based Asynchronous Hyperparameter and Neural Architecture Search
Model-based Asynchronous Hyperparameter and Neural Architecture Search
Aaron Klein
Louis C. Tiao
Thibaut Lienart
Cédric Archambeau
Matthias Seeger
31
5
0
24 Mar 2020
Robust and On-the-fly Dataset Denoising for Image Classification
Robust and On-the-fly Dataset Denoising for Image Classification
Jiaming Song
Lunjia Hu
Michael Auli
Yann N. Dauphin
Tengyu Ma
NoLa
OOD
24
13
0
24 Mar 2020
Meta Pseudo Labels
Meta Pseudo Labels
Hieu H. Pham
Zihang Dai
Qizhe Xie
Minh-Thang Luong
Quoc V. Le
VLM
262
659
0
23 Mar 2020
BS-NAS: Broadening-and-Shrinking One-Shot NAS with Searchable Numbers of
  Channels
BS-NAS: Broadening-and-Shrinking One-Shot NAS with Searchable Numbers of Channels
Zan Shen
Jiang Qian
Bojin Zhuang
Shaojun Wang
Jing Xiao
28
5
0
22 Mar 2020
Robust Out-of-distribution Detection for Neural Networks
Robust Out-of-distribution Detection for Neural Networks
Jiefeng Chen
Yixuan Li
Xi Wu
Yingyu Liang
S. Jha
OODD
161
85
0
21 Mar 2020
FTT-NAS: Discovering Fault-Tolerant Convolutional Neural Architecture
FTT-NAS: Discovering Fault-Tolerant Convolutional Neural Architecture
Xuefei Ning
Guangjun Ge
Wenshuo Li
Zhenhua Zhu
Yin Zheng
Xiaoming Chen
Zhen Gao
Yu Wang
Huazhong Yang
39
24
0
20 Mar 2020
Selecting Relevant Features from a Multi-domain Representation for
  Few-shot Classification
Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification
Nikita Dvornik
Cordelia Schmid
Julien Mairal
VLM
178
24
0
20 Mar 2020
FocalMix: Semi-Supervised Learning for 3D Medical Image Detection
FocalMix: Semi-Supervised Learning for 3D Medical Image Detection
Dong Wang
Yuan Zhang
Kexin Zhang
Liwei Wang
81
120
0
20 Mar 2020
Learning to Structure an Image with Few Colors
Learning to Structure an Image with Few Colors
Yunzhong Hou
Liang Zheng
Stephen Gould
MQ
40
19
0
17 Mar 2020
Domain Adaptive Ensemble Learning
Domain Adaptive Ensemble Learning
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
OOD
140
274
0
16 Mar 2020
Gated Texture CNN for Efficient and Configurable Image Denoising
Gated Texture CNN for Efficient and Configurable Image Denoising
Kaito Imai
T. Miyata
31
2
0
16 Mar 2020
Stochastic gradient descent with random learning rate
Stochastic gradient descent with random learning rate
Daniele Musso
ODL
12
4
0
15 Mar 2020
Learning Enriched Features for Real Image Restoration and Enhancement
Learning Enriched Features for Real Image Restoration and Enhancement
Syed Waqas Zamir
Aditya Arora
Salman Khan
Munawar Hayat
Fahad Shahbaz Khan
Ming-Hsuan Yang
Ling Shao
SupR
32
595
0
15 Mar 2020
Finnish Language Modeling with Deep Transformer Models
Finnish Language Modeling with Deep Transformer Models
Abhilash Jain
Aku Rouhe
Stig-Arne Gronroos
M. Kurimo
14
0
0
14 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019
Top-1 Solution of Multi-Moments in Time Challenge 2019
Manyuan Zhang
Hao Shao
Guanglu Song
Yu Liu
Junjie Yan
29
3
0
12 Mar 2020
PONAS: Progressive One-shot Neural Architecture Search for Very
  Efficient Deployment
PONAS: Progressive One-shot Neural Architecture Search for Very Efficient Deployment
Sian-Yao Huang
W. Chu
14
11
0
11 Mar 2020
Hierarchical Neural Architecture Search for Single Image
  Super-Resolution
Hierarchical Neural Architecture Search for Single Image Super-Resolution
Yong Guo
Yongsheng Luo
Zhenhao He
Jin Huang
Jian Chen
SupR
62
49
0
10 Mar 2020
Improved Baselines with Momentum Contrastive Learning
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
281
3,381
0
09 Mar 2020
Flexible numerical optimization with ensmallen
Flexible numerical optimization with ensmallen
Ryan R. Curtin
Marcus Edel
Rahul Prabhu
S. Basak
Zhihao Lou
Conrad Sanderson
18
1
0
09 Mar 2020
Wide-minima Density Hypothesis and the Explore-Exploit Learning Rate
  Schedule
Wide-minima Density Hypothesis and the Explore-Exploit Learning Rate Schedule
Nikhil Iyer
V. Thejas
Nipun Kwatra
Ramachandran Ramjee
Muthian Sivathanu
16
28
0
09 Mar 2020
Fine-Grained Visual Classification via Progressive Multi-Granularity
  Training of Jigsaw Patches
Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches
Ruoyi Du
Dongliang Chang
A. Bhunia
Jiyang Xie
Zhanyu Ma
Yi-Zhe Song
Jun Guo
76
291
0
08 Mar 2020
GeoConv: Geodesic Guided Convolution for Facial Action Unit Recognition
GeoConv: Geodesic Guided Convolution for Facial Action Unit Recognition
Yuedong Chen
Guoxian Song
Zhiwen Shao
Jianfei Cai
Tat-Jen Cham
Jianming Zheng
CVBM
12
27
0
06 Mar 2020
Deep Learning Approach to Diabetic Retinopathy Detection
Deep Learning Approach to Diabetic Retinopathy Detection
B. Tymchenko
Philip Marchenko
D. Spodarets
38
157
0
03 Mar 2020
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection
Mao Ye
Chengyue Gong
Lizhen Nie
Denny Zhou
Adam R. Klivans
Qiang Liu
43
108
0
03 Mar 2020
BATS: Binary ArchitecTure Search
BATS: Binary ArchitecTure Search
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
25
67
0
03 Mar 2020
Anytime Inference with Distilled Hierarchical Neural Ensembles
Anytime Inference with Distilled Hierarchical Neural Ensembles
Adria Ruiz
Jakob Verbeek
UQCV
BDL
FedML
52
6
0
03 Mar 2020
Heterogeneous Graph Transformer
Heterogeneous Graph Transformer
Ziniu Hu
Yuxiao Dong
Kuansan Wang
Yizhou Sun
190
1,171
0
03 Mar 2020
Variational inference formulation for a model-free simulation of a
  dynamical system with unknown parameters by a recurrent neural network
Variational inference formulation for a model-free simulation of a dynamical system with unknown parameters by a recurrent neural network
K. Yeo
D. E. C. Grullon
Fan-Keng Sun
Duane S. Boning
Jayant Kalagnanam
BDL
26
3
0
02 Mar 2020
Disentangling Adaptive Gradient Methods from Learning Rates
Disentangling Adaptive Gradient Methods from Learning Rates
Naman Agarwal
Rohan Anil
Elad Hazan
Tomer Koren
Cyril Zhang
27
34
0
26 Feb 2020
Train Large, Then Compress: Rethinking Model Size for Efficient Training
  and Inference of Transformers
Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
Zhuohan Li
Eric Wallace
Sheng Shen
Kevin Lin
Kurt Keutzer
Dan Klein
Joseph E. Gonzalez
22
148
0
26 Feb 2020
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video
  Super-Resolution
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
Xiaoyu Xiang
Yapeng Tian
Yulun Zhang
Y. Fu
J. Allebach
Chenliang Xu
SupR
19
169
0
26 Feb 2020
On Feature Normalization and Data Augmentation
On Feature Normalization and Data Augmentation
Boyi Li
Felix Wu
Ser-Nam Lim
Serge J. Belongie
Kilian Q. Weinberger
23
134
0
25 Feb 2020
Denoising IMU Gyroscopes with Deep Learning for Open-Loop Attitude
  Estimation
Denoising IMU Gyroscopes with Deep Learning for Open-Loop Attitude Estimation
Martin Brossard
Silvere Bonnabel
Axel Barrau
27
125
0
25 Feb 2020
Searching for Winograd-aware Quantized Networks
Searching for Winograd-aware Quantized Networks
Javier Fernandez-Marques
P. Whatmough
Andrew Mundy
Matthew Mattina
MQ
17
40
0
25 Feb 2020
FPConv: Learning Local Flattening for Point Convolution
FPConv: Learning Local Flattening for Point Convolution
Yiqun Lin
Zizheng Yan
Haibin Huang
Dong Du
Ligang Liu
Shuguang Cui
Xiaoguang Han
3DPC
22
145
0
25 Feb 2020
Previous
123...777879...848586
Next