ResearchTrend.AI
SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,296 papers shown
Well-tuned Simple Nets Excel on Tabular Datasets
Arlind Kadra
Marius Lindauer
Frank Hutter
Josif Grabocka
23
186
0
21 Jun 2021
Better Training using Weight-Constrained Stochastic Dynamics
Benedict Leimkuhler
Tiffany J. Vlaar
Timothée Pouchon
Amos Storkey
31
9
0
20 Jun 2021
More than Encoder: Introducing Transformer Decoder to Upsample
Yijiang Li
Wentian Cai
Ying Gao
Chengming Li
Xiping Hu
ViT
MedIm
35
52
0
20 Jun 2021
Teacher's pet: understanding and mitigating biases in distillation
Michal Lukasik
Srinadh Bhojanapalli
A. Menon
Sanjiv Kumar
23
25
0
19 Jun 2021
Towards Single Stage Weakly Supervised Semantic Segmentation
Peri Akiva
Kristin J. Dana
56
8
0
18 Jun 2021
It's FLAN time! Summing feature-wise latent representations for interpretability
An-phi Nguyen
María Rodríguez Martínez
FAtt
18
0
0
18 Jun 2021
Being a Bit Frequentist Improves Bayesian Neural Networks
Agustinus Kristiadi
Matthias Hein
Philipp Hennig
BDL
UQCV
29
15
0
18 Jun 2021
Medical Matting: A New Perspective on Medical Segmentation with Uncertainty
Lin Wang
Lie Ju
Xin Wang
Wanji He
Donghao Zhang
...
Zhiwen Yang
Xuan-Liang Yao
Xin Zhao
Xiufen Ye
Z. Ge
37
3
0
18 Jun 2021
Pruning Randomly Initialized Neural Networks with Iterative Randomization
Daiki Chijiwa
Shin'ya Yamaguchi
Yasutoshi Ida
Kenji Umakoshi
T. Inoue
22
25
0
17 Jun 2021
Long-Short Temporal Contrastive Learning of Video Transformers
Jue Wang
Gedas Bertasius
Du Tran
Lorenzo Torresani
VLM
ViT
52
50
0
17 Jun 2021
Amortized Auto-Tuning: Cost-Efficient Bayesian Transfer Optimization for Hyperparameter Recommendation
Yuxin Xiao
Eric P. Xing
Willie Neiswanger
40
5
0
17 Jun 2021
To Raise or Not To Raise: The Autonomous Learning Rate Question
Xiaomeng Dong
Tao Tan
Michael Potter
Yun-Chan Tsai
Gaurav Kumar
V. R. Saripalli
Theodore Trafalis
OOD
13
2
0
16 Jun 2021
EdgeConv with Attention Module for Monocular Depth Estimation
Minhyeok Lee
Sangwon Hwang
Chaewon Park
Sangyoun Lee
MDE
3DPC
18
17
0
16 Jun 2021
FastAno: Fast Anomaly Detection via Spatio-temporal Patch Transformation
Chaewon Park
Myeongah Cho
Minhyeok Lee
Sangyoun Lee
29
31
0
16 Jun 2021
Domain Consistency Regularization for Unsupervised Multi-source Domain Adaptive Classification
Zhipeng Luo
Xiaobing Zhang
Shijian Lu
Shuai Yi
69
21
0
16 Jun 2021
Delving Deep into the Generalization of Vision Transformers under Distribution Shifts
Chongzhi Zhang
Mingyuan Zhang
Shanghang Zhang
Daisheng Jin
Qiang-feng Zhou
Zhongang Cai
Haiyu Zhao
Xianglong Liu
Ziwei Liu
31
103
0
14 Jun 2021
Revisiting consistency for semi-supervised semantic segmentation
Ivan Grubišić
Marin Orsic
Sinisa Segvic
19
4
0
13 Jun 2021
Wide Mean-Field Variational Bayesian Neural Networks Ignore the Data
Beau Coker
Weiwei Pan
Finale Doshi-Velez
BDL
35
9
0
13 Jun 2021
A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration
Yuhang Li
Shi-Wee Deng
Xin Dong
Ruihao Gong
Shi Gu
30
187
0
13 Jun 2021
Video Super-Resolution Transformer
Jie Cao
Yawei Li
Peng Sun
Luc Van Gool
ViT
44
168
0
12 Jun 2021
Adversarial Robustness via Fisher-Rao Regularization
Marine Picot
Francisco Messina
Malik Boudiaf
Fabrice Labeau
Ismail Ben Ayed
Pablo Piantanida
AAML
31
23
0
12 Jun 2021
Hybrid Generative-Contrastive Representation Learning
Saehoon Kim
Sungwoong Kim
Juho Lee
SSL
22
11
0
11 Jun 2021
Overcoming Difficulty in Obtaining Dark-skinned Subjects for Remote-PPG by Synthetic Augmentation
Yunhao Ba
Zhen Wang
Kerim Doruk Karinca
Oyku Deniz Bozkurt
A. Kadambi
38
10
0
10 Jun 2021
Space-time Mixing Attention for Video Transformer
Adrian Bulat
Juan-Manuel Perez-Rua
Swathikiran Sudhakaran
Brais Martínez
Georgios Tzimiropoulos
ViT
41
125
0
10 Jun 2021
MST: Masked Self-Supervised Transformer for Visual Representation
Zhaowen Li
Zhiyang Chen
Fan Yang
Wei Li
Yousong Zhu
...
Rui Deng
Liwei Wu
Rui Zhao
Ming Tang
Jinqiao Wang
ViT
52
163
0
10 Jun 2021
DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents
Eun-Soo Jung
HyeongGwan Son
Kyusam Oh
Yongkeun Yun
Soonhwan Kwon
Min Soo Kim
54
4
0
10 Jun 2021
Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows
Brendan Leigh Ross
Jesse C. Cresswell
TPM
47
32
0
09 Jun 2021
Knowledge distillation: A good teacher is patient and consistent
Lucas Beyer
Xiaohua Zhai
Amelie Royer
L. Markeeva
Rohan Anil
Alexander Kolesnikov
VLM
52
288
0
09 Jun 2021
Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields
Yifan Wang
Lukas Rahmann
O. Sorkine-Hornung
27
65
0
09 Jun 2021
Broadcasted Residual Learning for Efficient Keyword Spotting
Byeonggeun Kim
Simyung Chang
Jinkyu Lee
Dooyong Sung
37
122
0
08 Jun 2021
Multi-dataset Pretraining: A Unified Model for Semantic Segmentation
Bowen Shi
Xiaopeng Zhang
Haohang Xu
Wenrui Dai
Junni Zou
H. Xiong
Qi Tian
19
7
0
08 Jun 2021
Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained Environments
Jingyang Zhang
Nathan Inkawhich
Randolph Linderman
Yiran Chen
H. Li
OODD
27
53
0
07 Jun 2021
Incremental False Negative Detection for Contrastive Learning
Tsai-Shien Chen
Wei-Chih Hung
Hung-Yu Tseng
Shao-Yi Chien
Ming-Hsuan Yang
SSL
CLL
18
61
0
07 Jun 2021
Channel DropBlock: An Improved Regularization Method for Fine-Grained Visual Classification
Yifeng Ding
Shuwei Dong
Yujun Tong
Zhanyu Ma
Bo Xiao
Haibin Ling
35
7
0
07 Jun 2021
Vision Transformers with Hierarchical Attention
Yun-Hai Liu
Yu-Huan Wu
Guolei Sun
Le Zhang
Ajad Chhatkuli
Luc Van Gool
ViT
48
34
0
06 Jun 2021
Bandwidth-based Step-Sizes for Non-Convex Stochastic Optimization
Xiaoyu Wang
M. Johansson
33
2
0
05 Jun 2021
RegionViT: Regional-to-Local Attention for Vision Transformers
Chun-Fu Chen
Yikang Shen
Quanfu Fan
ViT
59
195
0
04 Jun 2021
Aligning Pretraining for Detection via Object-Level Contrastive Learning
Fangyun Wei
Yue Gao
Zhirong Wu
Han Hu
Stephen Lin
ObjD
32
144
0
04 Jun 2021
COLD: Concurrent Loads Disaggregator for Non-Intrusive Load Monitoring
I. Kamyshev
Dmitrii Kriukov
E. Gryazina
Elena Gryazina
Henni Ouerdane
29
8
0
04 Jun 2021
Convergent Graph Solvers
Junyoung Park
J. Choo
Jinkyoo Park
39
13
0
03 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
35
55
0
03 Jun 2021
SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate Training
Enmao Diao
Jie Ding
Vahid Tarokh
FedML
30
74
0
02 Jun 2021
A Generalizable Approach to Learning Optimizers
Diogo Almeida
Clemens Winter
Jie Tang
Wojciech Zaremba
AI4CE
25
29
0
02 Jun 2021
Adversarially Adaptive Normalization for Single Domain Generalization
Xinjie Fan
Qifei Wang
Junjie Ke
Feng Yang
Boqing Gong
Mingyuan Zhou
37
130
0
01 Jun 2021
Towards Light-weight and Real-time Line Segment Detection
Geonmo Gu
ByungSoo Ko
SeoungHyun Go
Sung-Hyun Lee
Jingeun Lee
Minchul Shin
3DGS
13
60
0
01 Jun 2021
Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images
Mehdi Cherti
J. Jitsev
LM&MA
40
23
0
31 May 2021
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
Jiemin Fang
Lingxi Xie
Xinggang Wang
Xiaopeng Zhang
Wenyu Liu
Qi Tian
ViT
23
74
0
31 May 2021
VidFace: A Full-Transformer Solver for Video Face Hallucination with Unaligned Tiny Snapshots
Y. Gan
Yawei Luo
Xin Yu
Bang Zhang
Yi Yang
ViT
CVBM
37
3
0
31 May 2021
Informing Geometric Deep Learning with Electronic Interactions to Accelerate Quantum Chemistry
Zhuoran Qiao
Anders S. Christensen
Matthew Welborn
F. Manby
Anima Anandkumar
Thomas F. Miller
49
74
0
31 May 2021
Universal Adder Neural Networks
Hanting Chen
Yunhe Wang
Chang Xu
Chao Xu
Chunjing Xu
Tong Zhang
37
3
0
29 May 2021