ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.03983
  4. Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts

SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL
ArXivPDFHTML

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
Title
MoViNets: Mobile Video Networks for Efficient Video Recognition
MoViNets: Mobile Video Networks for Efficient Video Recognition
Dan Kondratyuk
Liangzhe Yuan
Yandong Li
Li Zhang
Mingxing Tan
Matthew A. Brown
Boqing Gong
21
229
0
21 Mar 2021
PGT: A Progressive Method for Training Models on Long Videos
PGT: A Progressive Method for Training Models on Long Videos
Bo Pang
Gao Peng
Yizhuo Li
Cewu Lu
VLM
27
12
0
21 Mar 2021
TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation
TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation
Samuel G. Müller
Frank Hutter
ViT
MQ
26
277
0
18 Mar 2021
Hierarchical Attention-based Age Estimation and Bias Estimation
Hierarchical Attention-based Age Estimation and Bias Estimation
Shakediel Hiba
Y. Keller
CVBM
37
10
0
17 Mar 2021
Frequency-aware Discriminative Feature Learning Supervised by
  Single-Center Loss for Face Forgery Detection
Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection
Jiaming Li
Hongtao Xie
Jiahong Li
Zhongyuan Wang
Yongdong Zhang
CVBM
52
238
0
16 Mar 2021
Revisiting ResNets: Improved Training and Scaling Strategies
Revisiting ResNets: Improved Training and Scaling Strategies
Irwan Bello
W. Fedus
Xianzhi Du
E. D. Cubuk
A. Srinivas
Nayeon Lee
Jonathon Shlens
Barret Zoph
36
298
0
13 Mar 2021
Searching by Generating: Flexible and Efficient One-Shot NAS with
  Architecture Generator
Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator
Sian-Yao Huang
W. Chu
25
25
0
12 Mar 2021
Learnable Companding Quantization for Accurate Low-bit Neural Networks
Learnable Companding Quantization for Accurate Low-bit Neural Networks
Kohei Yamamoto
MQ
41
65
0
12 Mar 2021
Towards Learning an Unbiased Classifier from Biased Data via Conditional
  Adversarial Debiasing
Towards Learning an Unbiased Classifier from Biased Data via Conditional Adversarial Debiasing
Christian Reimers
P. Bodesheim
Jakob Runge
Joachim Denzler
FaML
CML
14
6
0
10 Mar 2021
Spatially Consistent Representation Learning
Spatially Consistent Representation Learning
Byungseok Roh
Wuhyun Shin
Ildoo Kim
Sungwoong Kim
SSL
33
88
0
10 Mar 2021
Variable-rate discrete representation learning
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL
DRL
32
23
0
10 Mar 2021
SimTriplet: Simple Triplet Representation Learning with a Single GPU
SimTriplet: Simple Triplet Representation Learning with a Single GPU
Quan Liu
Peter C. Louis
Yuzhe Lu
Aadarsh Jha
Mengyang Zhao
...
Joseph T. Roland
Haichun Yang
Shilin Zhao
L. Wheless
Yuankai Huo
26
36
0
09 Mar 2021
Knowledge Evolution in Neural Networks
Knowledge Evolution in Neural Networks
Ahmed Taha
Abhinav Shrivastava
L. Davis
54
21
0
09 Mar 2021
Nondeterminism and Instability in Neural Network Optimization
Nondeterminism and Instability in Neural Network Optimization
Cecilia Summers
M. Dinneen
30
38
0
08 Mar 2021
Better SGD using Second-order Momentum
Better SGD using Second-order Momentum
Hoang Tran
Ashok Cutkosky
ODL
29
12
0
04 Mar 2021
Barlow Twins: Self-Supervised Learning via Redundancy Reduction
Barlow Twins: Self-Supervised Learning via Redundancy Reduction
Jure Zbontar
Li Jing
Ishan Misra
Yann LeCun
Stéphane Deny
SSL
84
2,309
0
04 Mar 2021
Learning Granularity-Aware Convolutional Neural Network for Fine-Grained
  Visual Classification
Learning Granularity-Aware Convolutional Neural Network for Fine-Grained Visual Classification
Jianwei Song
Ruoyu Yang
ObjD
24
4
0
04 Mar 2021
Feature Boosting, Suppression, and Diversification for Fine-Grained
  Visual Classification
Feature Boosting, Suppression, and Diversification for Fine-Grained Visual Classification
Jianwei Song
Ruoyu Yang
21
37
0
04 Mar 2021
Shift Invariance Can Reduce Adversarial Robustness
Shift Invariance Can Reduce Adversarial Robustness
Songwei Ge
Vasu Singla
Ronen Basri
David Jacobs
AAML
OOD
18
26
0
03 Mar 2021
Deep Clustering by Semantic Contrastive Learning
Deep Clustering by Semantic Contrastive Learning
Jiabo Huang
S. Gong
30
15
0
03 Mar 2021
Adaptive Consistency Regularization for Semi-Supervised Transfer
  Learning
Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
Abulikemu Abuduweili
Xingjian Li
Humphrey Shi
Chengzhong Xu
Dejing Dou
47
77
0
03 Mar 2021
Touchless Palmprint Recognition based on 3D Gabor Template and Block
  Feature Refinement
Touchless Palmprint Recognition based on 3D Gabor Template and Block Feature Refinement
Zhaoqun Li
Xu Liang
Dandan Fan
Jinxing Li
Wei Jia
David Zhang
33
16
0
03 Mar 2021
PML: Progressive Margin Loss for Long-tailed Age Classification
PML: Progressive Margin Loss for Long-tailed Age Classification
Zongyong Deng
Hao Liu
Yaoxing Wang
Chenyang Wang
Zekuan Yu
Xuehong Sun
22
57
0
03 Mar 2021
Self-supervised Pretraining of Visual Features in the Wild
Self-supervised Pretraining of Visual Features in the Wild
Priya Goyal
Mathilde Caron
Benjamin Lefaudeux
Min Xu
Pengchao Wang
...
Mannat Singh
Vitaliy Liptchinsky
Ishan Misra
Armand Joulin
Piotr Bojanowski
VLM
SSL
27
272
0
02 Mar 2021
Fixing Data Augmentation to Improve Adversarial Robustness
Fixing Data Augmentation to Improve Adversarial Robustness
Sylvestre-Alvise Rebuffi
Sven Gowal
D. A. Calian
Florian Stimberg
Olivia Wiles
Timothy A. Mann
AAML
41
271
0
02 Mar 2021
Acceleration via Fractal Learning Rate Schedules
Acceleration via Fractal Learning Rate Schedules
Naman Agarwal
Surbhi Goel
Cyril Zhang
34
18
0
01 Mar 2021
A Multiclass Boosting Framework for Achieving Fast and Provable
  Adversarial Robustness
A Multiclass Boosting Framework for Achieving Fast and Provable Adversarial Robustness
Jacob D. Abernethy
Pranjal Awasthi
Satyen Kale
AAML
32
6
0
01 Mar 2021
Generative Chemical Transformer: Neural Machine Learning of Molecular
  Geometric Structures from Chemical Language via Attention
Generative Chemical Transformer: Neural Machine Learning of Molecular Geometric Structures from Chemical Language via Attention
Hyunseung Kim
Jonggeol Na
Won Bo Lee
22
46
0
27 Feb 2021
GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient
  Deep Model Training
GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient Deep Model Training
Krishnateja Killamsetty
D. Sivasubramanian
Ganesh Ramakrishnan
A. De
Rishabh K. Iyer
OOD
94
192
0
27 Feb 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
265
28,145
0
26 Feb 2021
On the Validity of Modeling SGD with Stochastic Differential Equations
  (SDEs)
On the Validity of Modeling SGD with Stochastic Differential Equations (SDEs)
Zhiyuan Li
Sadhika Malladi
Sanjeev Arora
60
78
0
24 Feb 2021
Learning to Generate Wasserstein Barycenters
Learning to Generate Wasserstein Barycenters
Julien Lacombe
Julie Digne
Nicolas Courty
Nicolas Bonneel
19
12
0
24 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
323
3,644
0
24 Feb 2021
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning
  of Deep Neural Networks
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
48
285
0
23 Feb 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
35
297
0
22 Feb 2021
A Novel Framework for Neural Architecture Search in the Hill Climbing
  Domain
A Novel Framework for Neural Architecture Search in the Hill Climbing Domain
Mudit Verma
Pradyumn Sinha
Karan Goyal
Apoorva Verma
Seba Susan
21
7
0
22 Feb 2021
Provable Super-Convergence with a Large Cyclical Learning Rate
Provable Super-Convergence with a Large Cyclical Learning Rate
Samet Oymak
43
12
0
22 Feb 2021
Learning Neural Network Subspaces
Learning Neural Network Subspaces
Mitchell Wortsman
Maxwell Horton
Carlos Guestrin
Ali Farhadi
Mohammad Rastegari
UQCV
32
85
0
20 Feb 2021
Kanerva++: extending The Kanerva Machine with differentiable, locally
  block allocated latent memory
Kanerva++: extending The Kanerva Machine with differentiable, locally block allocated latent memory
Jason Ramapuram
Yan Wu
Alexandros Kalousis
22
4
0
20 Feb 2021
Physical Reasoning Using Dynamics-Aware Models
Physical Reasoning Using Dynamics-Aware Models
Eltayeb Ahmed
A. Bakhtin
Laurens van der Maaten
Rohit Girdhar
LRM
23
3
0
20 Feb 2021
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced
  Semi-Supervised Learning
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning
Chen Wei
Kihyuk Sohn
Clayton Mellina
Alan Yuille
Fan Yang
CLL
45
258
0
18 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
281
179
0
17 Feb 2021
Instance Localization for Self-supervised Detection Pretraining
Instance Localization for Self-supervised Detection Pretraining
Ceyuan Yang
Zhirong Wu
Bolei Zhou
Stephen Lin
ViT
SSL
105
147
0
16 Feb 2021
GradInit: Learning to Initialize Neural Networks for Stable and
  Efficient Training
GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Chen Zhu
Renkun Ni
Zheng Xu
Kezhi Kong
Wenjie Huang
Tom Goldstein
ODL
48
54
0
16 Feb 2021
Learning to Recognize Actions on Objects in Egocentric Video with
  Attention Dictionaries
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
EgoV
35
15
0
16 Feb 2021
VA-RED$^2$: Video Adaptive Redundancy Reduction
VA-RED2^22: Video Adaptive Redundancy Reduction
Bowen Pan
Yikang Shen
Camilo Luciano Fosco
Chung-Ching Lin
A. Andonian
Yue Meng
Kate Saenko
A. Oliva
Rogerio Feris
20
19
0
15 Feb 2021
Online hyperparameter optimization by real-time recurrent learning
Online hyperparameter optimization by real-time recurrent learning
Daniel Jiwoong Im
Cristina Savin
Kyunghyun Cho
35
7
0
15 Feb 2021
GradPIM: A Practical Processing-in-DRAM Architecture for Gradient
  Descent
GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent
Heesu Kim
Hanmin Park
Taehyun Kim
Kwanheum Cho
Eojin Lee
Soojung Ryu
Hyuk-Jae Lee
Kiyoung Choi
Jinho Lee
24
36
0
15 Feb 2021
Learning Self-Similarity in Space and Time as Generalized Motion for
  Video Action Recognition
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
TTA
27
39
0
14 Feb 2021
Understanding Negative Samples in Instance Discriminative
  Self-supervised Representation Learning
Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning
Kento Nozawa
Issei Sato
SSL
39
43
0
13 Feb 2021
Previous
123...697071...848586
Next