ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.03983
  4. Cited By
SGDR: Stochastic Gradient Descent with Warm Restarts

SGDR: Stochastic Gradient Descent with Warm Restarts

13 August 2016
I. Loshchilov
Frank Hutter
    ODL
ArXivPDFHTML

Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"

50 / 4,280 papers shown
Title
Self-Supervised MultiModal Versatile Networks
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
40
372
0
29 Jun 2020
Gradient-only line searches to automatically determine learning rates
  for a variety of stochastic training algorithms
Gradient-only line searches to automatically determine learning rates for a variety of stochastic training algorithms
D. Kafka
D. Wilke
ODL
27
0
0
29 Jun 2020
Video Representation Learning with Visual Tempo Consistency
Video Representation Learning with Visual Tempo Consistency
Ceyuan Yang
Yinghao Xu
Bo Dai
Bolei Zhou
13
89
0
28 Jun 2020
GPT-GNN: Generative Pre-Training of Graph Neural Networks
GPT-GNN: Generative Pre-Training of Graph Neural Networks
Ziniu Hu
Yuxiao Dong
Kuansan Wang
Kai-Wei Chang
Yizhou Sun
SSL
AI4CE
18
549
0
27 Jun 2020
Traditional and accelerated gradient descent for neural architecture
  search
Traditional and accelerated gradient descent for neural architecture search
Nicolas García Trillos
Félix Morales
Javier Morales
6
3
0
26 Jun 2020
Region-of-interest guided Supervoxel Inpainting for Self-supervision
Region-of-interest guided Supervoxel Inpainting for Self-supervision
Subhradeep Kayal
Shuai Chen
Marleen de Bruijne
SSL
6
9
0
26 Jun 2020
Supermasks in Superposition
Supermasks in Superposition
Mitchell Wortsman
Vivek Ramanujan
Rosanne Liu
Aniruddha Kembhavi
Mohammad Rastegari
J. Yosinski
Ali Farhadi
SSL
CLL
33
281
0
26 Jun 2020
Auto-PyTorch Tabular: Multi-Fidelity MetaLearning for Efficient and
  Robust AutoDL
Auto-PyTorch Tabular: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL
Lucas Zimmer
Marius Lindauer
Frank Hutter
MU
14
90
0
24 Jun 2020
Single-Shot 3D Detection of Vehicles from Monocular RGB Images via
  Geometry Constrained Keypoints in Real-Time
Single-Shot 3D Detection of Vehicles from Monocular RGB Images via Geometry Constrained Keypoints in Real-Time
Nils Gählert
Jun-Jun Wan
Nicolas Jourdan
Jan Finkbeiner
Uwe Franke
Joachim Denzler
3DPC
24
18
0
23 Jun 2020
Telescoping Density-Ratio Estimation
Telescoping Density-Ratio Estimation
Benjamin Rhodes
Kai Xu
Michael U. Gutmann
30
94
0
22 Jun 2020
Feature Alignment and Restoration for Domain Generalization and
  Adaptation
Feature Alignment and Restoration for Domain Generalization and Adaptation
Xin Jin
Cuiling Lan
Wenjun Zeng
Zhibo Chen
OOD
40
39
0
22 Jun 2020
Sequential Feature Filtering Classifier
Sequential Feature Filtering Classifier
Min-seok Seo
Jaemin Lee
Jongchan Park
Dong-Geol Choi
34
3
0
21 Jun 2020
FNA++: Fast Network Adaptation via Parameter Remapping and Architecture
  Search
FNA++: Fast Network Adaptation via Parameter Remapping and Architecture Search
Jiemin Fang
Yuzhu Sun
Qian Zhang
Kangjian Peng
Yuan Li
Wenyu Liu
Xinggang Wang
SSeg
17
34
0
21 Jun 2020
Collective Learning by Ensembles of Altruistic Diversifying Neural
  Networks
Collective Learning by Ensembles of Altruistic Diversifying Neural Networks
Benjamin Brazowski
E. Schneidman
FedML
25
4
0
20 Jun 2020
Pyramidal Convolution: Rethinking Convolutional Neural Networks for
  Visual Recognition
Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition
Ionut Cosmin Duta
Li Liu
Fan Zhu
Ling Shao
40
195
0
20 Jun 2020
Paying more attention to snapshots of Iterative Pruning: Improving Model
  Compression via Ensemble Distillation
Paying more attention to snapshots of Iterative Pruning: Improving Model Compression via Ensemble Distillation
Duong H. Le
Vo Trung Nhan
N. Thoai
VLM
33
7
0
20 Jun 2020
Supervision Accelerates Pre-training in Contrastive Semi-Supervised
  Learning of Visual Representations
Supervision Accelerates Pre-training in Contrastive Semi-Supervised Learning of Visual Representations
Mahmoud Assran
Nicolas Ballas
Lluis Castrejon
Michael G. Rabbat
SSL
8
3
0
18 Jun 2020
Cyclic Differentiable Architecture Search
Cyclic Differentiable Architecture Search
Hongyuan Yu
Houwen Peng
Yan Huang
Jianlong Fu
Hao Du
Liang Wang
Haibin Ling
3DPC
27
48
0
18 Jun 2020
MMCGAN: Generative Adversarial Network with Explicit Manifold Prior
MMCGAN: Generative Adversarial Network with Explicit Manifold Prior
Guanhua Zheng
Jitao Sang
Changsheng Xu
GAN
14
1
0
18 Jun 2020
Unsupervised Learning of Visual Features by Contrasting Cluster
  Assignments
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
Mathilde Caron
Ishan Misra
Julien Mairal
Priya Goyal
Piotr Bojanowski
Armand Joulin
OCL
SSL
48
4,019
0
17 Jun 2020
Fine-Tuning DARTS for Image Classification
Fine-Tuning DARTS for Image Classification
M. Tanveer
Muhammad Umar Karim Khan
C. Kyung
31
49
0
16 Jun 2020
Ordering Dimensions with Nested Dropout Normalizing Flows
Ordering Dimensions with Nested Dropout Normalizing Flows
Artur Bekasov
Iain Murray
DRL
28
5
0
15 Jun 2020
Multiscale Deep Equilibrium Models
Multiscale Deep Equilibrium Models
Shaojie Bai
V. Koltun
J. Zico Kolter
BDL
40
211
0
15 Jun 2020
Neural Ensemble Search for Uncertainty Estimation and Dataset Shift
Neural Ensemble Search for Uncertainty Estimation and Dataset Shift
Sheheryar Zaidi
Arber Zela
T. Elsken
Chris Holmes
Frank Hutter
Yee Whye Teh
OOD
UQCV
18
71
0
15 Jun 2020
The Limit of the Batch Size
The Limit of the Batch Size
Yang You
Yuhui Wang
Huan Zhang
Zhao-jie Zhang
J. Demmel
Cho-Jui Hsieh
16
15
0
15 Jun 2020
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data
Zhun Deng
Linjun Zhang
Amirata Ghorbani
James Zou
36
32
0
15 Jun 2020
AdamP: Slowing Down the Slowdown for Momentum Optimizers on
  Scale-invariant Weights
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights
Byeongho Heo
Sanghyuk Chun
Seong Joon Oh
Dongyoon Han
Sangdoo Yun
Gyuwan Kim
Youngjung Uh
Jung-Woo Ha
ODL
290
27
0
15 Jun 2020
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization
  is Sufficient
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient
Ankit Pensia
Shashank Rajput
Alliot Nagle
Harit Vishwakarma
Dimitris Papailiopoulos
24
103
0
14 Jun 2020
Meta Approach to Data Augmentation Optimization
Meta Approach to Data Augmentation Optimization
Ryuichiro Hataya
Jan Zdenek
Kazuki Yoshizoe
Hideki Nakayama
32
34
0
14 Jun 2020
Explicitly Modeled Attention Maps for Image Classification
Explicitly Modeled Attention Maps for Image Classification
Andong Tan
D. Nguyen
Maximilian Dax
Matthias Nießner
Thomas Brox
35
8
0
14 Jun 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
169
6,671
0
13 Jun 2020
Adversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive Learning
Minseon Kim
Jihoon Tack
Sung Ju Hwang
SSL
28
247
0
13 Jun 2020
Rethinking Pre-training and Self-training
Rethinking Pre-training and Self-training
Barret Zoph
Golnaz Ghiasi
Nayeon Lee
Huayu Chen
Hanxiao Liu
E. D. Cubuk
Quoc V. Le
SSeg
48
646
0
11 Jun 2020
Adaptive Gradient Methods Converge Faster with Over-Parameterization
  (but you should do a line-search)
Adaptive Gradient Methods Converge Faster with Over-Parameterization (but you should do a line-search)
Sharan Vaswani
I. Laradji
Frederik Kunstner
S. Meng
Mark Schmidt
Simon Lacoste-Julien
27
27
0
11 Jun 2020
Automated Identification of Thoracic Pathology from Chest Radiographs
  with Enhanced Training Pipeline
Automated Identification of Thoracic Pathology from Chest Radiographs with Enhanced Training Pipeline
Adora M. DSouza
A. Abidin
A. Wismüller
6
11
0
11 Jun 2020
VirTex: Learning Visual Representations from Textual Annotations
VirTex: Learning Visual Representations from Textual Annotations
Karan Desai
Justin Johnson
SSL
VLM
30
433
0
11 Jun 2020
AdaS: Adaptive Scheduling of Stochastic Gradients
AdaS: Adaptive Scheduling of Stochastic Gradients
Mahdi S. Hosseini
Konstantinos N. Plataniotis
ODL
39
12
0
11 Jun 2020
Deep learning reconstruction of digital breast tomosynthesis images for
  accurate breast density and patient-specific radiation dose estimation
Deep learning reconstruction of digital breast tomosynthesis images for accurate breast density and patient-specific radiation dose estimation
Jonas Teuwen
N. Moriakov
C. Fedon
M. Caballo
I. Reiser
Pedrag Bakic
E. García
Oliver Díaz
K. Michielsen
I. Sechopoulos
18
27
0
11 Jun 2020
DcardNet: Diabetic Retinopathy Classification at Multiple Levels Based
  on Structural and Angiographic Optical Coherence Tomography
DcardNet: Diabetic Retinopathy Classification at Multiple Levels Based on Structural and Angiographic Optical Coherence Tomography
P. Zang
Liqin Gao
T. Hormel
Jie Wang
Q. You
T. Hwang
Yali Jia
19
50
0
09 Jun 2020
Parameter-Efficient Person Re-identification in the 3D Space
Parameter-Efficient Person Re-identification in the 3D Space
Zhedong Zheng
Nenggan Zheng
Yi Yang
3DPC
28
62
0
08 Jun 2020
Multi-step Estimation for Gradient-based Meta-learning
Multi-step Estimation for Gradient-based Meta-learning
Jin-Hwa Kim
Junyoung Park
Yongseok Choi
22
1
0
08 Jun 2020
Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing
  its Gradient Estimator Bias
Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing its Gradient Estimator Bias
Axel Laborieux
M. Ernoult
B. Scellier
Yoshua Bengio
Julie Grollier
D. Querlioz
16
69
0
06 Jun 2020
AutoHAS: Efficient Hyperparameter and Architecture Search
AutoHAS: Efficient Hyperparameter and Architecture Search
Xuanyi Dong
Mingxing Tan
Adams Wei Yu
Daiyi Peng
Bogdan Gabrys
Quoc V. Le
TPM
27
23
0
05 Jun 2020
End-to-End Adversarial Text-to-Speech
End-to-End Adversarial Text-to-Speech
Jeff Donahue
Sander Dieleman
Mikolaj Binkowski
Erich Elsen
Karen Simonyan
22
185
0
05 Jun 2020
Learning to Rank Learning Curves
Learning to Rank Learning Curves
Martin Wistuba
Tejaswini Pedapati
14
24
0
05 Jun 2020
Bayesian Neural Network via Stochastic Gradient Descent
Abhinav Sagar
UQCV
BDL
18
2
0
04 Jun 2020
Exploring the Potential of Low-bit Training of Convolutional Neural
  Networks
Exploring the Potential of Low-bit Training of Convolutional Neural Networks
Kai Zhong
Xuefei Ning
Guohao Dai
Zhenhua Zhu
Tianchen Zhao
Shulin Zeng
Yu Wang
Huazhong Yang
MQ
25
9
0
04 Jun 2020
Weight Pruning via Adaptive Sparsity Loss
Weight Pruning via Adaptive Sparsity Loss
George Retsinas
Athena Elafrou
G. Goumas
Petros Maragos
45
10
0
04 Jun 2020
Deep Learning Methods for Real-time Detection and Analysis of Wagner
  Ulcer Classification System
Deep Learning Methods for Real-time Detection and Analysis of Wagner Ulcer Classification System
Aifu Han
Yongze Zhang
Ajuan Li
Changjin Li
Fengying Zhao
...
Qin Liu
Yanting Liu
Ximei Shen
Sunjie Yan
Shengzong Zhou
6
1
0
03 Jun 2020
Consistent Estimators for Learning to Defer to an Expert
Consistent Estimators for Learning to Defer to an Expert
Hussein Mozannar
David Sontag
15
198
0
02 Jun 2020
Previous
123...757677...848586
Next