1608.03983
SGDR: Stochastic Gradient Descent with Warm Restarts
13 August 2016
I. Loshchilov
Frank Hutter
ODL
Papers citing "SGDR: Stochastic Gradient Descent with Warm Restarts"
50 / 4,296 papers shown
Title
Well-tuned Simple Nets Excel on Tabular Datasets
Arlind Kadra
Marius Lindauer
Frank Hutter
Josif Grabocka
23
186
0
21 Jun 2021
Better Training using Weight-Constrained Stochastic Dynamics
Benedict Leimkuhler
Tiffany J. Vlaar
Timothée Pouchon
Amos Storkey
31
9
0
20 Jun 2021
More than Encoder: Introducing Transformer Decoder to Upsample
Yijiang Li
Wentian Cai
Ying Gao
Chengming Li
Xiping Hu
ViT
MedIm
35
52
0
20 Jun 2021
Teacher's pet: understanding and mitigating biases in distillation
Michal Lukasik
Srinadh Bhojanapalli
A. Menon
Sanjiv Kumar
23
25
0
19 Jun 2021
Towards Single Stage Weakly Supervised Semantic Segmentation
Peri Akiva
Kristin J. Dana
56
8
0
18 Jun 2021
It's FLAN time! Summing feature-wise latent representations for interpretability
An-phi Nguyen
María Rodríguez Martínez
FAtt
18
0
0
18 Jun 2021
Being a Bit Frequentist Improves Bayesian Neural Networks
Agustinus Kristiadi
Matthias Hein
Philipp Hennig
BDL
UQCV
29
15
0
18 Jun 2021
Medical Matting: A New Perspective on Medical Segmentation with Uncertainty
Lin Wang
Lie Ju
Xin Wang
Wanji He
Donghao Zhang
...
Zhiwen Yang
Xuan-Liang Yao
Xin Zhao
Xiufen Ye
Z. Ge
37
3
0
18 Jun 2021
Pruning Randomly Initialized Neural Networks with Iterative Randomization
Daiki Chijiwa
Shin'ya Yamaguchi
Yasutoshi Ida
Kenji Umakoshi
T. Inoue
22
25
0
17 Jun 2021
Long-Short Temporal Contrastive Learning of Video Transformers
Jue Wang
Gedas Bertasius
Du Tran
Lorenzo Torresani
VLM
ViT
52
50
0
17 Jun 2021
Amortized Auto-Tuning: Cost-Efficient Bayesian Transfer Optimization for Hyperparameter Recommendation
Yuxin Xiao
Eric P. Xing
Willie Neiswanger
40
5
0
17 Jun 2021
To Raise or Not To Raise: The Autonomous Learning Rate Question
Xiaomeng Dong
Tao Tan
Michael Potter
Yun-Chan Tsai
Gaurav Kumar
V. R. Saripalli
Theodore Trafalis
OOD
13
2
0
16 Jun 2021
EdgeConv with Attention Module for Monocular Depth Estimation
Minhyeok Lee
Sangwon Hwang
Chaewon Park
Sangyoun Lee
MDE
3DPC
18
17
0
16 Jun 2021
FastAno: Fast Anomaly Detection via Spatio-temporal Patch Transformation
Chaewon Park
Myeongah Cho
Minhyeok Lee
Sangyoun Lee
29
31
0
16 Jun 2021
Domain Consistency Regularization for Unsupervised Multi-source Domain Adaptive Classification
Zhipeng Luo
Xiaobing Zhang
Shijian Lu
Shuai Yi
69
21
0
16 Jun 2021
Delving Deep into the Generalization of Vision Transformers under Distribution Shifts
Chongzhi Zhang
Mingyuan Zhang
Shanghang Zhang
Daisheng Jin
Qiang-feng Zhou
Zhongang Cai
Haiyu Zhao
Xianglong Liu
Ziwei Liu
31
103
0
14 Jun 2021
Revisiting consistency for semi-supervised semantic segmentation
Ivan Grubišić
Marin Orsic
Sinisa Segvic
19
4
0
13 Jun 2021
Wide Mean-Field Variational Bayesian Neural Networks Ignore the Data
Beau Coker
Weiwei Pan
Finale Doshi-Velez
BDL
35
9
0
13 Jun 2021
A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration
Yuhang Li
Shi-Wee Deng
Xin Dong
Ruihao Gong
Shi Gu
30
187
0
13 Jun 2021
Video Super-Resolution Transformer
Jie Cao
Yawei Li
Peng Sun
Luc Van Gool
ViT
44
168
0
12 Jun 2021
Adversarial Robustness via Fisher-Rao Regularization
Marine Picot
Francisco Messina
Malik Boudiaf
Fabrice Labeau
Ismail Ben Ayed
Pablo Piantanida
AAML
31
23
0
12 Jun 2021
Hybrid Generative-Contrastive Representation Learning
Saehoon Kim
Sungwoong Kim
Juho Lee
SSL
22
11
0
11 Jun 2021
Overcoming Difficulty in Obtaining Dark-skinned Subjects for Remote-PPG by Synthetic Augmentation
Yunhao Ba
Zhen Wang
Kerim Doruk Karinca
Oyku Deniz Bozkurt
A. Kadambi
38
10
0
10 Jun 2021
Space-time Mixing Attention for Video Transformer
Adrian Bulat
Juan-Manuel Perez-Rua
Swathikiran Sudhakaran
Brais Martínez
Georgios Tzimiropoulos
ViT
41
125
0
10 Jun 2021
MST: Masked Self-Supervised Transformer for Visual Representation
Zhaowen Li
Zhiyang Chen
Fan Yang
Wei Li
Yousong Zhu
...
Rui Deng
Liwei Wu
Rui Zhao
Ming Tang
Jinqiao Wang
ViT
52
163
0
10 Jun 2021
DUET: Detection Utilizing Enhancement for Text in Scanned or Captured Documents
Eun-Soo Jung
HyeongGwan Son
Kyusam Oh
Yongkeun Yun
Soonhwan Kwon
Min Soo Kim
54
4
0
10 Jun 2021
Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows
Brendan Leigh Ross
Jesse C. Cresswell
TPM
47
32
0
09 Jun 2021
Knowledge distillation: A good teacher is patient and consistent
Lucas Beyer
Xiaohua Zhai
Amelie Royer
L. Markeeva
Rohan Anil
Alexander Kolesnikov
VLM
52
288
0
09 Jun 2021
Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields
Yifan Wang
Lukas Rahmann
O. Sorkine-Hornung
27
65
0
09 Jun 2021
Broadcasted Residual Learning for Efficient Keyword Spotting
Byeonggeun Kim
Simyung Chang
Jinkyu Lee
Dooyong Sung
37
122
0
08 Jun 2021
Multi-dataset Pretraining: A Unified Model for Semantic Segmentation
Bowen Shi
Xiaopeng Zhang
Haohang Xu
Wenrui Dai
Junni Zou
H. Xiong
Qi Tian
19
7
0
08 Jun 2021
Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained Environments
Jingyang Zhang
Nathan Inkawhich
Randolph Linderman
Yiran Chen
H. Li
OODD
27
53
0
07 Jun 2021
Incremental False Negative Detection for Contrastive Learning
Tsai-Shien Chen
Wei-Chih Hung
Hung-Yu Tseng
Shao-Yi Chien
Ming-Hsuan Yang
SSL
CLL
18
61
0
07 Jun 2021
Channel DropBlock: An Improved Regularization Method for Fine-Grained Visual Classification
Yifeng Ding
Shuwei Dong
Yujun Tong
Zhanyu Ma
Bo Xiao
Haibin Ling
35
7
0
07 Jun 2021
Vision Transformers with Hierarchical Attention
Yun-Hai Liu
Yu-Huan Wu
Guolei Sun
Le Zhang
Ajad Chhatkuli
Luc Van Gool
ViT
48
34
0
06 Jun 2021
Bandwidth-based Step-Sizes for Non-Convex Stochastic Optimization
Xiaoyu Wang
M. Johansson
33
2
0
05 Jun 2021
RegionViT: Regional-to-Local Attention for Vision Transformers
Chun-Fu Chen
Yikang Shen
Quanfu Fan
ViT
59
195
0
04 Jun 2021
Aligning Pretraining for Detection via Object-Level Contrastive Learning
Fangyun Wei
Yue Gao
Zhirong Wu
Han Hu
Stephen Lin
ObjD
32
144
0
04 Jun 2021
COLD: Concurrent Loads Disaggregator for Non-Intrusive Load Monitoring
I. Kamyshev
Dmitrii Kriukov
E. Gryazina
Elena Gryazina
Henni Ouerdane
29
8
0
04 Jun 2021
Convergent Graph Solvers
Junyoung Park
J. Choo
Jinkyoo Park
39
13
0
03 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
35
55
0
03 Jun 2021
SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate Training
Enmao Diao
Jie Ding
Vahid Tarokh
FedML
30
74
0
02 Jun 2021
A Generalizable Approach to Learning Optimizers
Diogo Almeida
Clemens Winter
Jie Tang
Wojciech Zaremba
AI4CE
25
29
0
02 Jun 2021
Adversarially Adaptive Normalization for Single Domain Generalization
Xinjie Fan
Qifei Wang
Junjie Ke
Feng Yang
Boqing Gong
Mingyuan Zhou
37
130
0
01 Jun 2021
Towards Light-weight and Real-time Line Segment Detection
Geonmo Gu
ByungSoo Ko
SeoungHyun Go
Sung-Hyun Lee
Jingeun Lee
Minchul Shin
3DGS
13
60
0
01 Jun 2021
Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images
Mehdi Cherti
J. Jitsev
LM&MA
40
23
0
31 May 2021
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
Jiemin Fang
Lingxi Xie
Xinggang Wang
Xiaopeng Zhang
Wenyu Liu
Qi Tian
ViT
23
74
0
31 May 2021
VidFace: A Full-Transformer Solver for Video Face Hallucination with Unaligned Tiny Snapshots
Y. Gan
Yawei Luo
Xin Yu
Bang Zhang
Yi Yang
ViT
CVBM
37
3
0
31 May 2021
Informing Geometric Deep Learning with Electronic Interactions to Accelerate Quantum Chemistry
Zhuoran Qiao
Anders S. Christensen
Matthew Welborn
F. Manby
Anima Anandkumar
Thomas F. Miller
49
74
0
31 May 2021
Universal Adder Neural Networks
Hanting Chen
Yunhe Wang
Chang Xu
Chao Xu
Chunjing Xu
Tong Zhang
37
3
0
29 May 2021