Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1603.09382
Cited By
Deep Networks with Stochastic Depth
30 March 2016
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Networks with Stochastic Depth"
50 / 477 papers shown
Title
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
280
2,606
0
04 May 2021
Single-Training Collaborative Object Detectors Adaptive to Bandwidth and Computation
Juliano S. Assine
José Cândido Silveira Santos Filho
Eduardo Valle
ObjD
47
8
0
03 May 2021
Vision Transformers with Patch Diversification
Chengyue Gong
Dilin Wang
Meng Li
Vikas Chandra
Qiang Liu
ViT
45
62
0
26 Apr 2021
Visformer: The Vision-friendly Transformer
Zhengsu Chen
Lingxi Xie
Jianwei Niu
Xuefeng Liu
Longhui Wei
Qi Tian
ViT
120
209
0
26 Apr 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
63
1,224
0
22 Apr 2021
All Tokens Matter: Token Labeling for Training Better Vision Transformers
Zihang Jiang
Qibin Hou
Li-xin Yuan
Daquan Zhou
Yujun Shi
Xiaojie Jin
Anran Wang
Jiashi Feng
ViT
25
203
0
22 Apr 2021
UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost
Zhen Wu
Lijun Wu
Qi Meng
Yingce Xia
Shufang Xie
Tao Qin
Xinyu Dai
Tie-Yan Liu
18
22
0
11 Apr 2021
Towards Self-Adaptive Metric Learning On the Fly
Y. Gao
Yifan Li
Swarup Chandra
Latifur Khan
B. Thuraisingham
19
20
0
03 Apr 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
27
986
0
31 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,088
0
29 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
ViT
145
20,710
0
25 Mar 2021
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Ashish Vaswani
Prajit Ramachandran
A. Srinivas
Niki Parmar
Blake A. Hechtman
Jonathon Shlens
27
395
0
23 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
Chong Chen
Zhengming Ding
ViT
47
438
0
18 Mar 2021
Revisiting ResNets: Improved Training and Scaling Strategies
Irwan Bello
W. Fedus
Xianzhi Du
E. D. Cubuk
A. Srinivas
Nayeon Lee
Jonathon Shlens
Barret Zoph
29
298
0
13 Mar 2021
Experiments with Rich Regime Training for Deep Learning
Xinyan Li
A. Banerjee
32
2
0
26 Feb 2021
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning
Chen Wei
Kihyuk Sohn
Clayton Mellina
Alan Yuille
Fan Yang
CLL
40
256
0
18 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
281
179
0
17 Feb 2021
High-Performance Large-Scale Image Recognition Without Normalization
Andrew Brock
Soham De
Samuel L. Smith
Karen Simonyan
VLM
223
512
0
11 Feb 2021
Self-supervised driven consistency training for annotation efficient histopathology image analysis
C. Srinidhi
Seung Wook Kim
Fu-Der Chen
Anne L. Martel
SSL
21
109
0
07 Feb 2021
Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized Labels
Sangdoo Yun
Seong Joon Oh
Byeongho Heo
Dongyoon Han
Junsuk Choe
Sanghyuk Chun
414
142
0
13 Jan 2021
AutoDropout: Learning Dropout Patterns to Regularize Deep Networks
Hieu H. Pham
Quoc V. Le
76
56
0
05 Jan 2021
An Efficient Transformer Decoder with Compressed Sub-layers
Yanyang Li
Ye Lin
Tong Xiao
Jingbo Zhu
33
29
0
03 Jan 2021
Learning Light-Weight Translation Models from Deep Transformer
Bei Li
Ziyang Wang
Hui Liu
Quan Du
Tong Xiao
Chunliang Zhang
Jingbo Zhu
VLM
120
40
0
27 Dec 2020
Scaling Wide Residual Networks for Panoptic Segmentation
Liang-Chieh Chen
Huiyu Wang
Siyuan Qiao
SSeg
27
47
0
23 Nov 2020
FP-NAS: Fast Probabilistic Neural Architecture Search
Zhicheng Yan
Xiaoliang Dai
Peizhao Zhang
Yuandong Tian
Bichen Wu
Matt Feiszli
19
23
0
22 Nov 2020
Learning Loss for Test-Time Augmentation
Ildoo Kim
Younghoon Kim
Sungwoong Kim
OOD
23
90
0
22 Oct 2020
Combining Ensembles and Data Augmentation can Harm your Calibration
Yeming Wen
Ghassen Jerfel
Rafael Muller
Michael W. Dusenberry
Jasper Snoek
Balaji Lakshminarayanan
Dustin Tran
UQCV
32
63
0
19 Oct 2020
Review and Comparison of Commonly Used Activation Functions for Deep Neural Networks
Tomasz Szandała
59
277
0
15 Oct 2020
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
113
1,278
0
03 Oct 2020
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
Wonchul Son
Jaemin Na
Junyong Choi
Wonjun Hwang
25
111
0
18 Sep 2020
S3NAS: Fast NPU-aware Neural Architecture Search Methodology
Jaeseong Lee
Duseok Kang
S. Ha
35
10
0
04 Sep 2020
Wasserstein Routed Capsule Networks
Alexander Fuchs
Franz Pernkopf
32
7
0
22 Jul 2020
Diverse Ensembles Improve Calibration
Asa Cooper Stickland
Iain Murray
UQCV
FedML
27
26
0
08 Jul 2020
Enabling On-Device CNN Training by Self-Supervised Instance Filtering and Error Map Pruning
Yawen Wu
Zhepeng Wang
Yiyu Shi
Jiaxi Hu
16
44
0
07 Jul 2020
Surrogate-assisted Particle Swarm Optimisation for Evolving Variable-length Transferable Blocks for Image Classification
Bin Wang
Bing Xue
Mengjie Zhang
23
53
0
03 Jul 2020
Multiple Expert Brainstorming for Domain Adaptive Person Re-identification
Yunpeng Zhai
QiXiang Ye
Shijian Lu
Mengxi Jia
Rongrong Ji
Yonghong Tian
13
163
0
03 Jul 2020
STEER: Simple Temporal Regularization For Neural ODEs
Arna Ghosh
Harkirat Singh Behl
Emilien Dupont
Philip Torr
Vinay P. Namboodiri
BDL
AI4TS
27
74
0
18 Jun 2020
Depth Uncertainty in Neural Networks
Javier Antorán
J. Allingham
José Miguel Hernández-Lobato
UQCV
OOD
BDL
41
100
0
15 Jun 2020
Visual Transformers: Token-based Image Representation and Processing for Computer Vision
Bichen Wu
Chenfeng Xu
Xiaoliang Dai
Alvin Wan
Peizhao Zhang
Zhicheng Yan
Masayoshi Tomizuka
Joseph E. Gonzalez
Kurt Keutzer
Peter Vajda
ViT
39
546
0
05 Jun 2020
A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions
Pengzhen Ren
Yun Xiao
Xiaojun Chang
Po-Yao (Bernie) Huang
Zhihui Li
Xiaojiang Chen
Xin Wang
AI4CE
48
653
0
01 Jun 2020
HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens
Zhaohui Yang
Yunhe Wang
Xinghao Chen
Jianyuan Guo
Wei Zhang
Chao Xu
Chunjing Xu
Dacheng Tao
Chang Xu
39
17
0
29 May 2020
Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
Liang-Chieh Chen
Raphael Gontijo-Lopes
Bowen Cheng
Maxwell D. Collins
E. D. Cubuk
Barret Zoph
Hartwig Adam
Jonathon Shlens
28
76
0
20 May 2020
A Simple Semi-Supervised Learning Framework for Object Detection
Kihyuk Sohn
Zizhao Zhang
Chun-Liang Li
Han Zhang
Chen-Yu Lee
Tomas Pfister
38
493
0
10 May 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
73
1,001
0
09 Apr 2020
A Survey of Convolutional Neural Networks: Analysis, Applications, and Prospects
Zewen Li
Wenjie Yang
Shouheng Peng
Fan Liu
HAI
3DV
54
2,600
0
01 Apr 2020
Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
Duo Li
Qifeng Chen
153
19
0
24 Mar 2020
SASL: Saliency-Adaptive Sparsity Learning for Neural Network Acceleration
Jun Shi
Jianfeng Xu
K. Tasaka
Zhibo Chen
6
25
0
12 Mar 2020
An Empirical Evaluation on Robustness and Uncertainty of Regularization Methods
Sanghyuk Chun
Seong Joon Oh
Sangdoo Yun
Dongyoon Han
Junsuk Choe
Y. Yoo
AAML
OOD
345
53
0
09 Mar 2020
Super Resolution Using Segmentation-Prior Self-Attention Generative Adversarial Network
Yuxin Zhang
Zuquan Zheng
Roland Hu
21
10
0
07 Mar 2020
Learning When and Where to Zoom with Deep Reinforcement Learning
Burak Uzkent
Stefano Ermon
27
66
0
01 Mar 2020
Previous
1
2
3
...
10
6
7
8
9
Next