Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1603.09382
Cited By
Deep Networks with Stochastic Depth
30 March 2016
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Networks with Stochastic Depth"
50 / 432 papers shown
Title
Deep residual learning with product units
Ziyuan Li
U. Jaekel
B. Dellen
57
0
0
07 May 2025
Transferable Adversarial Attacks on Black-Box Vision-Language Models
Kai Hu
Weichen Yu
L. Zhang
Alexander Robey
Andy Zou
Chengming Xu
Haoqi Hu
Matt Fredrikson
AAML
VLM
64
0
0
02 May 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
103
0
0
17 Apr 2025
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Zeqi Zheng
Yanchen Huang
Yingchao Yu
Zizheng Zhu
Junfeng Tang
Zhaofei Yu
Yaochu Jin
39
0
0
20 Mar 2025
Self-Supervised Pretraining for Fine-Grained Plankton Recognition
Joona Kareinen
T. Eerola
K. Kraft
L. Lensu
S. Suikkanen
Heikki Kälviäinen
SSL
174
0
0
14 Mar 2025
Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals
Hanze Li
Xiande Huang
46
0
0
09 Mar 2025
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding
Xiangxiang Chu
Renda Li
Yong Wang
62
0
0
08 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
115
1
0
27 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
61
43
0
24 Feb 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
71
8
0
24 Feb 2025
CR-CTC: Consistency regularization on CTC for improved speech recognition
Zengwei Yao
Wei Kang
Xiaoyu Yang
Fangjun Kuang
Liyong Guo
Han Zhu
Zengrui Jin
Zhaoqing Li
Long Lin
Daniel Povey
53
0
0
17 Feb 2025
EmoNeXt: an Adapted ConvNeXt for Facial Emotion Recognition
Yassine El Boudouri
Amine Bohi
73
15
0
14 Jan 2025
Combining Priors with Experience: Confidence Calibration Based on Binomial Process Modeling
Jinzong Dong
Zhaohui Jiang
Dong Pan
Haoyang Yu
58
0
0
14 Dec 2024
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
45
1
0
12 Nov 2024
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
31
13
0
15 Oct 2024
IceCloudNet: 3D reconstruction of cloud ice from Meteosat SEVIRI
K. Jeggle
Mikolaj Czerkawski
F. Serva
B. L. Saux
D. Neubauer
Ulrike Lohmann
25
1
0
05 Oct 2024
Streaming Neural Images
Marcos V. Conde
Andy Bigos
Radu Timofte
33
0
0
25 Sep 2024
MILAN: Milli-Annotations for Lidar Semantic Segmentation
Nermin Samet
Gilles Puy
Oriane Siméoni
Renaud Marlet
3DPC
32
0
0
22 Jul 2024
SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization
Ashish Tiwari
Shanmuganathan Raman
MDE
31
1
0
12 Jul 2024
DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting
Yuxuan Shu
Vasileios Lampos
AI4TS
AI4CE
70
0
0
11 Jun 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
48
5
0
22 May 2024
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Guoping Xu
Xiaxia Wang
Xinglong Wu
Xuesong Leng
Yongchao Xu
3DPC
36
8
0
02 May 2024
SpiralMLP: A Lightweight Vision MLP Architecture
Haojie Mu
Burhan Ul Tayyab
Nicholas Chua
43
0
0
31 Mar 2024
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
43
17
0
29 Mar 2024
Convection-Diffusion Equation: A Theoretically Certified Framework for Neural Networks
Tangjun Wang
Chenglong Bao
Zuoqiang Shi
DiffM
46
0
0
23 Mar 2024
Fusion Transformer with Object Mask Guidance for Image Forgery Analysis
Dimitrios Karageorgiou
Giorgos Kordopatis-Zilos
Symeon Papadopoulos
ViT
22
5
0
18 Mar 2024
Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods
A. Mumuni
F. Mumuni
65
5
0
13 Mar 2024
Fine-tuning with Very Large Dropout
Jianyu Zhang
Léon Bottou
42
1
0
01 Mar 2024
DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation
Sunghyeon Woo
Baeseong Park
Byeongwook Kim
Minjung Jo
S. Kwon
Dongsuk Jeon
Dongsoo Lee
59
2
0
27 Feb 2024
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Xinlei Chen
Zhuang Liu
Saining Xie
Kaiming He
DiffM
35
54
0
25 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
34
14
0
25 Jan 2024
Adaptive Depth Networks with Skippable Sub-Paths
Woochul Kang
30
1
0
27 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
44
63
0
11 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
35
0
0
08 Dec 2023
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Arun V. Reddy
William Paul
Corban Rivera
Ketul Shah
Celso M. de Melo
Rama Chellappa
37
4
0
05 Dec 2023
Initializing Models with Larger Ones
Zhiqiu Xu
Yanjie Chen
Kirill Vishniakov
Yida Yin
Zhiqiang Shen
Trevor Darrell
Lingjie Liu
Zhuang Liu
30
17
0
30 Nov 2023
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
28
14
0
09 Nov 2023
A Coefficient Makes SVRG Effective
Yida Yin
Zhiqiu Xu
Zhiyuan Li
Trevor Darrell
Zhuang Liu
33
1
0
09 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
44
36
0
30 Oct 2023
Stochastic interpolants with data-dependent couplings
M. S. Albergo
Mark Goldstein
Nicholas M. Boffi
Rajesh Ranganath
Eric Vanden-Eijnden
OT
30
29
0
05 Oct 2023
Decoupled Local Aggregation for Point Cloud Learning
Binjie Chen
Yunzhou Xia
Yu Zang
Cheng-Yu Wang
Jonathan Li
3DPC
29
9
0
31 Aug 2023
UnLoc: A Unified Framework for Video Localization Tasks
Shengjia Yan
Xuehan Xiong
Arsha Nagrani
Anurag Arnab
Zhonghao Wang
Weina Ge
David A. Ross
Cordelia Schmid
31
53
0
21 Aug 2023
CoNe: Contrast Your Neighbours for Supervised Image Classification
Mingkai Zheng
Shan You
Lang Huang
Xiu Su
Fei Wang
Chao Qian
Xiaogang Wang
Chang Xu
VLM
26
0
0
21 Aug 2023
Cost-effective On-device Continual Learning over Memory Hierarchy with Miro
Xinyue Ma
Suyeon Jeong
Minjia Zhang
Di Wang
Jonghyun Choi
Myeongjae Jeon
CLL
16
13
0
11 Aug 2023
SAfER: Layer-Level Sensitivity Assessment for Efficient and Robust Neural Network Inference
Edouard Yvinec
Arnaud Dapogny
Kévin Bailly
Xavier Fischer
AAML
8
2
0
09 Aug 2023
Efficient Learning of Discrete-Continuous Computation Graphs
David Friede
Mathias Niepert
13
3
0
26 Jul 2023
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models
Jean Kaddour
Oscar Key
Piotr Nawrot
Pasquale Minervini
Matt J. Kusner
20
41
0
12 Jul 2023
Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation
Xin Yuan
Pedro H. P. Savarese
Michael Maire
8
5
0
22 Jun 2023
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
35
14
0
20 Jun 2023
Scaling of Class-wise Training Losses for Post-hoc Calibration
Seungjin Jung
Seung-Woo Seo
Yonghyun Jeong
Jongwon Choi
29
3
0
19 Jun 2023
1
2
3
4
5
6
7
8
9
Next