2101.11605
Bottleneck Transformers for Visual Recognition
27 January 2021
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
Papers citing "Bottleneck Transformers for Visual Recognition"
50 / 341 papers shown
Title
FMViT: A multiple-frequency mixing Vision Transformer
Wei Tan
Yifeng Geng
Xuansong Xie
ViT
16
3
0
09 Nov 2023
Selective Visual Representations Improve Convergence and Generalization for Embodied AI
Ainaz Eftekhar
Kuo-Hao Zeng
Jiafei Duan
Ali Farhadi
Aniruddha Kembhavi
Ranjay Krishna
27
13
0
07 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Badri N. Patro
Vijay Srinivas Agneeswaran
29
14
0
02 Nov 2023
Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning
Fuyuan Hu
Jian Zhang
Fan Lyu
Linyan Li
Fenglei Xu
CLL
16
3
0
31 Oct 2023
Accelerating Vision Transformers Based on Heterogeneous Attention Patterns
Deli Yu
Teng Xi
Jianwei Li
Baopu Li
Gang Zhang
Haocheng Feng
Junyu Han
Jingtuo Liu
Errui Ding
Jingdong Wang
ViT
26
0
0
11 Oct 2023
Plug n' Play: Channel Shuffle Module for Enhancing Tiny Vision Transformers
Xuwei Xu
Sen Wang
Yudong Chen
Jiajun Liu
ViT
21
1
0
09 Oct 2023
Strength in Diversity: Multi-Branch Representation Learning for Vehicle Re-Identification
Eurico Almeida
Bruno Silva
Jorge Batista
19
5
0
02 Oct 2023
OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios
Zhisheng Wang
Z. Deng
Fenglin Liu
Yixing Huang
Haijun Yu
Junning Cui
6
3
0
21 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
19
24
0
04 Sep 2023
QKSAN: A Quantum Kernel Self-Attention Network
Ren-Xin Zhao
Jinjing Shi
Xuelong Li
17
19
0
25 Aug 2023
Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification
Chengguo Yuan
Yu Jin
Zong-Yao Wu
Fanting Wei
Yangzirui Wang
Langlang Chen
Xiao Wang
ViT
90
6
0
23 Aug 2023
Patch Is Not All You Need
Chang-bo Li
Jie M. Zhang
Yang Wei
Zhilong Ji
Jinfeng Bai
Shiguang Shan
ViT
44
1
0
21 Aug 2023
Transformer-based Detection of Microorganisms on High-Resolution Petri Dish Images
Nikolas Ebert
D. Stricker
Oliver Wasenmüller
MedIm
ViT
23
4
0
18 Aug 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
27
2
0
07 Aug 2023
Causal reasoning in typical computer vision tasks
Kexuan Zhang
Qiyu Sun
Chaoqiang Zhao
Yang Tang
CML
26
11
0
26 Jul 2023
Regression-free Blind Image Quality Assessment with Content-Distortion Consistency
Xiaoqi Wang
Jian Xiong
Hao Gao
Weisi Lin
11
1
0
18 Jul 2023
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image Classification
Miaoge Li
Dongsheng Wang
Xinyang Liu
Zequn Zeng
Ruiying Lu
Bo Chen
Mingyuan Zhou
VLM
OT
20
15
0
18 Jul 2023
Scale-Aware Modulation Meet Transformer
Wei-Shiang Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
20
66
0
17 Jul 2023
ShiftNAS: Improving One-shot NAS via Probability Shift
Mingyang Zhang
Xinyi Yu
Haodong Zhao
Linlin Ou
25
5
0
17 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
35
62
0
16 Jul 2023
Marine Debris Detection in Satellite Surveillance using Attention Mechanisms
Ao Shen
Yijie Zhu
Richard Jiang
17
6
0
09 Jul 2023
NAR-Former V2: Rethinking Transformer for Universal Neural Network Representation Learning
Yun Yi
Haokui Zhang
Rong Xiao
Nan Wang
Xiaoyu Wang
GNN
24
2
0
19 Jun 2023
Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework
Minglei Yin
Bing Liu
Neil Zhenqiang Gong
Xin Li
AAML
9
1
0
11 Jun 2023
InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Hanrong Ye
Dan Xu
ViT
27
10
0
08 Jun 2023
CVSNet: A Computer Implementation for Central Visual System of The Brain
Ruimin Gao
Hao-Li Zou
Zhekai Duan
26
3
0
31 May 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
H. Chen
Jingkuan Song
Feng Zheng
ViT
10
0
0
17 May 2023
CB-HVTNet: A channel-boosted hybrid vision transformer network for lymphocyte assessment in histopathological images
Momina Liaqat Ali
Zunaira Rauf
Asifullah Khan
A. Sohail
Rafi Ullah
Jeonghwan Gwak
MedIm
ViT
34
2
0
16 May 2023
Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields
Bum Jun Kim
Hyeyeon Choi
Hyeonah Jang
Sang Woo Kim
ViT
18
3
0
08 May 2023
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Arunima Jaiswal
Ananya Sadana
MedIm
18
2
0
01 May 2023
Adaptive-Mask Fusion Network for Segmentation of Drivable Road and Negative Obstacle With Untrustworthy Features
Zhen Feng
Yuchao Feng
Yanning Guo
Yuxiang Sun
8
6
0
27 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
A. Schwing
Alex Colburn
Li Fuxin
17
9
0
24 Apr 2023
MLP-AIR: An Efficient MLP-Based Method for Actor Interaction Relation Learning in Group Activity Recognition
Guoliang Xu
Jianqin Yin
19
1
0
18 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
27
47
0
13 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
21
2
0
13 Apr 2023
Life Regression based Patch Slimming for Vision Transformers
Jiawei Chen
Lin Chen
Jianguo Yang
Tianqi Shi
Lechao Cheng
Zunlei Feng
Min-Gyoo Song
ViT
28
4
0
11 Apr 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
28
11
0
08 Apr 2023
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Mingyu Ding
Yikang Shen
Lijie Fan
Zhenfang Chen
Z. Chen
Ping Luo
J. Tenenbaum
Chuang Gan
ViT
77
14
0
06 Apr 2023
RFAConv: Innovating Spatial Attention and Standard Convolutional Operation
X. Zhang
Chen Liu
Degang Yang
Tingting Song
Yichen Ye
Ke Li
Ying Song
21
110
0
06 Apr 2023
ReBotNet: Fast Real-time Video Enhancement
Jeya Maria Jose Valanarasu
Rahul Garg
Andeep S. Toor
Xin Tong
Weijuan Xi
Andreas Lugmayr
Vishal M. Patel
A. Menini
19
0
0
23 Mar 2023
FER-former: Multi-modal Transformer for Facial Expression Recognition
Yande Li
Mingjie Wang
Minglun Gong
Y. Lu
Li Liu
23
8
0
23 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
23
3
0
21 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
28
22
0
17 Mar 2023
LoG-CAN: local-global Class-aware Network for semantic segmentation of remote sensing images
Xiaowen Ma
Mengting Ma
Chenlu Hu
Zhiyuan Song
Zi-Shu Zhao
Tian Feng
Wei Zhang
38
12
0
14 Mar 2023
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention
Wenxiao Wang
Wei Chen
Qibo Qiu
Long Chen
Boxi Wu
Binbin Lin
Xiaofei He
Wei Liu
24
38
0
13 Mar 2023
RotoGBML: Towards Out-of-Distribution Generalization for Gradient-Based Meta-Learning
Min Zhang
Zifeng Zhuang
Zhitao Wang
Donglin Wang
Wen-Bin Li
46
5
0
12 Mar 2023
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Jierun Chen
Shiu-hong Kao
Hao He
Weipeng Zhuo
Song Wen
Chul-Ho Lee
Shueng-Han Gary Chan
OOD
32
776
0
07 Mar 2023
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
A Convolutional Vision Transformer for Semantic Segmentation of Side-Scan Sonar Data
Hayat Rajani
N. Gracias
Rafael García
ViT
19
12
0
24 Feb 2023
Deep Active Learning in the Presence of Label Noise: A Survey
Moseli Motsóehli
Kyungim Baek
NoLa
VLM
34
5
0
22 Feb 2023
Device Tuning for Multi-Task Large Model
Penghao Jiang
Xuanchen Hou
Y. Zhou
21
0
0
21 Feb 2023