ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.11605
  4. Cited By
Bottleneck Transformers for Visual Recognition

Bottleneck Transformers for Visual Recognition

27 January 2021
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
    SLR
ArXivPDFHTML

Papers citing "Bottleneck Transformers for Visual Recognition"

50 / 341 papers shown
Title
FMViT: A multiple-frequency mixing Vision Transformer
FMViT: A multiple-frequency mixing Vision Transformer
Wei Tan
Yifeng Geng
Xuansong Xie
ViT
16
3
0
09 Nov 2023
Selective Visual Representations Improve Convergence and Generalization
  for Embodied AI
Selective Visual Representations Improve Convergence and Generalization for Embodied AI
Ainaz Eftekhar
Kuo-Hao Zeng
Jiafei Duan
Ali Farhadi
Aniruddha Kembhavi
Ranjay Krishna
27
13
0
07 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Scattering Vision Transformer: Spectral Mixing Matters
Badri N. Patro
Vijay Srinivas Agneeswaran
29
14
0
02 Nov 2023
Constructing Sample-to-Class Graph for Few-Shot Class-Incremental
  Learning
Constructing Sample-to-Class Graph for Few-Shot Class-Incremental Learning
Fuyuan Hu
Jian Zhang
Fan Lyu
Linyan Li
Fenglei Xu
CLL
16
3
0
31 Oct 2023
Accelerating Vision Transformers Based on Heterogeneous Attention
  Patterns
Accelerating Vision Transformers Based on Heterogeneous Attention Patterns
Deli Yu
Teng Xi
Jianwei Li
Baopu Li
Gang Zhang
Haocheng Feng
Junyu Han
Jingtuo Liu
Errui Ding
Jingdong Wang
ViT
26
0
0
11 Oct 2023
Plug n' Play: Channel Shuffle Module for Enhancing Tiny Vision
  Transformers
Plug n' Play: Channel Shuffle Module for Enhancing Tiny Vision Transformers
Xuwei Xu
Sen Wang
Yudong Chen
Jiajun Liu
ViT
21
1
0
09 Oct 2023
Strength in Diversity: Multi-Branch Representation Learning for Vehicle
  Re-Identification
Strength in Diversity: Multi-Branch Representation Learning for Vehicle Re-Identification
Eurico Almeida
Bruno Silva
Jorge Batista
19
5
0
02 Oct 2023
OSNet & MNetO: Two Types of General Reconstruction Architectures for
  Linear Computed Tomography in Multi-Scenarios
OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios
Zhisheng Wang
Z. Deng
Fenglin Liu
Yixing Huang
Haijun Yu
Junning Cui
6
3
0
21 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
19
24
0
04 Sep 2023
QKSAN: A Quantum Kernel Self-Attention Network
QKSAN: A Quantum Kernel Self-Attention Network
Ren-Xin Zhao
Jinjing Shi
Xuelong Li
17
19
0
25 Aug 2023
Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion
  based Classification
Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification
Chengguo Yuan
Yu Jin
Zong-Yao Wu
Fanting Wei
Yangzirui Wang
Langlang Chen
Xiao Wang
ViT
90
6
0
23 Aug 2023
Patch Is Not All You Need
Patch Is Not All You Need
Chang-bo Li
Jie M. Zhang
Yang Wei
Zhilong Ji
Jinfeng Bai
Shiguang Shan
ViT
44
1
0
21 Aug 2023
Transformer-based Detection of Microorganisms on High-Resolution Petri
  Dish Images
Transformer-based Detection of Microorganisms on High-Resolution Petri Dish Images
Nikolas Ebert
D. Stricker
Oliver Wasenmüller
MedIm
ViT
23
4
0
18 Aug 2023
Distributionally Robust Classification on a Data Budget
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
27
2
0
07 Aug 2023
Causal reasoning in typical computer vision tasks
Causal reasoning in typical computer vision tasks
Kexuan Zhang
Qiyu Sun
Chaoqiang Zhao
Yang Tang
CML
26
11
0
26 Jul 2023
Regression-free Blind Image Quality Assessment with Content-Distortion
  Consistency
Regression-free Blind Image Quality Assessment with Content-Distortion Consistency
Xiaoqi Wang
Jian Xiong
Hao Gao
Weisi Lin
11
1
0
18 Jul 2023
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for
  Multi-Label Image Classification
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image Classification
Miaoge Li
Dongsheng Wang
Xinyang Liu
Zequn Zeng
Ruiying Lu
Bo Chen
Mingyuan Zhou
VLM
OT
20
15
0
18 Jul 2023
Scale-Aware Modulation Meet Transformer
Scale-Aware Modulation Meet Transformer
Wei-Shiang Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
20
66
0
17 Jul 2023
ShiftNAS: Improving One-shot NAS via Probability Shift
ShiftNAS: Improving One-shot NAS via Probability Shift
Mingyang Zhang
Xinyi Yu
Haodong Zhao
Linlin Ou
25
5
0
17 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
A Survey of Techniques for Optimizing Transformer Inference
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
35
62
0
16 Jul 2023
Marine Debris Detection in Satellite Surveillance using Attention
  Mechanisms
Marine Debris Detection in Satellite Surveillance using Attention Mechanisms
Ao Shen
Yijie Zhu
Richard Jiang
17
6
0
09 Jul 2023
NAR-Former V2: Rethinking Transformer for Universal Neural Network
  Representation Learning
NAR-Former V2: Rethinking Transformer for Universal Neural Network Representation Learning
Yun Yi
Haokui Zhang
Rong Xiao
Nan Wang
Xiaoyu Wang
GNN
24
2
0
19 Jun 2023
Securing Visually-Aware Recommender Systems: An Adversarial Image
  Reconstruction and Detection Framework
Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework
Minglei Yin
Bing Liu
Neil Zhenqiang Gong
Xin Li
AAML
9
1
0
11 Jun 2023
InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene
  Understanding
InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Hanrong Ye
Dan Xu
ViT
27
10
0
08 Jun 2023
CVSNet: A Computer Implementation for Central Visual System of The Brain
CVSNet: A Computer Implementation for Central Visual System of The Brain
Ruimin Gao
Hao-Li Zou
Zhekai Duan
26
3
0
31 May 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
H. Chen
Jingkuan Song
Feng Zheng
ViT
10
0
0
17 May 2023
CB-HVTNet: A channel-boosted hybrid vision transformer network for
  lymphocyte assessment in histopathological images
CB-HVTNet: A channel-boosted hybrid vision transformer network for lymphocyte assessment in histopathological images
Momina Liaqat Ali
Zunaira Rauf
Asifullah Khan
A. Sohail
Rafi Ullah
Jeonghwan Gwak
MedIm
ViT
34
2
0
16 May 2023
Understanding Gaussian Attention Bias of Vision Transformers Using
  Effective Receptive Fields
Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields
Bum Jun Kim
Hyeyeon Choi
Hyeonah Jang
Sang Woo Kim
ViT
18
3
0
08 May 2023
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Early Detection of Alzheimer's Disease using Bottleneck Transformers
Arunima Jaiswal
Ananya Sadana
MedIm
18
2
0
01 May 2023
Adaptive-Mask Fusion Network for Segmentation of Drivable Road and
  Negative Obstacle With Untrustworthy Features
Adaptive-Mask Fusion Network for Segmentation of Drivable Road and Negative Obstacle With Untrustworthy Features
Zhen Feng
Yuchao Feng
Yanning Guo
Yuxiang Sun
8
6
0
27 Apr 2023
AutoFocusFormer: Image Segmentation off the Grid
AutoFocusFormer: Image Segmentation off the Grid
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
A. Schwing
Alex Colburn
Li Fuxin
17
9
0
24 Apr 2023
MLP-AIR: An Efficient MLP-Based Method for Actor Interaction Relation
  Learning in Group Activity Recognition
MLP-AIR: An Efficient MLP-Based Method for Actor Interaction Relation Learning in Group Activity Recognition
Guoliang Xu
Jianqin Yin
19
1
0
18 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision
  Transformer
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
27
47
0
13 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention
  and Residual Connection in Kernel Space
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
21
2
0
13 Apr 2023
Life Regression based Patch Slimming for Vision Transformers
Life Regression based Patch Slimming for Vision Transformers
Jiawei Chen
Lin Chen
Jianguo Yang
Tianqi Shi
Lechao Cheng
Zunlei Feng
Min-Gyoo Song
ViT
28
4
0
11 Apr 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
28
11
0
08 Apr 2023
Visual Dependency Transformers: Dependency Tree Emerges from Reversed
  Attention
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Mingyu Ding
Yikang Shen
Lijie Fan
Zhenfang Chen
Z. Chen
Ping Luo
J. Tenenbaum
Chuang Gan
ViT
77
14
0
06 Apr 2023
RFAConv: Innovating Spatial Attention and Standard Convolutional
  Operation
RFAConv: Innovating Spatial Attention and Standard Convolutional Operation
X. Zhang
Chen Liu
Degang Yang
Tingting Song
Yichen Ye
Ke Li
Ying Song
21
110
0
06 Apr 2023
ReBotNet: Fast Real-time Video Enhancement
ReBotNet: Fast Real-time Video Enhancement
Jeya Maria Jose Valanarasu
Rahul Garg
Andeep S. Toor
Xin Tong
Weijuan Xi
Andreas Lugmayr
Vishal M. Patel
A. Menini
19
0
0
23 Mar 2023
FER-former: Multi-modal Transformer for Facial Expression Recognition
FER-former: Multi-modal Transformer for Facial Expression Recognition
Yande Li
Mingjie Wang
Minglun Gong
Y. Lu
Li Liu
23
8
0
23 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training
  Efficiency
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
23
3
0
21 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method
  using cross attention and latent transformer
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
28
22
0
17 Mar 2023
LoG-CAN: local-global Class-aware Network for semantic segmentation of
  remote sensing images
LoG-CAN: local-global Class-aware Network for semantic segmentation of remote sensing images
Xiaowen Ma
Mengting Ma
Chenlu Hu
Zhiyuan Song
Zi-Shu Zhao
Tian Feng
Wei Zhang
38
12
0
14 Mar 2023
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale
  Attention
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention
Wenxiao Wang
Wei Chen
Qibo Qiu
Long Chen
Boxi Wu
Binbin Lin
Xiaofei He
Wei Liu
24
38
0
13 Mar 2023
RotoGBML: Towards Out-of-Distribution Generalization for Gradient-Based
  Meta-Learning
RotoGBML: Towards Out-of-Distribution Generalization for Gradient-Based Meta-Learning
Min Zhang
Zifeng Zhuang
Zhitao Wang
Donglin Wang
Wen-Bin Li
46
5
0
12 Mar 2023
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Jierun Chen
Shiu-hong Kao
Hao He
Weipeng Zhuo
Song Wen
Chul-Ho Lee
Shueng-Han Gary Chan
OOD
32
776
0
07 Mar 2023
Self-attention in Vision Transformers Performs Perceptual Grouping, Not
  Attention
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
A Convolutional Vision Transformer for Semantic Segmentation of
  Side-Scan Sonar Data
A Convolutional Vision Transformer for Semantic Segmentation of Side-Scan Sonar Data
Hayat Rajani
N. Gracias
Rafael García
ViT
19
12
0
24 Feb 2023
Deep Active Learning in the Presence of Label Noise: A Survey
Deep Active Learning in the Presence of Label Noise: A Survey
Moseli Motsóehli
Kyungim Baek
NoLa
VLM
34
5
0
22 Feb 2023
Device Tuning for Multi-Task Large Model
Device Tuning for Multi-Task Large Model
Penghao Jiang
Xuanchen Hou
Y. Zhou
21
0
0
21 Feb 2023
Previous
1234567
Next