ResearchTrend.AI
Bottleneck Transformers for Visual Recognition


27 January 2021
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
    SLR

Papers citing "Bottleneck Transformers for Visual Recognition"

50 / 341 papers shown
MedViT: A Robust Vision Transformer for Generalized Medical Image Classification
Omid Nejati Manzari
Hamid Ahmadabadi
Hossein Kashiani
S. B. Shokouhi
Ahmad Ayatollahi
ViT
MedIm
26
176
0
19 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
27
3
0
18 Feb 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
26
6
0
16 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
43
32
0
09 Feb 2023
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
Jiayu Jiao
Yuyao Tang
Kun-Li Channing Lin
Yipeng Gao
Jinhua Ma
Yaowei Wang
Wei-Shi Zheng
MedIm
ViT
19
136
0
03 Feb 2023
Cluster-CAM: Cluster-Weighted Visual Interpretation of CNNs' Decision in Image Classification
Zhenpeng Feng
H. Ji
M. Daković
Xiyang Cui
Mingzhe Zhu
Ljubisa Stankovic
19
7
0
03 Feb 2023
Semantic Segmentation Enhanced Transformer Model for Human Attention Prediction
Shuo Zhang
ViT
18
0
0
26 Jan 2023
Part-guided Relational Transformers for Fine-grained Visual Recognition
Yifan Zhao
Jia Li
Xiaowu Chen
Yonghong Tian
ViT
29
34
0
28 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
22
0
0
14 Dec 2022
CamoFormer: Masked Separable Attention for Camouflaged Object Detection
Bo Yin
Xuying Zhang
Qibin Hou
Bo Sun
Deng-Ping Fan
Luc Van Gool
18
51
0
10 Dec 2022
Cross-Domain Synthetic-to-Real In-the-Wild Depth and Normal Estimation for 3D Scene Understanding
Jay Bhanushali
Manivannan Muniyandi
Praneeth Chakravarthula
3DPC
ViT
14
2
0
09 Dec 2022
Dunhuang murals contour generation network based on convolution and self-attention fusion
Bao-Yu Liu
Fengjie He
Shiqiang Du
Kaiwu Zhang
Jianhua Wang
3DPC
40
6
0
02 Dec 2022
Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images
Meng Wang
Kai-An Yu
Chun-Mei Feng
K. Zou
Yanyu Xu
Qingquan Meng
Rick Siow Mong Goh
Yong Liu
H. Fu
MedIm
22
3
0
01 Dec 2022
Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
Tan Yu
Ping Li
ViT
46
5
0
25 Nov 2022
Learnable Spectral Wavelets on Dynamic Graphs to Capture Global Interactions
Anson Bastos
Abhishek Nadgeri
Kuldeep Singh
Toyotaro Suzumura
Manish Singh
33
7
0
22 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
28
129
0
22 Nov 2022
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
26
22
0
19 Nov 2022
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViT
MedIm
22
34
0
18 Nov 2022
Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation
Yiyue Hu
Lei Zhang
Nan Mu
Leijun Liu
ViT
MedIm
20
1
0
17 Nov 2022
Fcaformer: Forward Cross Attention in Hybrid Vision Transformer
Haokui Zhang
Wenze Hu
Xiaoyu Wang
ViT
17
8
0
14 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
25
6
0
14 Nov 2022
Studying inductive biases in image classification task
N. Arizumi
21
1
0
31 Oct 2022
An Effective Deep Network for Head Pose Estimation without Keypoints
Chien Thai
Viet Tran
Minh Bui
Huong Ninh
Hai Yen Tran
3DH
CVBM
8
3
0
25 Oct 2022
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers
Zhuo Huang
Zhiyou Zhao
Banghuai Li
Jungong Han
3DPC
ViT
27
55
0
23 Oct 2022
Similarity of Neural Architectures using Adversarial Attack Transferability
Jaehui Hwang
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
24
1
0
20 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
29
47
0
13 Oct 2022
FontTransformer: Few-shot High-resolution Chinese Glyph Image Synthesis via Stacked Transformers
Yitian Liu
Z. Lian
39
13
0
12 Oct 2022
SaiT: Sparse Vision Transformers through Adaptive Token Pruning
Ling Li
D. Thorsley
Joseph Hassoun
ViT
25
17
0
11 Oct 2022
Block Format Error Bounds and Optimal Block Size Selection
I. Soloveychik
I. Lyubomirsky
Xin Eric Wang
S. Bhoja
MQ
27
4
0
11 Oct 2022
Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs
Taojiannan Yang
Haokui Zhang
Wenze Hu
C. L. P. Chen
Xiaoyu Wang
ViT
11
0
0
08 Oct 2022
Time-Space Transformers for Video Panoptic Segmentation
Andra Petrovai
S. Nedevschi
ViT
19
3
0
07 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
30
58
0
04 Oct 2022
Exploring the Relationship between Architecture and Adversarially Robust Generalization
Aishan Liu
Shiyu Tang
Siyuan Liang
Ruihao Gong
Boxi Wu
Xianglong Liu
Dacheng Tao
AAML
26
18
0
28 Sep 2022
Dynamic Graph Message Passing Networks for Visual Recognition
Li Zhang
Mohan Chen
Anurag Arnab
Xiangyang Xue
Philip H. S. Torr
GNN
29
1
0
20 Sep 2022
Swin-transformer-yolov5 For Real-time Wine Grape Bunch Detection
Shenglian Lu
Xiaoyu Liu
Zixuan He
Wenbo Liu
Xin Zhang
Manoj Karkee
10
38
0
30 Aug 2022
MRL: Learning to Mix with Attention and Convolutions
Shlok Mohta
Hisahiro Suganuma
Yoshiki Tanaka
20
2
0
30 Aug 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
29
9
0
24 Aug 2022
FocusFormer: Focusing on What We Need via Architecture Sampler
Jing Liu
Jianfei Cai
Bohan Zhuang
27
7
0
23 Aug 2022
DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Jingyu Lin
Jie Jiang
Y. Yan
Chunchao Guo
Hongfa Wang
Wei Liu
Hanzi Wang
ViT
29
3
0
21 Aug 2022
Improved Image Classification with Token Fusion
Keong-Hun Choi
Jin-Woo Kim
Yaolong Wang
J. Ha
ViT
17
0
0
19 Aug 2022
A Vision Transformer-Based Approach to Bearing Fault Classification via Vibration Signals
Abid Hasan Zim
Aeyan Ashraf
Aquib Iqbal
Asad U. Malik
Minoru Kuribayashi
10
10
0
15 Aug 2022
Recent Progress in Transformer-based Medical Image Analysis
Zhao-cheng Liu
Qiujie Lv
Ziduo Yang
Yifan Li
Chau Hung Lee
Leizhao Shen
MedIm
40
56
0
13 Aug 2022
Memorizing Complementation Network for Few-Shot Class-Incremental Learning
Zhong Ji
Zhi Hou
Xiyao Liu
Yanwei Pang
Xuelong Li
CLL
16
45
0
11 Aug 2022
Label-Efficient Domain Generalization via Collaborative Exploration and Generalization
Junkun Yuan
Xu Ma
Defang Chen
Kun Kuang
Fei Wu
Lanfen Lin
16
25
0
07 Aug 2022
Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem
Zheng Wang
Wenjie Ruan
ViT
34
8
0
01 Aug 2022
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Cong Wang
Hongmin Xu
Xiong Zhang
Li Wang
Zhitong Zheng
Haifeng Liu
ViT
12
20
0
27 Jul 2022
Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer
Yingyi Chen
Xiaoke Shen
Yahui Liu
Qinghua Tao
Johan A. K. Suykens
AAML
ViT
21
22
0
25 Jul 2022
SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling
Ho Man Kwan
Shenghui Song
11
1
0
23 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
19
1
0
20 Jul 2022
QSAN: A Near-term Achievable Quantum Self-Attention Network
Jinjing Shi
Ren-Xin Zhao
Wenxuan Wang
Shenmin Zhang
Xuelong Li
19
20
0
14 Jul 2022