arXiv: 2201.00520
Vision Transformer with Deformable Attention

3 January 2022
Zhuofan Xia
Xuran Pan
S. Song
Li Erran Li
Gao Huang
    ViT

Papers citing "Vision Transformer with Deformable Attention"

50 / 60 papers shown
Title
RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation
Zhiwen Zeng
Yunfei Yin
Zheng Yuan
Argho Dey
Xianjian Bao
31
0
0
10 May 2025
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
Shouyang Dong
Yuanbo Wen
Jun Bi
Di Huang
Jiaming Guo
...
Yifan Hao
Xuehai Zhou
Tianshi Chen
Qi Guo
Yunji Chen
24
0
0
04 May 2025
SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration
Xi Tong
Xing Luo
Jiangxin Yang
Yanpeng Cao
31
0
0
17 Apr 2025
FASR-Net: Unsupervised Shadow Removal Leveraging Inherent Frequency Priors
Tao Lin
Qingwang Wang
Qiwei Liang
Minghua Tang
Yuxuan Sun
CVBM
57
0
0
08 Apr 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
71
0
0
18 Mar 2025
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Cuixin Yang
Rongkang Dong
Jun Xiao
Cong Zhang
Kin-Man Lam
Fei Zhou
Guoping Qiu
87
1
0
17 Jan 2025
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
42
0
0
12 Nov 2024
Towards Global Localization using Multi-Modal Object-Instance Re-Identification
Aneesh Chavan
Vaibhav Agrawal
Vineeth Bhat
Sarthak Chittawar
Siddharth Srivastava
Chetan Arora
K. M. Krishna
87
0
0
18 Sep 2024
MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection
Xiangbo Gao
A. Kanu-Asiegbu
Xiaoxiao Du
Mamba
35
0
0
02 Aug 2024
SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation
Lin Zhang
Wenbo Gao
Jie Yi
Yunyun Yang
43
0
0
14 Jul 2024
DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting
Yuxuan Shu
Vasileios Lampos
AI4TS
AI4CE
63
0
0
11 Jun 2024
TwinS: Revisiting Non-Stationarity in Multivariate Time Series Forecasting
Jiaxi Hu
Qingsong Wen
Sijie Ruan
Li Liu
Yuxuan Liang
AI4TS
28
5
0
06 Jun 2024
Sharing Key Semantics in Transformer Makes Efficient Image Restoration
Bin Ren
Yawei Li
Jingyun Liang
Rakesh Ranjan
Mengyuan Liu
Rita Cucchiara
Luc Van Gool
Ming-Hsuan Yang
N. Sebe
37
3
0
30 May 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
48
5
0
22 May 2024
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak
Yaroslav Romanus
Bohdan Hlovatskyi
Bohdan Sydor
Oles Dobosevych
Igor Babin
Roman Riazantsev
VOS
42
3
0
11 May 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
43
7
0
28 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
43
1
0
15 Mar 2024
A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning
Yuelin Zhang
Pengyu Zheng
Wanquan Yan
Chengyu Fang
Shing Shin Cheng
MedIm
27
7
0
05 Mar 2024
Multi-Human Mesh Recovery with Transformers
Zeyu Wang
Zhenzhen Weng
Serena Yeung-Levy
3DH
32
1
0
26 Feb 2024
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
52
4
0
17 Feb 2024
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
34
1
0
25 Jan 2024
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
33
0
0
01 Dec 2023
HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition
Lei Wang
Bo Liu
Yinchi Ma
Congcong Li
Wei Yao
CVBM
37
0
0
19 Nov 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
31
4
0
10 Oct 2023
Low-Resolution Self-Attention for Semantic Segmentation
Yu-Huan Wu
Shi-Chen Zhang
Yun-Hai Liu
Le Zhang
Xin Zhan
Daquan Zhou
Jiashi Feng
Ming-Ming Cheng
Liangli Zhen
ViT
40
3
0
08 Oct 2023
Addressing Data Misalignment in Image-LiDAR Fusion on Point Cloud Segmentation
Wei-Jong Yang
Guan-Cheng Lee
21
0
0
26 Sep 2023
UniBEV: Multi-modal 3D Object Detection with Uniform BEV Encoders for Robustness against Missing Sensor Modalities
Shiming Wang
Holger Caesar
Liangliang Nan
Julian F. P. Kooij
61
11
0
25 Sep 2023
3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia
H. Nguyen
Michael Clement
Boris Mansencal
Pierrick Coupé
MedIm
26
0
0
06 Sep 2023
UGCANet: A Unified Global Context-Aware Transformer-based Network with Feature Alignment for Endoscopic Image Analysis
Pham Vu Hung
N. Manh
Nguyen Thi Oanh
N. T. Thuy
D. V. Sang
ViT
MedIm
24
3
0
12 Jul 2023
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
37
28
0
01 Jun 2023
DarSwin: Distortion Aware Radial Swin Transformer
Akshay Athwale
Ichrak Shili
Émile Bergeron
Arman Afrasiyabi
Justin Lague
Ola Ahmad
Jean-François Lalonde
26
6
0
19 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
48
11
0
06 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
29
7
0
04 Apr 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
21
38
0
27 Mar 2023
Adaptive Rotated Convolution for Rotated Object Detection
Yifan Pu
Yiru Wang
Zhuofan Xia
Yizeng Han
Yulin Wang
Weihao Gan
Zidong Wang
S. Song
Gao Huang
20
76
0
14 Mar 2023
Grid-Centric Traffic Scenario Perception for Autonomous Driving: A Comprehensive Review
Yining Shi
Kun Jiang
Jiusi Li
Zelin Qian
Jun Wen
Mengmeng Yang
Ke Wang
Diange Yang
86
25
0
02 Mar 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
26
6
0
16 Feb 2023
Dynamic Background Reconstruction via MAE for Infrared Small Target Detection
Jingchao Peng
Haitao Zhao
Kaijie Zhao
Zhongze Wang
Lujian Yao
9
2
0
11 Jan 2023
FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection
Shuailei Ma
Yuefeng Wang
Shanze Wang
Ying-yu Wei
37
33
0
08 Jan 2023
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
J. Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
24
6
0
16 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
30
159
0
15 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
X. Wang
ViT
32
21
0
13 Dec 2022
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation
Daitao Xing
Jinglin Shen
C. Ho
Anthony Tzes
ViT
MDE
24
4
0
12 Dec 2022
Cross-Modal Learning with 3D Deformable Attention for Action Recognition
Sangwon Kim
Dasom Ahn
ByoungChul Ko
ViT
3DPC
27
24
0
12 Dec 2022
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
28
22
0
19 Nov 2022
Contrastive Language-Image Pre-Training with Knowledge Graphs
Xuran Pan
Tianzhu Ye
Dongchen Han
S. Song
Gao Huang
VLM
CLIP
22
43
0
17 Oct 2022
Dynamic Graph Message Passing Networks for Visual Recognition
Li Zhang
Mohan Chen
Anurag Arnab
Xiangyang Xue
Philip H. S. Torr
GNN
29
1
0
20 Sep 2022
Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation
Zhitong Xiong
Haopeng Li
Xiao Xiang Zhu
35
35
0
30 Jul 2022
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Haowen Shi
Simon Reiß
Kunyu Peng
Chaoxiang Ma
Haodong Fu
Philip H. S. Torr
Kaiwei Wang
Rainer Stiefelhagen
ViT
MDE
31
35
0
25 Jul 2022
HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation
Moein Heidari
A. Kazerouni
Milad Soltany Kadarvish
Reza Azad
Ehsan Khodapanah Aghdam
Julien Cohen-Adad
Dorit Merhof
MedIm
ViT
25
178
0
18 Jul 2022