Deformable DETR: Deformable Transformers for End-to-End Object Detection

8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs) | PDF | HTML | GitHub (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,533 papers shown
Title
FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting
Junke Wang
Shaoxiang Chen
Zuxuan Wu
Yu-Gang Jiang
CVBM
65
25
0
10 Aug 2021
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNN
CoGe
77
17
0
08 Aug 2021
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Lei Bai
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
124
35
0
07 Aug 2021
Automatic Rail Component Detection Based on AttnConv-Net
Tian-hu Wang
Zijun Zhang
Fangfang Yang
K. Tsui
28
12
0
05 Aug 2021
Fast Convergence of DETR with Spatially Modulated Co-Attention
Peng Gao
Minghang Zheng
Xiaogang Wang
Jifeng Dai
Hongsheng Li
ViT
87
307
0
05 Aug 2021
Armour: Generalizable Compact Self-Attention for Vision Transformers
Lingchuan Meng
ViT
27
3
0
03 Aug 2021
Vision Transformer with Progressive Sampling
Xiaoyu Yue
Shuyang Sun
Zhanghui Kuang
Meng Wei
Philip Torr
Wayne Zhang
Dahua Lin
ViT
89
85
0
03 Aug 2021
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Yifan Xu
Zhijie Zhang
Mengdan Zhang
Kekai Sheng
Ke Li
Weiming Dong
Liqing Zhang
Changsheng Xu
Xing Sun
ViT
94
215
0
03 Aug 2021
AGAR a microbial colony dataset for deep learning detection
Sylwia Majchrowska
J. Pawlowski
Grzegorz Gula
T. Bonus
Agata Hanas
Adam Loch
A. Pawlak
J. Roszkowiak
Tomasz Golan
Z. Drulis-Kawa
41
24
0
03 Aug 2021
GraphFPN: Graph Feature Pyramid Network for Object Detection
Gangming Zhao
Weifeng Ge
Yizhou Yu
ObjD
163
84
0
02 Aug 2021
Multi-scale Matching Networks for Semantic Correspondence
Dongyang Zhao
Ziyang Song
Zhenghao Ji
Gangming Zhao
Weifeng Ge
Yizhou Yu
99
49
0
31 Jul 2021
Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding
Heng Zhao
Qiufeng Wang
Yew-Soon Ong
ObjD
74
26
0
31 Jul 2021
HiFT: Hierarchical Feature Transformer for Aerial Tracking
Ziang Cao
Changhong Fu
Junjie Ye
Bowen Li
Yiming Li
85
198
0
31 Jul 2021
DPT: Deformable Patch-based Transformer for Visual Recognition
Zhiyang Chen
Yousong Zhu
Chaoyang Zhao
Guosheng Hu
Wei Zeng
Jinqiao Wang
Ming Tang
ViT
65
101
0
30 Jul 2021
A Unified Efficient Pyramid Transformer for Semantic Segmentation
Fangrui Zhu
Yi Zhu
Li Zhang
Chongruo Wu
Yanwei Fu
Mu Li
ViT
105
30
0
29 Jul 2021
PPT Fusion: Pyramid Patch Transformer for a Case Study in Image Fusion
Yu Fu
Tianyang Xu
Xiaojun Wu
J. Kittler
ViT
63
40
0
29 Jul 2021
Bridging Gap between Image Pixels and Semantics via Supervision: A Survey
Jiali Duan
C.-C. Jay Kuo
87
8
0
29 Jul 2021
PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Bin Tan
Nan Xue
S. Bai
Tianfu Wu
Guisong Xia
ViT
117
40
0
27 Jul 2021
Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers
Wen Wang
Yang Cao
Jing Zhang
Fengxiang He
Zhengjun Zha
Yonggang Wen
Dacheng Tao
ViT
113
96
0
27 Jul 2021
Query2Label: A Simple Transformer Way to Multi-Label Classification
Shilong Liu
Lei Zhang
Xiao Yang
Hang Su
Jun Zhu
68
193
0
22 Jul 2021
CycleMLP: A MLP-like Architecture for Dense Prediction
Shoufa Chen
Enze Xie
Chongjian Ge
Runjian Chen
Ding Liang
Ping Luo
135
235
0
21 Jul 2021
Image Fusion Transformer
VS Vibashan
Jeya Maria Jose Valanarasu
Poojan Oza
Vishal M. Patel
ViT
79
123
0
19 Jul 2021
Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark
Haopeng Li
Lingbo Liu
Kunlin Yang
Shinan Liu
Junyuan Gao
Bin Zhao
Rui Zhang
Jun Hou
138
16
0
19 Jul 2021
RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition
Yunqing Hu
Xuan Jin
Yin Zhang
Ha Hong
Jingfeng Zhang
Yuan He
Hui Xue
ViT
82
104
0
17 Jul 2021
Transformer with Peak Suppression and Knowledge Guidance for Fine-grained Image Recognition
Xinda Liu
Lili Wang
Xiaoguang Han
ViT
94
70
0
14 Jul 2021
FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
Sheng-Chun Kao
Suvinay Subramanian
Gaurav Agrawal
Amir Yazdanbakhsh
T. Krishna
128
64
0
13 Jul 2021
GiT: Graph Interactive Transformer for Vehicle Re-identification
Fei Shen
Yi Xie
Jianqing Zhu
Xiaobin Zhu
Huanqiang Zeng
ViT
115
113
0
12 Jul 2021
Semi-Supervised Object Detection with Adaptive Class-Rebalancing Self-Training
Fangyuan Zhang
Tianxiang Pan
Bin Wang
88
55
0
11 Jul 2021
Local-to-Global Self-Attention in Vision Transformers
Jinpeng Li
Yichao Yan
Tianran Ouyang
Xiaokang Yang
Ling Shao
ViT
64
29
0
10 Jul 2021
Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
92
62
0
07 Jul 2021
GLiT: Neural Architecture Search for Global and Local Image Transformer
Boyu Chen
Peixia Li
Chuming Li
Baopu Li
Lei Bai
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
115
86
0
07 Jul 2021
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
Xiaoyi Dong
Jianmin Bao
Dongdong Chen
Weiming Zhang
Nenghai Yu
Lu Yuan
Dong Chen
B. Guo
ViT
181
993
0
01 Jul 2021
Focal Self-attention for Local-Global Interactions in Vision Transformers
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
ViT
84
436
0
01 Jul 2021
CBNet: A Composite Backbone Network Architecture for Object Detection
Tingting Liang
Xiao Chu
Yudong Liu
Yongtao Wang
Zhi Tang
Wei Chu
Jingdong Chen
Haibin Ling
ObjD
105
166
0
01 Jul 2021
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
124
372
0
28 Jun 2021
Multi-Compound Transformer for Accurate Biomedical Image Segmentation
Yuanfeng Ji
Ruimao Zhang
Huijie Wang
Zhen Li
Lingyun Wu
Shaoting Zhang
Ping Luo
ViT
MedIm
79
138
0
28 Jun 2021
Post-Training Quantization for Vision Transformer
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViT
MQ
112
346
0
27 Jun 2021
OffRoadTranSeg: Semi-Supervised Segmentation using Transformers on OffRoad environments
Anukriti Singh
Kartikeya Singh
P. B. Sujit
ViT
62
8
0
26 Jun 2021
ViTAS: Vision Transformer Architecture Search
Xiu Su
Shan You
Jiyang Xie
Mingkai Zheng
Fei Wang
Chao Qian
Changshui Zhang
Xiaogang Wang
Chang Xu
ViT
92
55
0
25 Jun 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Hongwei Xue
Yupan Huang
Bei Liu
Houwen Peng
Jianlong Fu
Houqiang Li
Jiebo Luo
90
89
0
25 Jun 2021
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
128
328
0
24 Jun 2021
IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers
Bowen Pan
Yikang Shen
Yi Ding
Zhangyang Wang
Rogerio Feris
A. Oliva
VLM
ViT
121
165
0
23 Jun 2021
Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images
Libo Wang
Rui Li
Dongzhi Wang
Chenxi Duan
Teng Wang
Xiaoliang Meng
ViT
120
152
0
23 Jun 2021
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
103
13
0
23 Jun 2021
P2T: Pyramid Pooling Transformer for Scene Understanding
Yu-Huan Wu
Yun-Hai Liu
Xin Zhan
Ming-Ming Cheng
ViT
153
233
0
22 Jun 2021
Towards Biologically Plausible Convolutional Networks
Roman Pogodin
Yash Mehta
Timothy Lillicrap
P. Latham
109
22
0
22 Jun 2021
OadTR: Online Action Detection with Transformers
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Zhe Zuo
Changxin Gao
Nong Sang
OffRL
ViT
100
117
0
21 Jun 2021
Structured Sparse R-CNN for Direct Scene Graph Generation
Yao Teng
Limin Wang
3DPC
GNN
108
56
0
21 Jun 2021
End-to-end Temporal Action Detection with Transformer
Xiaolong Liu
Qimeng Wang
Yao Hu
Xu Tang
Shiwei Zhang
S. Bai
X. Bai
ViT
112
233
0
18 Jun 2021
Efficient Self-supervised Vision Transformers for Representation Learning
Chunyuan Li
Jianwei Yang
Pengchuan Zhang
Mei Gao
Bin Xiao
Xiyang Dai
Lu Yuan
Jianfeng Gao
ViT
108
214
0
17 Jun 2021