arXiv:2010.04159
Deformable DETR: Deformable Transformers for End-to-End Object Detection

8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs) | PDF | HTML | Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,533 papers shown
ISTR: End-to-End Instance Segmentation with Transformers
Jie Hu
Liujuan Cao
Yao Lu
Shengchuan Zhang
Yan Wang
Ke Li
Feiyue Huang
Ling Shao
Rongrong Ji
ISeg
84
96
0
03 May 2021
CAT: Cross-Attention Transformer for One-Shot Object Detection
Weidong Lin
Yuyang Deng
Yang Gao
Ning Wang
Jinghao Zhou
Lingqiao Liu
Lei Zhang
Peng Wang
ViT
124
9
0
30 Apr 2021
Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation
Shreyas Hampali
S. Sarkar
Mahdi Rad
Vincent Lepetit
3DH
110
137
0
29 Apr 2021
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
Xiangxiang Chu
Zhi Tian
Yuqing Wang
Bo Zhang
Haibing Ren
Xiaolin K. Wei
Huaxia Xia
Chunhua Shen
ViT
100
1,031
0
28 Apr 2021
Segmentation-Based Bounding Box Generation for Omnidirectional Pedestrian Detection
Masato Tamura
Tomoaki Yoshinaga
105
2
0
28 Apr 2021
ConTNet: Why not use convolution and transformer at the same time?
Haotian Yan
Zhe Li
Weijian Li
Changhu Wang
Ming Wu
Chuang Zhang
ViT
88
77
0
27 Apr 2021
Vision Transformers with Patch Diversification
Chengyue Gong
Dilin Wang
Meng Li
Vikas Chandra
Qiang Liu
ViT
90
64
0
26 Apr 2021
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jianxiong Pan
Kaiwen Cui
Shijian Lu
Feiying Ma
Xuansong Xie
Chunyan Miao
ViT
97
152
0
26 Apr 2021
Visual Saliency Transformer
Nian Liu
Ni Zhang
Kaiyuan Wan
Ling Shao
Junwei Han
ViT
322
363
0
25 Apr 2021
M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
Tianrui Guan
Jun Wang
Shiyi Lan
Rohan Chandra
Zuxuan Wu
Larry S. Davis
Tianyi Zhou
ViT, 3DPC
83
123
0
24 Apr 2021
Region-Adaptive Deformable Network for Image Quality Assessment
Shu Shi
Qingyan Bai
Ming Cao
Weihao Xia
Jiahao Wang
Yifan Chen
Yujiu Yang
62
20
0
23 Apr 2021
All Tokens Matter: Token Labeling for Training Better Vision Transformers
Zihang Jiang
Qibin Hou
Li-xin Yuan
Daquan Zhou
Yujun Shi
Xiaojie Jin
Anran Wang
Jiashi Feng
ViT
134
210
0
22 Apr 2021
Generative Transformer for Accurate and Reliable Salient Object Detection
Yuxin Mao
Jing Zhang
Zhexiong Wan
Yuchao Dai
Aixuan Li
Yun-Qiu Lv
Xinyu Tian
Deng-Ping Fan
Nick Barnes
ViT
145
34
0
20 Apr 2021
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
Junke Wang
Zuxuan Wu
Wenhao Ouyang
Xintong Han
Jingjing Chen
Ser-Nam Lim
Yu-Gang Jiang
ViT
179
275
0
20 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
94
348
0
17 Apr 2021
Vision Transformer Pruning
Mingjian Zhu
Yehui Tang
Kai Han
ViT
95
92
0
17 Apr 2021
Vision Transformer using Low-level Chest X-ray Feature Corpus for COVID-19 Diagnosis and Severity Quantification
Sangjoon Park
Gwanghyun Kim
Y. Oh
J. Seo
Sang Min Lee
Jin Hwan Kim
Sungjun Moon
Jae-Kwang Lim
Jong Chul Ye
ViT, MedIm
110
97
0
15 Apr 2021
Co-Scale Conv-Attentional Image Transformers
Weijian Xu
Yifan Xu
Tyler A. Chang
Zhuowen Tu
ViT
59
377
0
13 Apr 2021
HIH: Towards More Accurate Face Alignment via Heatmap in Heatmap
Xing Lan
Qinghao Hu
Qiang Chen
Jian Xue
Jian Cheng
CVBM
73
20
0
07 Apr 2021
Efficient DETR: Improving End-to-End Object Detector with Dense Prior
Z. Yao
Jiangbo Ai
Boxun Li
Chi Zhang
ViT
117
225
0
03 Apr 2021
TubeR: Tubelet Transformer for Video Action Detection
Jiaojiao Zhao
Yanyi Zhang
Xinyu Li
Hao Chen
Shuai Bing
...
Yuanjun Xiong
Davide Modolo
I. Marsic
Cees G. M. Snoek
Joseph Tighe
ViT
84
74
0
02 Apr 2021
Bridging Global Context Interactions for High-Fidelity Image Completion
Chuanxia Zheng
Tat-Jen Cham
Jianfei Cai
Dinh Q. Phung
ViT
85
78
0
02 Apr 2021
Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning
Juliano Pinto
Georg Hess
William Ljungbergh
Yuxuan Xia
Lennart Svensson
H. Wymeersch
148
17
0
01 Apr 2021
DA-DETR: Domain Adaptive Detection Transformer with Information Fusion
Jingyi Zhang
Jiaxing Huang
Zhipeng Luo
Gongjie Zhang
Xiaoqin Zhang
Shijian Lu
ViT
88
36
0
31 Mar 2021
Rethinking Spatial Dimensions of Vision Transformers
Byeongho Heo
Sangdoo Yun
Dongyoon Han
Sanghyuk Chun
Junsuk Choe
Seong Joon Oh
ViT
540
585
0
30 Mar 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
162
1,923
0
29 Mar 2021
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
Hila Chefer
Shir Gur
Lior Wolf
ViT
85
326
0
29 Mar 2021
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
115
145
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
113
337
0
29 Mar 2021
TFPose: Direct Human Pose Estimation with Transformers
Wei Mao
Yongtao Ge
Chunhua Shen
Zhi Tian
Xinlong Wang
Zhibin Wang
ViT
98
89
0
29 Mar 2021
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking
Yihong Xu
Yutong Ban
Guillaume Delorme
Chuang Gan
Daniela Rus
Xavier Alameda-Pineda
VOT
94
96
0
28 Mar 2021
Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers
Tianyu Zhu
Markus Hiller
Mahsa Ehsanpour
Rongkai Ma
Tom Drummond
Ian Reid
Hamid Rezatofighi
VOT
74
36
0
27 Mar 2021
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation
Wenhao Li
Hong Liu
Runwei Ding
Mengyuan Liu
Pichao Wang
Wenming Yang
ViT
176
198
0
26 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
492
21,752
0
25 Mar 2021
USB: Universal-Scale Object Detection Benchmark
Yosuke Shinya
ObjD
102
13
0
25 Mar 2021
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
Changlin Li
Tao Tang
Guangrun Wang
Jiefeng Peng
Bing Wang
Xiaodan Liang
Xiaojun Chang
ViT
147
107
0
23 Mar 2021
Conditional Training with Bounding Map for Universal Lesion Detection
Han Li
Long Chen
Hu Han
S. Kevin Zhou
MedIm, AI4CE
68
9
0
23 Mar 2021
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
Lucas Stoffl
Maxime Vidal
Alexander Mathis
ViT
72
52
0
22 Mar 2021
Incorporating Convolution Designs into Visual Transformers
Kun Yuan
Shaopeng Guo
Ziwei Liu
Aojun Zhou
F. Yu
Wei Wu
ViT
115
483
0
22 Mar 2021
Meta-DETR: Image-Level Few-Shot Object Detection with Inter-Class Correlation Exploitation
Gongjie Zhang
Zhipeng Luo
Kaiwen Cui
Shijian Lu
ViT
82
78
0
22 Mar 2021
Scalable Vision Transformers with Hierarchical Pooling
Zizheng Pan
Bohan Zhuang
Jing Liu
Haoyu He
Jianfei Cai
ViT
93
130
0
19 Mar 2021
UNETR: Transformers for 3D Medical Image Segmentation
Ali Hatamizadeh
Yucheng Tang
Vishwesh Nath
Dong Yang
Andriy Myronenko
Bennett Landman
H. Roth
Daguang Xu
ViT, MedIm
196
1,631
0
18 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
Chong Chen
Zhengming Ding
ViT
170
456
0
18 Mar 2021
Consistency-based Active Learning for Object Detection
Weiping Yu
Sijie Zhu
Taojiannan Yang
Chong Chen
ObjD
86
51
0
18 Mar 2021
Suppress-and-Refine Framework for End-to-End 3D Object Detection
Zili Liu
Guodong Xu
Honghui Yang
Minghao Chen
Kuoliang Wu
Zheng Yang
Haifeng Liu
Deng Cai
3DPC
51
4
0
18 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
116
396
0
14 Mar 2021
Probabilistic two-stage detection
Xingyi Zhou
V. Koltun
Philipp Krahenbuhl
ObjD
103
226
0
12 Mar 2021
Spatially Consistent Representation Learning
Byungseok Roh
Wuhyun Shin
Ildoo Kim
Sungwoong Kim
SSL
67
90
0
10 Mar 2021
TransMed: Transformers Advance Multi-modal Medical Image Classification
Yin Dai
Yifan Gao
ViT, MedIm
103
294
0
10 Mar 2021
SSTN: Self-Supervised Domain Adaptation Thermal Object Detection for Autonomous Driving
Farzeen Munir
Shoaib Azam
M. Jeon
ViT
75
37
0
04 Mar 2021