2010.04159
Cited By
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,533 papers shown
Title
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework
Silin Cheng
Yuanpei Liu
Kai Han
EDL
141
0
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
188
2
0
12 Dec 2024
GAQAT: gradient-adaptive quantization-aware training for domain generalization
Jiacheng Jiang
Yuan Meng
Chen Tang
Han Yu
Qun Li
Zhi Wang
Wenwu Zhu
MQ
75
0
0
07 Dec 2024
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
K. Hashmi
Talha Uddin Sheikh
Didier Stricker
Muhammad Zeshan Afzal
113
0
0
06 Dec 2024
Towards Real-Time Open-Vocabulary Video Instance Segmentation
Bin Yan
Martin Sundermeyer
D. Tan
Huchuan Lu
F. Tombari
VLM
VOS
152
2
0
05 Dec 2024
Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure
Saheli Hazra
Sudip Das
Rohit Choudhary
Arindam Das
Ganesh Sistu
Ciarán Eising
Ujjwal Bhattacharya
115
0
0
05 Dec 2024
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Lizhen Xu
Shanmin Pang
Wenzhao Qiu
Zehao Wu
Xiuxiu Bai
K. Mei
Jianru Xue
161
2
0
03 Dec 2024
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Xianrui Li
Kai Qiu
Hong Chen
Jason Kuen
Jiuxiang Gu
Jiadong Wang
Zhe Lin
Bhiksha Raj
VLM
213
9
0
02 Dec 2024
HandOS: 3D Hand Reconstruction in One Stage
Xingyu Chen
Zhuheng Song
Xiaoke Jiang
Yaoqing Hu
Junzhi Yu
Lei Zhang
3DH
HAI
190
0
0
02 Dec 2024
SyncVIS: Synchronized Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
137
0
0
01 Dec 2024
Explaining Object Detectors via Collective Contribution of Pixels
Toshinori Yamauchi
Hiroshi Kera
K. Kawamoto
ObjD
FAtt
113
1
0
01 Dec 2024
BGM: Background Mixup for X-ray Prohibited Items Detection
Wen Liu
R. Tao
Hongguang Zhu
Yunda Sun
Yao Zhao
Y. X. Wei
158
0
0
30 Nov 2024
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
Chetan Madan
Mayuna Gupta
Soumen Basu
Pankaj Gupta
Chetan Arora
172
0
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
538
1
0
30 Nov 2024
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
144
3
0
28 Nov 2024
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video
Jinyuan Qu
Hongyang Li
Shilong Liu
Tianhe Ren
Zhaoyang Zeng
Lei Zhang
3DPC
130
1
0
27 Nov 2024
Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Peng Cui
Guande He
Dan Zhang
Zhijie Deng
Yinpeng Dong
Jun Zhu
161
1
0
26 Nov 2024
Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation
Minh-Tuan Tran
Trung Le
Xuan-May Le
Jianfei Cai
Mehrtash Harandi
Dinh Q. Phung
139
2
0
26 Nov 2024
Edge Weight Prediction For Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
145
0
0
25 Nov 2024
GeoFormer: A Multi-Polygon Segmentation Transformer
Maxim Khomiakov
Michael Riis Andersen
J. Frellsen
105
1
0
25 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
Jinqiao Wang
192
1
0
25 Nov 2024
TreeFormer: Single-view Plant Skeleton Estimation via Tree-constrained Graph Generation
Xinpeng Liu
Hiroaki Santo
Yosuke Toda
Fumio Okura
109
0
0
25 Nov 2024
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training
Man Yao
Xuerui Qiu
Tianxiang Hu
J. Hu
Yuhong Chou
Keyu Tian
Jianxing Liao
Luziwei Leng
Bo Xu
Guoqi Li
144
16
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
165
1
0
25 Nov 2024
MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving
Hongsi Liu
Jun Liu
Guangfeng Jiang
Xin Jin
372
3
0
22 Nov 2024
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
Bencheng Liao
Shaoyu Chen
Haoran Yin
Bo Jiang
Cheng Wang
...
Xinbang Zhang
Xiangyu Li
Y. Zhang
Qian Zhang
Xinggang Wang
219
45
0
22 Nov 2024
DT-LSD: Deformable Transformer-based Line Segment Detection
Sebastian Janampa
Marios Pattichis
ViT
153
1
0
20 Nov 2024
RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model
Hongjun Chen
Wencheng Han
Huan Zheng
Jianbing Shen
Mamba
128
0
0
18 Nov 2024
Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation
Nayeon Kim
Hongje Seong
Daehyun Ji
Sujin Jang
73
2
0
17 Nov 2024
CCi-YOLOv8n: Enhanced Fire Detection with CARAFE and Context-Guided Modules
Kunwei Lv
Ruobing Wu
Suyang Chen
Ping Lan
175
4
0
17 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
137
0
0
16 Nov 2024
RETR: Multi-View Radar Detection Transformer for Indoor Perception
Ryoma Yataka
Adriano Cardace
Peng Wang
P. Boufounos
R. Takahashi
154
2
0
15 Nov 2024
Prompt-Guided Environmentally Consistent Adversarial Patch
Chaoqun Li
Huanqian Yan
Lifeng Zhou
Tairan Chen
Zhuodong Liu
Hang Su
DiffM
AAML
60
0
0
15 Nov 2024
Toward Robust and Accurate Adversarial Camouflage Generation against Vehicle Detectors
Jiawei Zhou
Linye Lyu
Daojing He
Yu Li
AAML
41
0
0
15 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
58
0
0
09 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
75
4
0
08 Nov 2024
OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
142
4
0
06 Nov 2024
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Yifan Wang
Xiaohu Yang
Fanqi Pu
Q. Liao
Wenming Yang
74
0
0
05 Nov 2024
VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Yiwei Zhang
Jin Gao
Fudong Ge
Guan Luo
Bing Li
Zheng Zhang
Haibin Ling
Weiming Hu
88
0
0
03 Nov 2024
FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing
Jitesh Joshi
Sos S. Agaian
Youngjun Cho
AI4TS
73
2
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
36
0
0
31 Oct 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
Nikita Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCV
EDL
3DPC
92
2
0
31 Oct 2024
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
86
2
0
31 Oct 2024
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
Run Luo
Zikai Song
Longze Chen
Yunshui Li
Min Yang
Wei-Guo Yang
88
0
0
30 Oct 2024
Unbiased Regression Loss for DETRs
Edric
Ueta Daisuke
Kurokawa Yukimasa
Karlekar Jayashree
Sugiri Pranata
66
0
0
30 Oct 2024
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
M. Hosseinzadeh
Ian Reid
60
1
0
28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Ajmal Mian
79
0
0
27 Oct 2024
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
VLM
106
3
0
25 Oct 2024
Prompting Continual Person Search
Pengcheng Zhang
Xiaohan Yu
Xiao Bai
Jin Zheng
Xin Ning
CLL
VLM
66
1
0
25 Oct 2024
DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT Integration for histopathological image analysis
Mahtab Ranjbar
Mehdi Mohebbi
Mahdi Cherakhloo
Bijan Vosoughi Vahdat
MedIm
83
1
0
24 Oct 2024