Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,533 papers shown
Title
P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images
Tao Zhang
Shiqing Wei
Yikang Zhou
M. Luo
Wenling You
Shunping Ji
75
2
0
05 Jun 2024
DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images
Yimian Dai
Minrui Zou
Yuxuan Li
Xiang Li
Kang Ni
Jian Yang
68
5
0
05 Jun 2024
Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models
Samuel M. Bateman
Ning Xu
H. C. Zhao
Yael Ben Shalom
Vince Gong
Greg Long
Will Maddern
72
4
0
04 Jun 2024
A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving
Di Wu
Feng Yang
Benlian Xu
Pan Liao
Bo Liu
108
0
0
02 Jun 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li
Jiacheng Zhang
Jichang Li
Ge Li
Si Liu
Liang Lin
Guanbin Li
ObjD
VLM
114
14
0
01 Jun 2024
Research on an Autonomous UAV Search and Rescue System Based on the Improved
Haobin Chen
Junyu Tao
Bize Zhou
Xiaoyan Liu
43
0
0
01 Jun 2024
Task Planning for Object Rearrangement in Multi-room Environments
Karan Mirakhor
Sourav Ghosh
Dipanjan Das
Brojeshwar Bhowmick
54
1
0
01 Jun 2024
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
134
26
0
01 Jun 2024
Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding
Xiaolong Sun
Liushuai Shi
Le Wang
Sanpin Zhou
Kun Xia
Yabing Wang
Gang Hua
88
2
0
31 May 2024
DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models
Linli Yao
Lei Li
Shuhuai Ren
Lean Wang
Yuanxin Liu
Xu Sun
Lu Hou
76
34
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
89
5
0
30 May 2024
SurgiTrack: Fine-Grained Multi-Class Multi-Tool Tracking in Surgical Videos
C. Nwoye
N. Padoy
85
5
0
30 May 2024
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Qi Zhang
Yunfei Gong
Daijie Chen
Antoni B. Chan
Hui-dan Huang
79
4
0
30 May 2024
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
92
2
0
30 May 2024
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving
Yiming Cui
Cheng Han
Dongfang Liu
96
0
0
29 May 2024
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Yansong Qu
Shaohui Dai
Xinyang Li
Jianghang Lin
Liujuan Cao
Shengchuan Zhang
Rongrong Ji
112
23
0
27 May 2024
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Yuanhui Huang
Wenzhao Zheng
Yunpeng Zhang
Jie Zhou
Jiwen Lu
3DGS
110
48
0
27 May 2024
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Haoyu Zhao
Wenhang Ge
Ying-Cong Chen
ObjD
MLLM
VLM
88
5
0
27 May 2024
Efficient Visual Fault Detection for Freight Train via Neural Architecture Search with Data Volume Robustness
Yang Zhang
Mingying Li
Huilin Pan
Moyun Liu
Yang Zhou
54
0
0
27 May 2024
ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection
Ziying Song
Feiyang Jia
Hongyu Pan
Yadan Luo
Caiyan Jia
Guoxin Zhang
Lin Liu
Yang Ji
Lei Yang
Li-e Wang
94
9
0
27 May 2024
Understanding differences in applying DETR to natural and medical images
Yanqi Xu
Yiqiu Shen
C. Fernandez‐Granda
Laura Heacock
Krzysztof J. Geras
MedIm
118
3
0
27 May 2024
ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Xudong Han
Nobuyuki Oishi
Yueying Tian
Elif Ucurum
R. Young
C. Chatwin
Philip Birch
82
5
0
24 May 2024
HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation
Zhuoyan Luo
Yinghao Wu
Yong-Jin Liu
Yicheng Xiao
Xiao-Ping Zhang
Yujiu Yang
97
0
0
24 May 2024
MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method
Pan Liao
Feng Yang
Di Wu
Liu Bo
56
1
0
24 May 2024
Label-efficient Semantic Scene Completion with Scribble Annotations
Song Wang
Jiawei Yu
Wentong Li
Hao Shi
Kailun Yang
Junbo Chen
Jianke Zhu
116
5
0
24 May 2024
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Weize Li
Zhicheng Zhao
Haochen Bai
Fei Su
122
0
0
24 May 2024
Diversifying Human Pose in Synthetic Data for Aerial-view Human Detection
Yingzhe Shen
Hyungtae Lee
Heesung Kwon
Shuvra S. Bhattacharyya
111
5
0
24 May 2024
TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes
Yanping Fu
Wenbin Liao
Xinyuan Liu
Hang Xu
Yike Ma
Feng Dai
Yucheng Zhang
LRM
98
11
0
23 May 2024
YOLOv10: Real-Time End-to-End Object Detection
Ao Wang
Hui Chen
Lihao Liu
Kai Chen
Zijia Lin
Jungong Han
Guiguang Ding
3DH
134
1,202
0
23 May 2024
RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar
Fangqiang Ding
Xiangyu Wen
Yunzhou Zhu
Yiming Li
Chris Xiaoxuan Lu
118
16
0
22 May 2024
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Zhuopu Yu
Runmin Zhang
Jiacheng Ying
Junchen Yu
Xiaohai Hu
Lun Luo
Siyuan Cao
Hui-Liang Shen
ViT
100
15
0
22 May 2024
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Xianpeng Liu
Ce Zheng
Ming Qian
Nan Xue
Chong Chen
Zhebin Zhang
Chen Li
Tianfu Wu
122
3
0
20 May 2024
DLAFormer: An End-to-End Transformer For Document Layout Analysis
Jiawei Wang
Kai Hu
Qiang Huo
3DV
ViT
71
3
0
20 May 2024
FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention
Ziang Guo
Zakhar Yagudin
Selamawit Asfaw
Artem Lykov
Dzmitry Tsetserukou
3DPC
88
0
0
19 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
110
32
0
19 May 2024
InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images
Wuzhou Li
Jiawei Zhou
Xiang Li
Yi Cao
Guanglu Jin
Xuemin Zhang
115
2
0
18 May 2024
Visible and Clear: Finding Tiny Objects in Difference Map
Bing Cao
Haiyu Yao
Pengfei Zhu
Qinghua Hu
ObjD
92
8
0
18 May 2024
Better Sampling, towards Better End-to-end Small Object Detection
Zile Huang
Chong Zhang
Mingyu Jin
Fangyu Wu
Chengzhi Liu
Xiaobo Jin
ObjD
112
1
0
17 May 2024
GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Xin Tan
Wenbin Wu
Zhiwei Zhang
Chaojie Fan
Yong Peng
Zhizhong Zhang
Yuan Xie
Lizhuang Ma
177
13
0
17 May 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
160
1
0
17 May 2024
Infrared Adversarial Car Stickers
Xiaopei Zhu
Yuqiu Liu
Zhan Hu
Jianmin Li
Xiaolin Hu
AAML
90
0
0
16 May 2024
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network
Zhaoxu Li
Wei An
Gaowei Guo
Longguang Wang
Yingqian Wang
Zaiping Lin
ViT
207
0
0
16 May 2024
Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening
Yan Kong
Sheng Wang
Jiangdong Cai
Zihao Zhao
Zhenrong Shen
Yonghao Li
Manman Fei
Qian Wang
93
4
0
15 May 2024
BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment
Lihong Jin
Wei Dong
Michael Kaess
82
3
0
14 May 2024
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment
Sunyuan Qiang
Xianfei Li
Yanyan Liang
Wenlong Liao
Tao He
Pai Peng
ObjD
81
0
0
14 May 2024
DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation
Ziang Cao
Fangzhou Hong
Tong Wu
Liang Pan
Ziwei Liu
79
3
0
13 May 2024
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
Xue-Qiu Jiang
Sheng Jin
Xiaoqin Zhang
Ling Shao
Shijian Lu
MDE
82
7
0
13 May 2024
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak
Yaroslav Romanus
Bohdan Hlovatskyi
Bohdan Sydor
Oles Dobosevych
Igor Babin
Roman Riazantsev
VOS
78
3
0
11 May 2024
Replication Study and Benchmarking of Real-Time Object Detection Models
Pierre-Luc Asselin
Vincent Coulombe
William Guimont-Martin
William Larrivée-Hardy
88
0
0
11 May 2024
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li
Xinyao Wang
Sijie Zhu
Chia-Wen Kuo
Lu Xu
Fan Chen
Jitesh Jain
Humphrey Shi
Longyin Wen
MLLM
MoE
100
33
0
09 May 2024
Previous
1
2
3
...
11
12
13
...
49
50
51
Next