2010.04159
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 911 papers shown
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yuyao Zhang
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
31
0
0
14 May 2025
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
Mingqian Ji
Jian Yang
Shanshan Zhang
3DPC
MDE
40
0
0
12 May 2025
Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection
Zhengyang Lu
Bingjie Lu
Weifan Wang
Feng Wang
31
0
0
11 May 2025
Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search
XiaoTong Gu
Shengyu Tang
Yiming Cao
Changdong Yu
ViT
31
0
0
10 May 2025
RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation
Zhiwen Zeng
Yunfei Yin
Zheng Yuan
Argho Dey
Xianjian Bao
31
0
0
10 May 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
40
0
0
09 May 2025
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Yiming Niu
Jinliang Deng
L. Zhang
Zimu Zhou
Yongxin Tong
AI4TS
26
0
0
09 May 2025
A Simple Detector with Frame Dynamics is a Strong Tracker
Chenxu Peng
C. Wang
Minrui Zou
Danyang Li
Z. Yang
Yimian Dai
Ming-Ming Cheng
Xiang Li
47
0
0
08 May 2025
Adaptive Contextual Embedding for Robust Far-View Borehole Detection
Xuesong Liu
Tianyu Hao
Emmett J. Ientilucci
41
0
0
08 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Yulin Chen
Zhuotao Tian
VLM
38
0
0
07 May 2025
MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection
Zhihao Zhang
Abhinav Kumar
Girish Chandar Ganesan
Xiaoming Liu
157
0
0
07 May 2025
Panoramic Out-of-Distribution Segmentation
Mengfei Duan
Kailun Yang
Y. Zhang
Yihong Cao
Fei Teng
Kai Luo
Jiaming Zhang
Zhiyong Li
Shutao Li
59
0
0
06 May 2025
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Boyuan Meng
Xinming Zhang
Peilin Li
Zhe Wu
Yiming Li
Wenkai Zhao
B. Yu
Hui-Liang Shen
ViT
93
0
0
02 May 2025
Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation
Dimitrios Dagdilelis
Panagiotis Grigoriadis
R. Galeazzi
3DPC
150
0
0
02 May 2025
XeMap: Contextual Referring in Large-Scale Remote Sensing Environments
Yongqian Li
Lu Si
Y. T. Hou
Chengaung Liu
Yangqiu Song
Hongjian Fang
Jingyang Zhang
79
0
0
30 Apr 2025
Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection
Siwei Wang
Zhiwei Chen
Liujuan Cao
Rongrong Ji
ObjD
69
0
0
29 Apr 2025
DG-DETR: Toward Domain Generalized Detection Transformer
Seongmin Hwang
Daeyoung Han
Moongu Jeon
ViT
65
0
0
28 Apr 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
Open-set Anomaly Segmentation in Complex Scenarios
Song Xia
Yi Yu
Henghui Ding
Wenhan Yang
S. Liu
Alex C. Kot
Xudong Jiang
DiffM
57
0
0
28 Apr 2025
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Zhimin Liao
Ping Wei
Shuaijia Chen
Haoxuan Wang
Ziyang Ren
103
0
0
28 Apr 2025
Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection
Brian K. S. Isaac-Medina
T. Breckon
OODD
150
0
0
25 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller
Alexander Braun
M. Keuper
59
0
0
25 Apr 2025
MS-Occ: Multi-Stage LiDAR-Camera Fusion for 3D Semantic Occupancy Prediction
Zhiqiang Wei
Lianqing Zheng
Jiaheng Liu
Tao Huang
Qing-Long Han
Wenwen Zhang
Fengdeng Zhang
27
0
0
22 Apr 2025
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection
YangChen Zeng
ViT
31
0
0
18 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
42
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Tao Luo
Xin Zhan
Junbo Chen
3DPC
44
0
0
17 Apr 2025
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity
Ranjan Sapkota
Rahul Harsha Cheppally
Ajay Sharda
Manoj Karkee
36
0
0
17 Apr 2025
Correcting Class Imbalances with Self-Training for Improved Universal Lesion Detection and Tagging
Alexander Shieh
T. Mathai
Jianfei Liu
Angshuman Paul
Ronald M. Summers
35
2
0
07 Apr 2025
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Kotaro Ikeda
Masanori Koyama
Jinzhe Zhang
Kohei Hayashi
Kenji Fukumizu
OT
142
0
0
04 Apr 2025
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Ahyun Seo
Minsu Cho
76
0
0
26 Mar 2025
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
Zhiqiang Zhang
J. Li
Zunnan Xu
Hanhui Li
Yiji Cheng
Fa-Ting Hong
Qin Lin
Qinglin Lu
Xiaodan Liang
DiffM
73
1
0
25 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Yao Hu
Yongchao Xu
ObjD
47
0
0
24 Mar 2025
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
Xiyue Guo
Jiarui Hu
Junjie Hu
Hujun Bao
Guofeng Zhang
50
0
0
21 Mar 2025
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Zeqi Zheng
Yanchen Huang
Yingchao Yu
Zizheng Zhu
Junfeng Tang
Zhaofei Yu
Yaochu Jin
39
0
0
20 Mar 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
55
0
0
20 Mar 2025
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection
Qiang Qi
Xiao Wang
ViT
163
0
0
18 Mar 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
73
0
0
18 Mar 2025
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
62
0
0
17 Mar 2025
8-Calves Image dataset
Xuyang Fang
S. Hannuna
Neill D. F. Campbell
121
0
0
17 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
164
0
0
14 Mar 2025
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Fanqi Pu
Yifan Wang
Jiru Deng
Wenming Yang
MDE
ViT
59
2
0
13 Mar 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Lizhen Xu
Xiuxiu Bai
Xiaojun Jia
Jianwu Fang
Shanmin Pang
63
0
0
13 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
Xianrui Li
Jason Kuen
H. Chen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe-nan Lin
Marios Savvides
62
0
0
11 Mar 2025
SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements
Haiyang Xie
Xi Shen
Shihua Huang
Qirui Wang
Zheng Wang
44
0
0
10 Mar 2025
Omnidirectional Multi-Object Tracking
Kai Luo
Hao-miao Shi
Sheng Wu
Fei Teng
Mengfei Duan
Chang Huang
Yixuan Wang
Kaiwei Wang
Kailun Yang
42
0
0
06 Mar 2025
Fractional Correspondence Framework in Detection Transformer
Masoumeh Zareapoor
Pourya Shamsolmoali
Huiyu Zhou
Yue Lu
Salvador García
55
0
0
06 Mar 2025
A lightweight model FDM-YOLO for small target improvement based on YOLOv8
Xuerui Zhang
ObjD
53
0
0
06 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
73
0
0
04 Mar 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
68
1
0
27 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
L. Zhang
Philip H. S. Torr
79
4
0
24 Feb 2025