Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,533 papers shown
Title
RT-DEMT: A hybrid real-time acupoint detection model combining mamba and transformer
Shilong Yang
Qi Zang
Chulong Zhang
Lingfeng Huang
Yaoqin Xie
Mamba
210
1
0
16 Feb 2025
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
Qizhen Lan
Qing Tian
86
0
0
15 Feb 2025
Improving action segmentation via explicit similarity measurement
Kamel Aouaidjia
Wenhao Zhang
Aofan Li
Chongsheng Zhang
79
0
0
15 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
89
0
0
11 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
200
0
0
11 Feb 2025
Cell Nuclei Detection and Classification in Whole Slide Images with Transformers
Oscar Pina
Eduard Dorca
Verónica Vilaplana
72
0
0
10 Feb 2025
Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object Detection
Dongsu Song
Daehwa Ko
Jay Hoon Jung
AAML
100
0
0
10 Feb 2025
SMART: Advancing Scalable Map Priors for Driving Topology Reasoning
Junjie Ye
David Paz
Hengyuan Zhang
Yuliang Guo
Xinyu Huang
Henrik I. Christensen
Yue Wang
Liu Ren
LRM
167
2
0
06 Feb 2025
Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting
Keyi Zhu
Jiajia Li
Kaixiang Zhang
Chaaran Arunachalam
Siddhartha Bhattacharya
R. Lu
Zhaojian Li
183
0
0
03 Feb 2025
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification
Lanyun Zhu
Tianrun Chen
Deyi Ji
Jieping Ye
Jing Liu
156
2
0
28 Jan 2025
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Jianing Li
Ming Lu
Hao Wang
Chenyang Gu
Wenzhao Zheng
Li Du
Shanghang Zhang
192
0
0
28 Jan 2025
CSPCL: Category Semantic Prior Contrastive Learning for Deformable DETR-Based Prohibited Item Detectors
Mingyuan Li
Tong Jia
Hui Lu
Bowen Ma
Hao Wang
Dongyue Chen
140
0
0
28 Jan 2025
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Xiangyu Gao
Yu Dai
Benliu Qiu
Hongliang Li
Heqian Qiu
Hongliang Li
ObjD
VLM
461
0
0
28 Jan 2025
Object Detection for Medical Image Analysis: Insights from the RT-DETR Model
Weijie He
Yuwei Zhang
T. Xu
Tai An
Yingbin Liang
Bo Zhang
PINN
MU
MedIm
64
8
0
27 Jan 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
VOS
VGen
112
1
0
23 Jan 2025
A generalizable 3D framework and model for self-supervised learning in medical imaging
Tony Xu
Sepehr Hosseini
Chris Anderson
Anthony Rinaldi
Rahul G. Krishnan
Anne L. Martel
Maged Goubran
MedIm
162
3
0
20 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
136
2
0
18 Jan 2025
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Cuixin Yang
Rongkang Dong
Jun Xiao
Cong Zhang
Kin-Man Lam
Fei Zhou
Guoping Qiu
200
2
0
17 Jan 2025
BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos
Farnoosh Koleini
Muhammad Usama Saleem
Pu Wang
Hongfei Xue
Ahmed Helmy
Abbey Fenwick
3DH
111
1
0
14 Jan 2025
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
Varun Biyyala
Bharat Chanderprakash Kathuria
Jialu Li
Youshan Zhang
112
0
0
13 Jan 2025
TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations
Daniel Steininger
Julia Simon
Andreas Trondl
Markus Murschitz
62
1
0
13 Jan 2025
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Zhimeng Xin
Tianxu Wu
Shiming Chen
Shuo Ye
Zijing Xie
Yixiong Zou
Xinge You
Yufei Guo
50
0
0
13 Jan 2025
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
Ji Soo Lee
Jongha Kim
Jeehye Na
Jinyoung Park
H. Kim
VGen
58
2
0
12 Jan 2025
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
Hengyuan Zhang
David Paz
Yuliang Guo
Xinyu Huang
Henrik I. Christensen
Liu Ren
3DGS
ViT
93
1
0
11 Jan 2025
UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping
Yanjie Li
Wenxuan Zhang
K. Liang
Bin Xiao
AAML
101
3
0
10 Jan 2025
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
Duc-Hai Pham
Duc Dung Nguyen
Anh Pham
Ho Lai Tuan
P. Nguyen
Khoi Duc Minh Nguyen
Rang Nguyen
3DPC
168
1
0
10 Jan 2025
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features
Ruochen Zhang
Hyeung-Sik Choi
Dongwook Jung
Phan Huy Nam Anh
Sang-Ki Jeong
Zihao Zhu
3DPC
MDE
77
0
0
08 Jan 2025
Siamese-DETR for Generic Multi-Object Tracking
Qiankun Liu
Yichen Li
Yuqi Jiang
Ying Fu
VOT
118
9
0
08 Jan 2025
Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves
Madeleine Darbyshire
Elizabeth I. Sklar
Simon Parsons
138
0
0
03 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM
VLM
LRM
357
59
0
03 Jan 2025
Open-Set Object Detection By Aligning Known Class Representations
Hiran Sarkar
Vishal M. Chudasama
N. Onoe
Pankaj Wasnik
Vineeth N. Balasubramanian
ObjD
107
5
0
31 Dec 2024
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model
Yi-Chia Chen
Wei-Hua Li
Chu-Song Chen
VLM
84
1
0
25 Dec 2024
Evaluating the Adversarial Robustness of Detection Transformers
A. Nazeri
Chunheng Zhao
P. Pisu
AAML
112
1
0
25 Dec 2024
Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper
Helia Mohamadi
Mohammad Ali Keyvanrad
Mohammad Reza Mohammadi
99
0
0
23 Dec 2024
Towards Unsupervised Model Selection for Domain Adaptive Object Detection
Hengfu Yu
Jinhong Deng
Wen Li
Lixin Duan
121
0
0
23 Dec 2024
NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors
Ziqi Zhou
Bowen Li
Yufei Song
Zhifei Yu
Shengshan Hu
Wei Wan
L. Zhang
Dezhong Yao
Hai Jin
AAML
177
2
0
22 Dec 2024
ImagineMap: Enhanced HD Map Construction with SD Maps
Yishen Ji
Zhiqi Li
Tong Lu
148
1
0
22 Dec 2024
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression
Shaofei Huang
Zhenwei Shen
Zehao Huang
Yue Liao
Jizhong Han
Naiyan Wang
Si Liu
177
2
0
22 Dec 2024
Object Detection Approaches to Identifying Hand Images with High Forensic Values
Thanh Thi Nguyen
Campbell Wilson
Imad Khan
Janis Dalins
3DH
113
0
0
21 Dec 2024
LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer
Yipeng Zhang
Yi Liu
Zonghao Guo
Yidan Zhang
Xuesong Yang
...
Yuan Yao
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
Maosong Sun
MLLM
VLM
147
0
0
18 Dec 2024
Differential Alignment for Domain Adaptive Object Detection
Xinyu He
Xinhui Li
Xiaojie Guo
132
1
0
17 Dec 2024
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
Wen Liu
Xinyu Wang
3DGS
ViT
194
10
0
17 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
143
2
0
16 Dec 2024
CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector
Tianheng Qiu
Ka Lung Law
Guanghua Pan
Jufei Wang
Xin Gao
Xuan Huang
Hu Wei
132
0
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
153
1
0
16 Dec 2024
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
Jin-Cheng Jhang
Tao Tu
Fu-En Wang
Ke Zhang
Min Sun
Cheng-Hao Kuo
119
2
0
16 Dec 2024
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos
Qingyu Xu
Longguang Wang
Weidong Sheng
Yingqian Wang
Chao Xiao
Chao Ma
Wei An
VOT
151
8
0
14 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
132
0
0
14 Dec 2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Hong Chen
Zihan Wang
Xianrui Li
Xingwu Sun
Fangyi Chen
Jiang Liu
Jiadong Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
277
10
0
14 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
145
1
0
13 Dec 2024
Previous
1
2
3
4
5
6
...
49
50
51
Next