2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,533 papers shown
Exploring Contextual Attribute Density in Referring Expression Counting
Zhicheng Wang
Zhiyu Pan
Zhan Peng
Jian Cheng
Liwen Xiao
Wei Jiang
Zhiguo Cao
76
0
0
16 Mar 2025
Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing
Boyu Chen
A. L. Solebo
Daqian Shi
Jinge Wu
Paul Taylor
121
0
0
15 Mar 2025
Active Learning from Scene Embeddings for End-to-End Autonomous Driving
Wenhao Jiang
Duo Li
Menghan Hu
Chao Ma
Ke Wang
Zhipeng Zhang
127
0
0
14 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
489
0
0
14 Mar 2025
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation
Hiroyasu Akada
Jian Wang
Vladislav Golyanik
Christian Theobalt
EgoV
116
0
0
14 Mar 2025
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Fanqi Pu
Yifan Wang
Jiru Deng
Wenming Yang
MDE
ViT
178
3
0
13 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjD
VLM
82
0
0
13 Mar 2025
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
Chiara Cappellino
Gianluca Mancusi
Matteo Mosconi
Angelo Porrello
Simone Calderara
Rita Cucchiara
ObjD
VLM
176
0
0
12 Mar 2025
Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis
N. Islam
Dongao Ma
Jiaxuan Pang
Shivasakthi Senthil Velan
Michael B. Gotway
Jianming Liang
95
0
0
12 Mar 2025
SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection
Hyeongseok Son
Jia He
Seung-In Park
Ying Min
Yunhao Zhang
ByungIn Yoo
100
0
0
11 Mar 2025
Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model
Yufan Chen
Ching Ting Leung
Jianwei Sun
Yong Huang
Linyan Li
Hao Chen
Hanyu Gao
98
1
0
11 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
Xianrui Li
Jason Kuen
Hong Chen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe Lin
Marios Savvides
156
2
0
11 Mar 2025
LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction
Kangan Qian
Jinyu Miao
Ziang Luo
Zheng Fu
Jinchen Li
Yining Shi
Yunlong Wang
Kun Jiang
Mengmeng Yang
Ke Wang
117
1
0
10 Mar 2025
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
VLM
ObjD
134
6
0
10 Mar 2025
SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements
Haiyang Xie
Xi Shen
Shihua Huang
Qirui Wang
Zheng Wang
114
0
0
10 Mar 2025
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction
Zongzheng Zhang
Xinrun Li
Sizhe Zou
Guoxuan Chi
Siqi Li
...
Guoliang Wang
Guantian Zheng
Leichen Wang
Hang Zhao
Hao Zhao
145
0
0
10 Mar 2025
Removing Multiple Hybrid Adverse Weather in Video via a Unified Model
Yecong Wan
Mingwen Shao
Yuanshuo Cheng
Jun Shu
Shuigen Wang
89
0
0
08 Mar 2025
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection
Yifan Chang
Junjie Huang
Xiaofeng Wang
Yun Ye
Zhujin Liang
Yi Shan
Dalong Du
Xingang Wang
3DPC
142
0
0
08 Mar 2025
FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework
Haotian Hu
Jingwei Xu
Fanyi Wang
Toyota Li
Yaonong Wang
Laifeng Hu
Zhiwang Zhang
77
1
0
07 Mar 2025
Fractional Correspondence Framework in Detection Transformer
Masoumeh Zareapoor
Pourya Shamsolmoali
Huiyu Zhou
Yue Lu
Salvador García
86
0
0
06 Mar 2025
Omnidirectional Multi-Object Tracking
Kai Luo
Hao-miao Shi
Sheng Wu
Fei Teng
Mengfei Duan
Chang Huang
Yansen Wang
Kaiwei Wang
Kailun Yang
169
1
0
06 Mar 2025
A lightweight model FDM-YOLO for small target improvement based on YOLOv8
Xuerui Zhang
ObjD
83
0
0
06 Mar 2025
Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism
Ziyue Zhao
Qining Qi
Jianfa Ma
73
0
0
06 Mar 2025
Prediction of Frozen Region Growth in Kidney Cryoablation Intervention Using a 3D Flow-Matching Model
Siyeop Yoon
Y. Oh
Matthew Tivnan
S. Song
Pengfei Jin
Sekeun Kim
Hyun Jin Cho
Dufan Wu
Raul Uppot
Quanzheng Li
93
0
0
06 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
119
2
0
05 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
188
0
0
04 Mar 2025
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Boyong He
Yuxiang Ji
Qianwen Ye
Zhuoyue Tan
Liaoni Wu
DiffM
160
0
0
03 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Pandeng Li
Yun Zheng
Liwei Wang
ObjD
VLM
130
1
0
03 Mar 2025
Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR
Muhammad Musab Ansari
53
0
0
03 Mar 2025
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism
Zhixiong Nan
Xianghong Li
Jifeng Dai
Tao Xiang
133
0
0
03 Mar 2025
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
ObjD
VLM
73
0
0
28 Feb 2025
SAC-ViT: Semantic-Aware Clustering Vision Transformer with Early Exit
Youbing Hu
Yun Cheng
Anqi Lu
Dawei Wei
Zhijun Li
68
0
0
27 Feb 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
117
2
0
27 Feb 2025
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
Elkhan Ismayilzada
MD Khalequzzaman Chowdhury Sayem
Yihalem Yimolal Tiruneh
Mubarrat Chowdhury
Muhammadjon Boboev
Seungryul Baek
ViT
132
1
0
27 Feb 2025
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation
Mingjie Wu
Chenggui Yang
Huihua Wang
Chen Xue
Yibo Wang
...
Yuqi Han
R. Li
Lijun Yun
Zaiqing Chen
Siyang Song
172
0
0
27 Feb 2025
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration
X. J. Yang
Jing Liu
Peng Wang
Guoqing Wang
Yue Yang
Jikang Cheng
ObjD
191
0
0
27 Feb 2025
CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
Ziyi Wang
Shaocong Xu
Xucai Zhuang
Tongda Xu
Yan Wang
Qingbin Liu
Yilun Chen
Yuanhang Zhang
154
1
0
26 Feb 2025
ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
Qihang Peng
Henry Zheng
Gao Huang
3DPC
119
0
0
26 Feb 2025
Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads
Istiaq Ahmed Fahad
Abdullah Ibne Hanif Arean
Nazmus Sakib Ahmed
Mahmudul Hasan
ViT
61
2
0
25 Feb 2025
Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
Xin Tong
Shi Peng
Baojie Tian
Yufei Guo
Xuhui Huang
Zhe Ma
ViT
93
1
0
25 Feb 2025
Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks
Tianyou Jiang
Mingshun Shao
Tianyi Zhang
Xiaoyu Liu
Qun Yu
107
0
0
24 Feb 2025
MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition
Paul Koch
Marian Schluter
Jörg Krüger
142
0
0
24 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
Lefei Zhang
Philip H. S. Torr
160
4
0
24 Feb 2025
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
Dewan Tauhid Rahman
Yeahia Sarker
Antar Mazumder
Md. Shamim Anower
ViT
61
0
0
24 Feb 2025
Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment
Zeyu Shangguan
Daniel Seita
Mohammad Rostami
ObjD
123
0
0
23 Feb 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
111
5
0
22 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Ming-Ming Cheng
ObjD
292
58
0
21 Feb 2025
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Xinyi Ying
Chao Xiao
Ruojing Li
Xu He
Boyang Li
...
Miao Li
Shilin Zhou
Wei An
Weidong Sheng
Li Liu
215
7
0
21 Feb 2025
MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding
Weikang Qiu
Zheng Huang
Haoyu Hu
Aosong Feng
Yujun Yan
Rex Ying
97
0
0
18 Feb 2025
GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs
Zhao Zhang
Ziwei Zhao
Dong Wang
Liwei Wang
MedIm
141
1
0
17 Feb 2025