ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.04159
  4. Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs)PDFHTMLGithub (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,533 papers shown
Title
Few-shot Personalized Scanpath Prediction
Few-shot Personalized Scanpath Prediction
Ruoyu Xue
Jingyi Xu
Sounak Mondal
Hieu Le
G. Zelinsky
Minh Hoai
D. Samaras
78
0
0
07 Apr 2025
Correcting Class Imbalances with Self-Training for Improved Universal Lesion Detection and Tagging
Correcting Class Imbalances with Self-Training for Improved Universal Lesion Detection and Tagging
Alexander Shieh
T. Mathai
Jianfei Liu
Angshuman Paul
Ronald M. Summers
103
2
0
07 Apr 2025
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
Jiancheng Pan
Yanxing Liu
Xiao He
Long Peng
Jiahao Li
Yuze Sun
Xiaomeng Huang
78
2
0
06 Apr 2025
Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAVTarget Detection
Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAVTarget Detection
Houzhang Fang
Xiaolin Wang
Zechao Li
Lu Wang
Qingshan Li
Yi Chang
Luxin Yan
53
0
0
05 Apr 2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
Xiao-Hui Li
Fei Yin
Cheng-Lin Liu
83
1
0
05 Apr 2025
A Modular Energy Aware Framework for Multicopter Modeling in Control and Planning Applications
A Modular Energy Aware Framework for Multicopter Modeling in Control and Planning Applications
Sebastian Gasche
Christian Kallies
Andreas Himmel
R. Findeisen
124
1
0
04 Apr 2025
ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving
ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving
Sheng Yang
Tong Zhan
Shichen Qiao
Jicheng Gong
Qing Yang
Jian Wang
Yanfeng Lu
3DPC
121
0
0
04 Apr 2025
Pairwise Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Pairwise Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Kotaro Ikeda
Masanori Koyama
Jinzhe Zhang
Kohei Hayashi
Kenji Fukumizu
OT
562
0
0
04 Apr 2025
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
Zhuguanyu Wu
Jiayi Zhang
Jiaxin Chen
Jinyang Guo
Di Huang
Yunhong Wang
MQ
120
1
0
03 Apr 2025
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
Chang-Bin Zhang
Jinhong Ni
Yujie Zhong
Kai Han
3DVVLM
177
0
0
02 Apr 2025
Adaptive Low Light Enhancement via Joint Global-Local Illumination Adjustment
Adaptive Low Light Enhancement via Joint Global-Local Illumination Adjustment
Haodian Wang
Yaqi Song
85
0
0
01 Apr 2025
CellVTA: Enhancing Vision Foundation Models for Accurate Cell Segmentation and Classification
CellVTA: Enhancing Vision Foundation Models for Accurate Cell Segmentation and Classification
Yang Yang
Xijie Xu
Yixun Zhou
Jie Zheng
ViT
78
0
0
01 Apr 2025
Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians
Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians
Jiamin Wu
Hongyang Li
Xiaoke Jiang
Yuan Yao
Lei Zhang
3DGS
152
0
0
01 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
124
0
0
31 Mar 2025
DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model
DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model
Ming Yuan
Sichao Wang
Chuang Zhang
Lei He
Qing Xu
Jianqiang Wang
DiffMMDE
74
0
0
31 Mar 2025
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
Hongxiang Jiang
Jihao Yin
Qixiong Wang
Jiaqi Feng
Guo Chen
98
1
0
30 Mar 2025
VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving
VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving
Haibo Hu
Jiacheng Zuo
Yang Lou
Yufei Cui
Jianping Wang
Nan Guan
Jin Wang
Yung-Hui Li
Chun Jason Xue
VLM
122
1
0
29 Mar 2025
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction
Xiaolu Liu
Ruizi Yang
Song Wang
Wentong Li
Jintai Chen
Jianke Zhu
90
0
0
29 Mar 2025
Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction
Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy Prediction
Seokha Moon
Janghyun Baek
Giseop Kim
Jinkyu Kim
Sunwook Choi
126
1
0
28 Mar 2025
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization
AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization
Martin Kiss
Michal Hradiš
Martina Dvořáková
Václav Jiroušek
Filip Kersch
101
1
0
28 Mar 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
92
0
0
27 Mar 2025
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
Junjie Chen
Weilong Chen
Yifan Zuo
Yuming Fang
87
0
0
27 Mar 2025
MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning
MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning
Jiancheng Zhao
Xingda Yu
Zhen Yang
MoE
84
3
0
27 Mar 2025
OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation
OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation
Mallika Garg
Debashis Ghosh
P. M. Pradhan
3DH
94
0
0
27 Mar 2025
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Ahyun Seo
Minsu Cho
124
0
0
26 Mar 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Chen Tang
Xinzhu Ma
Encheng Su
Xiufeng Song
Xiaohong Liu
Wei-Hong Li
Lei Bai
Wanli Ouyang
Xiangyu Yue
3DGSAI4TS
102
0
0
26 Mar 2025
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
Zhiqiang Zhang
Jia-Nan Li
Zunnan Xu
Hanhui Li
Yiji Cheng
Fa-Ting Hong
Qin Lin
Qinglin Lu
Xiaodan Liang
DiffM
138
2
0
25 Mar 2025
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction
Jan Kohút
Martin Dočekal
Michal Hradiš
Marek Vaško
85
0
0
25 Mar 2025
Your ViT is Secretly an Image Segmentation Model
Your ViT is Secretly an Image Segmentation Model
Tommie Kerssies
Niccolò Cavagnero
Alexander Hermans
Narges Norouzi
Giuseppe Averta
Bastian Leibe
Gijs Dubbelman
Daan de Geus
ViTVLM
123
5
0
24 Mar 2025
FG$^2$: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching
FG2^22: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching
Zimin Xia
Alexandre Alahi
128
2
0
24 Mar 2025
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Sara Al-Emadi
Yin Yang
Ferda Ofli
67
0
0
24 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Feng-Long Xie
Yongchao Xu
ObjD
126
0
0
24 Mar 2025
An Image-like Diffusion Method for Human-Object Interaction Detection
An Image-like Diffusion Method for Human-Object Interaction Detection
Xiaofei Hui
Haoxuan Qu
Hossein Rahmani
Jun Liu
DiffM
126
0
0
23 Mar 2025
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
Xiyue Guo
Jiarui Hu
Junjie Hu
Hujun Bao
Guofeng Zhang
112
1
0
21 Mar 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
117
0
0
20 Mar 2025
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Zeqi Zheng
Yanchen Huang
Yingchao Yu
Zizheng Zhu
Junfeng Tang
Zhaofei Yu
Yaochu Jin
85
0
0
20 Mar 2025
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
Keyan Chen
Chenyang Liu
Bowen Chen
Wenyuan Li
Zhengxia Zou
Zhenwei Shi
78
3
0
20 Mar 2025
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
Gyeongrok Oh
Sungjune Kim
Heeju Ko
Hyung-Gun Chi
J. Kim
Dongwook Lee
Daehyun Ji
Sungjoon Choi
Sujin Jang
Sangpil Kim
93
1
0
19 Mar 2025
Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images
Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images
Nobuhiko Wakai
Satoshi Sato
Yasunori Ishii
Takayoshi Yamashita
122
0
0
18 Mar 2025
Is Discretization Fusion All You Need for Collaborative Perception?
Is Discretization Fusion All You Need for Collaborative Perception?
Kang Yang
Tianci Bu
L. Li
Chunxu Li
Yanjie Wang
Deying Li
140
0
0
18 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
137
1
0
18 Mar 2025
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model
Xinqing Li
Ruiqi Song
Qingyu Xie
Ye Wu
Nanxin Zeng
Yunfeng Ai
VGenSyDa
103
2
0
18 Mar 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
133
0
0
18 Mar 2025
LipShiFT: A Certifiably Robust Shift-based Vision Transformer
LipShiFT: A Certifiably Robust Shift-based Vision Transformer
Rohan Menon
Nicola Franco
Stephan Günnemann
80
0
0
18 Mar 2025
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection
Qiang Qi
Xiao Wang
ViT
551
0
0
18 Mar 2025
8-Calves Image dataset
8-Calves Image dataset
Xuyang Fang
S. Hannuna
Neill D. F. Campbell
402
0
0
17 Mar 2025
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
T. Monninger
Md Zafar Anwar
Stanislaw Antol
Steffen Staab
Sihao Ding
86
0
0
17 Mar 2025
Action tube generation by person query matching for spatio-temporal action detection
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
153
0
0
17 Mar 2025
Exploring Contextual Attribute Density in Referring Expression Counting
Exploring Contextual Attribute Density in Referring Expression Counting
Zhicheng Wang
Zhiyu Pan
Zhan Peng
Jian Cheng
Liwen Xiao
Wei Jiang
Zhiguo Cao
76
0
0
16 Mar 2025
History-Aware Transformation of ReID Features for Multiple Object Tracking
History-Aware Transformation of ReID Features for Multiple Object Tracking
Ruopeng Gao
Yidan Wang
Chunxu Liu
Limin Wang
VOT
136
1
0
16 Mar 2025
Previous
123456...495051
Next