arXiv: 2201.12329
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
28 January 2022
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
Papers citing
"DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"
50 / 389 papers shown
Title
Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search
XiaoTong Gu
Shengyu Tang
Yiming Cao
Changdong Yu
ViT
29
0
0
10 May 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
40
0
0
09 May 2025
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
Zhangchi Hu
Peixi Wu
Jie Chen
Huyue Zhu
Yijun Wang
Yansong Peng
H. Li
X. Sun
46
0
0
09 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Y. Chen
Zhuotao Tian
VLM
38
0
0
07 May 2025
Dynamic Robot Tool Use with Vision Language Models
Noah Trupin
Zixing Wang
A. H. Qureshi
37
0
0
02 May 2025
Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection
Daniel Bogdoll
Rajanikant Ananta
Abeyankar Giridharan
Isabel Moore
Gregory Stevens
Henry X. Liu
VLM
51
0
0
30 Apr 2025
Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection
Siwei Wang
Zhiwei Chen
Liujuan Cao
Rongrong Ji
ObjD
69
0
0
29 Apr 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching
Heng Liu
Guanghui Li
Mingqi Gao
Xiantong Zhen
Feng Zheng
Y. Wang
VOS
48
0
0
18 Apr 2025
SO-DETR: Leveraging Dual-Domain Features and Knowledge Distillation for Small Object Detection
Huaxiang Zhang
Hao Zhang
Aoran Mei
Zhongxue Gan
Guo-Niu Zhu
30
0
0
11 Apr 2025
WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer
Huilin Yin
Pengyu Wang
Senmao Li
Jun Yan
Daniel Watzenig
31
0
0
10 Apr 2025
A Modular Energy Aware Framework for Multicopter Modeling in Control and Planning Applications
Sebastian Gasche
Christian Kallies
Andreas Himmel
R. Findeisen
36
0
0
04 Apr 2025
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
Chang-Bin Zhang
Jinhong Ni
Yujie Zhong
Kai Han
3DV
VLM
69
0
0
02 Apr 2025
Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians
Jiamin Wu
Hongyang Li
Xiaoke Jiang
Yuan Yao
Lei Zhang
3DGS
51
0
0
01 Apr 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
55
0
0
27 Mar 2025
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction
Jan Kohút
Martin Dočekal
Michal Hradiš
Marek Vaško
37
0
0
25 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Yao Hu
Yongchao Xu
ObjD
47
0
0
24 Mar 2025
Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning
Yufei Zhan
Yousong Zhu
Shurong Zheng
Hongyin Zhao
Fan Yang
Ming Tang
J. T. Wang
VLM
67
3
0
23 Mar 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
55
0
0
20 Mar 2025
Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images
Nobuhiko Wakai
Satoshi Sato
Yasunori Ishii
Takayoshi Yamashita
61
0
0
18 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
59
0
0
18 Mar 2025
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
Chiara Cappellino
G. Mancusi
Matteo Mosconi
Angelo Porrello
Simone Calderara
Rita Cucchiara
ObjD
VLM
86
0
0
12 Mar 2025
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia
Junqi You
Zhiyuan Zhang
Junchi Yan
44
5
0
07 Mar 2025
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism
Zhixiong Nan
Xianghong Li
Jifeng Dai
Tao Xiang
46
0
0
03 Mar 2025
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
Elkhan Ismayilzada
MD Khalequzzaman Chowdhury Sayem
Yihalem Yimolal Tiruneh
Mubarrat Chowdhury
Muhammadjon Boboev
Seungryul Baek
ViT
71
1
0
27 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Ming-Ming Cheng
ObjD
153
51
0
21 Feb 2025
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
Qizhen Lan
Qing Tian
50
0
0
15 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
125
0
0
11 Feb 2025
CSPCL: Category Semantic Prior Contrastive Learning for Deformable DETR-Based Prohibited Item Detectors
Mingyuan Li
Tong Jia
Hui Lu
Bowen Ma
Hao Wang
Dongyue Chen
72
0
0
28 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
42
1
0
18 Jan 2025
Siamese-DETR for Generic Multi-Object Tracking
Qiankun Liu
Yichen Li
Yuqi Jiang
Ying Fu
VOT
59
7
0
08 Jan 2025
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network
Zhaoxu Li
Wei An
Gaowei Guo
Longguang Wang
Yingqian Wang
Zaiping Lin
ViT
85
0
0
03 Jan 2025
To Predict or Not To Predict? Proportionally Masked Autoencoders for Tabular Data Imputation
Jungkyu Kim
Kibok Lee
Taeyoung Park
38
1
0
26 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
81
1
0
13 Dec 2024
DEIM: DETR with Improved Matching for Fast Convergence
Shihua Huang
Zhichao Lu
Xiaodong Cun
Yongjun Yu
Xiao Zhou
Xi Shen
VLM
174
2
0
05 Dec 2024
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
Chetan Madan
Mayuna Gupta
Soumen Basu
Pankaj Gupta
Chetan Arora
85
0
0
30 Nov 2024
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video
Jinyuan Qu
Hongyang Li
Shilong Liu
Tianhe Ren
Zhaoyang Zeng
Lei Zhang
3DPC
72
1
0
27 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
J. T. Wang
93
1
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
84
0
0
25 Nov 2024
DT-LSD: Deformable Transformer-based Line Segment Detection
Sebastian Janampa
Marios Pattichis
ViT
71
0
0
20 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
45
0
0
16 Nov 2024
RETR: Multi-View Radar Detection Transformer for Indoor Perception
Ryoma Yataka
Adriano Cardace
P. Wang
P. Boufounos
R. Takahashi
44
1
0
15 Nov 2024
Multi-object Tracking by Detection and Query: an efficient end-to-end manner
Shukun Jia
Yichao Cao
Feng Yang
Xin Lu
Xiaobo Lu
VOT
34
0
0
09 Nov 2024
LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO
Yuchen Zheng
Yuxin Jing
Jufeng Zhao
Guangmang Cui
ObjD
34
0
0
01 Nov 2024
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
37
1
0
31 Oct 2024
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
Run Luo
Zikai Song
Longze Chen
Yunshui Li
Min Yang
Wei-Guo Yang
38
0
0
30 Oct 2024
Unbiased Regression Loss for DETRs
Edric
Ueta Daisuke
Kurokawa Yukimasa
Karlekar Jayashree
Sugiri Pranata
32
0
0
30 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model
Zhixiong Nan
Xianghong Li
Tao Xiang
Jifeng Dai
ISeg
43
0
0
22 Oct 2024
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models
Yufei Zhan
Hongyin Zhao
Yousong Zhu
Fan Yang
Ming Tang
Jinqiao Wang
MLLM
43
1
0
21 Oct 2024
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
Yansong Peng
Hebei Li
Peixi Wu
Yueyi Zhang
X. Sun
Feng Wu
36
13
0
17 Oct 2024