Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.03605
Cited By
v1
v2
v3
v4 (latest)
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (2506★)
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
50 / 742 papers shown
Title
Break Stylistic Sophon: Are We Really Meant to Confine the Imagination in Style Transfer?
Gary Song Yan
Yusen Zhang
Jinyu Zhao
Hao Zhang
Zhangping Yang
...
Tao Zhang
Yujie He
Siyuan Tian
Yao Gou
Min Li
DiffM
63
0
0
18 Jun 2025
FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception
Mingyuan Li
Tong Jia
Han Gu
Hui Lu
Hao Wang
Bowen Ma
Shuyang Lin
Shiyi Guo
Shizhuo Deng
Dongyue Chen
34
0
0
16 Jun 2025
DETRPose: Real-time end-to-end transformer model for multi-person pose estimation
Sebastian Janampa
Marios Pattichis
ViT
20
0
0
16 Jun 2025
Multiple Object Tracking in Video SAR: A Benchmark and Tracking Baseline
Haoxiang Chen
Wei Zhao
Rufei Zhang
Nannan Li
Dongjin Li
17
0
0
13 Jun 2025
Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection
Duc Thanh Pham
Hong Dang Nguyen
Nhat Minh Nguyen Quoc
Linh Ngo Van
Sang Dinh Viet
Duc Anh Nguyen
ViT
51
0
0
10 Jun 2025
Robust sensor fusion against on-vehicle sensor staleness
Meng Fan
Yifan Zuo
Patrick Blaes
Harley Montgomery
Subhasis Das
53
0
0
06 Jun 2025
Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration
Fanhu Zeng
Deli Yu
Zhenglun Kong
Hao Tang
ViT
63
1
0
06 Jun 2025
BiXFormer: A Robust Framework for Maximizing Modality Effectiveness in Multi-Modal Semantic Segmentation
Jialei Chen
Xu Zheng
Danda Pani Paudel
Luc Van Gool
Hiroshi Murase
Daisuke Deguchi
91
0
0
04 Jun 2025
Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection
Yechi Ma
Wei Hua
Shu Kong
66
0
0
03 Jun 2025
FDSG: Forecasting Dynamic Scene Graphs
Yi Yang
Yuren Cong
Hao Cheng
Bodo Rosenhahn
Michael Ying Yang
AI4TS
54
0
0
02 Jun 2025
SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion Models
Yule Zhu
Ping Liu
Zhedong Zheng
Wei Liu
28
0
0
31 May 2025
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models
Chenbin Pan
Wenbin He
Zhengzhong Tu
Liu Ren
LRM
VLM
75
0
0
29 May 2025
WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver Assistance Systems
Hao Wu
Junzhou Chen
Ronghui Zhang
Nengchao Lyu
Hongyu Hu
Yanyong Guo
Tony Z. Qiu
73
0
0
29 May 2025
ObjectClear: Complete Object Removal via Object-Effect Attention
Jixin Zhao
Shangchen Zhou
Zhouxia Wang
Peiqing Yang
Chen Change Loy
DiffM
69
0
0
28 May 2025
Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection
Guiping Cao
Wenjian Huang
X. Lan
Jianguo Zhang
D. Jiang
Yaowei Wang
ViT
47
0
0
28 May 2025
S2AFormer: Strip Self-Attention for Efficient Vision Transformer
Guoan Xu
Wenfeng Huang
Wenjing Jia
Jiamao Li
Guangwei Gao
Guo-Jun Qi
71
0
0
28 May 2025
VME: A Satellite Imagery Dataset and Benchmark for Detecting Vehicles in the Middle East and Beyond
Noora Al-Emadi
Ingmar Weber
Yin Yang
Ferda Ofli
54
0
0
28 May 2025
Improving Contrastive Learning for Referring Expression Counting
Kostas Triaridis
Panagiotis Kaliosis
E-Ro Nguyen
Aoxiang Fan
Hieu M. Le
Dimitris Samaras
SSL
67
0
0
28 May 2025
Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation
Hang Chen
Maoyuan Ye
Peng Yang
Haibin He
Juhua Liu
Bo Du
39
0
0
28 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
D. Jiang
ObjD
VLM
24
0
0
27 May 2025
StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation
Yi Wu
Lingting Zhu
Shengju Qian
Lei Liu
Wandi Qiao
Lequan Yu
Bin Li
72
0
0
26 May 2025
CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting
Lei Tian
Xiaomin Li
Liqian Ma
Hefei Huang
Zirui Zheng
Hao Yin
Taiqing Li
Huchuan Lu
Xu Jia
29
0
0
26 May 2025
Rethinking Features-Fused-Pyramid-Neck for Object Detection
Hulin Li
221
0
0
19 May 2025
CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World
Zoya Volovikova
G. Gorbov
Petr Kuderov
Aleksandr I. Panov
A. Skrynnik
95
0
0
17 May 2025
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
Yiwen Liu
Jessica Bader
Jae Myung Kim
DiffM
79
1
0
15 May 2025
EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
Hu Yue
Siyuan Huang
Yue Liao
Shengcong Chen
Pengfei Zhou
Liliang Chen
Maoqing Yao
Guanghui Ren
VGen
84
1
0
14 May 2025
VIViT: Variable-Input Vision Transformer Framework for 3D MR Image Segmentation
Badhan Kumar Das
Ajay Singh
Gengyan Zhao
Han Liu
Thomas J. Re
Dorin Comaniciu
Eli Gibson
Andreas Maier
ViT
MedIm
69
0
0
13 May 2025
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation
Panwen Hu
Jiehui Huang
Qiang Sun
Xiaodan Liang
DiffM
VGen
109
0
0
11 May 2025
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
Zhangchi Hu
Peixi Wu
Jie Chen
Huyue Zhu
Yijun Wang
Yansong Peng
Hong Li
Xingwu Sun
141
0
0
09 May 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
179
0
0
09 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
145
0
0
08 May 2025
A Simple Detector with Frame Dynamics is a Strong Tracker
Chenxu Peng
Changbo Wang
Minrui Zou
Danyang Li
Zhiyong Yang
Yimian Dai
Ming-Ming Cheng
Xiang Li
109
0
0
08 May 2025
Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders
Yuqiu Liu
Huanqian Yan
Xiaopei Zhu
Xiaolin Hu
L. Tang
Hang Su
Chen Lv
50
0
0
07 May 2025
X-ray illicit object detection using hybrid CNN-transformer neural network architectures
Jorgen Cani
Christos Diou
Spyridon Evangelatos
Panagiotis I. Radoglou-Grammatikis
Vasileios Argyriou
Panagiotis G. Sarigiannidis
Iraklis Varlamis
Georgios Th. Papadopoulos
71
0
0
01 May 2025
MolMole: Molecule Mining from Scientific Literature
LG AI Research
S. Chun
Jiye Kim
Ahra Jo
Yeonsik Jo
...
Sehui Han
Jaewan Lee
Changyoung Park
Kijeong Jeon
Sihyuk Yi
52
0
0
30 Apr 2025
Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection
Jianhong Han
Yupei Wang
Liang Chen
ViT
105
0
0
29 Apr 2025
DG-DETR: Toward Domain Generalized Detection Transformer
Seongmin Hwang
Daeyoung Han
Moongu Jeon
ViT
107
0
0
28 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller
Alexander Braun
Margret Keuper
109
0
0
25 Apr 2025
AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization
Jinda Lu
Jinghan Li
Yuan Gao
Junkang Wu
Jiancan Wu
Xiang Wang
Xiangnan He
417
1
0
22 Apr 2025
Context Aware Grounded Teacher for Source Free Object Detection
Tajamul Ashraf
Rajes Manna
Partha Sarathi Purkayastha
Tavaheed Tariq
Janibul Bashir
106
0
0
21 Apr 2025
SG-Reg: Generalizable and Efficient Scene Graph Registration
Chuhao Liu
Zhijian Qiao
Jieqi Shi
Ke Wang
Peize Liu
Shaojie Shen
132
0
0
20 Apr 2025
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection
YangChen Zeng
ViT
78
0
0
18 Apr 2025
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction
Yushen He
Lei Zhao
Tianchen Deng
Zipeng Fang
Weidong Chen
69
0
0
18 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
329
9
0
17 Apr 2025
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity
Ranjan Sapkota
Rahul Harsha Cheppally
Ajay Sharda
Manoj Karkee
100
0
0
17 Apr 2025
CM3AE: A Unified RGB Frame and Event-Voxel/-Frame Pre-training Framework
Wentao Wu
Xinyu Wang
Chenglong Li
Bo Jiang
Jin Tang
Bin Luo
Qi Liu
100
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Zhan Shi
Tao Luo
Xin Zhan
Junbo Chen
3DPC
84
0
0
17 Apr 2025
Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection
Qishun Wang
Zhengzheng Tu
Chenglong Li
Bo Jiang
VOS
112
0
0
16 Apr 2025
Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach
Lvpan Cai
Haowei Wang
Jiayi Ji
YanShu ZhouMen
Yiwei Ma
Xiaoshuai Sun
Liujuan Cao
Rongrong Ji
ViT
88
1
0
16 Apr 2025
Weather-Aware Object Detection Transformer for Domain Adaptation
Soheil Gharatappeh
Salimeh Yasaei Sekeh
Vikas Dhiman
ViT
73
0
0
15 Apr 2025
1
2
3
4
...
13
14
15
Next