Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.03605
Cited By
v1
v2
v3
v4 (latest)
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (2506★)
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
42 / 742 papers shown
Title
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
180
699
0
10 Nov 2022
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
Qiang Chen
Jian Wang
Chuchu Han
Shangang Zhang
Zexian Li
...
Haocheng Feng
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
ViT
VLM
92
45
0
07 Nov 2022
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency
Yang Liu
Yao Zhang
Yixin Wang
Yang Zhang
Jiang Tian
Zhongchao Shi
Jianping Fan
Zhiqiang He
109
14
0
03 Nov 2022
State-of-the-art Models for Object Detection in Various Fields of Application
S. A. G. Naqvi
Syed Shahnawaz Ali
ObjD
OOD
125
0
0
01 Nov 2022
Transformers For Recognition In Overhead Imagery: A Reality Check
Francesco Luzi
Aneesh Gupta
L. Collins
Kyle Bradbury
Jordan M. Malof
ViT
79
4
0
23 Oct 2022
RLM-Tracking: Online Multi-Pedestrian Tracking Supported by Relative Location Mapping
Kai Ren
Chuanping Hu
51
2
0
19 Oct 2022
1st Place Solutions for the UVO Challenge 2022
Jiajun Zhang
Boyu Chen
Zhilong Ji
Jinfeng Bai
Zonghai Hu
86
1
0
18 Oct 2022
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu
Yixing Lao
Li Jiang
Xihui Liu
Hengshuang Zhao
3DPC
ViT
171
407
0
11 Oct 2022
A Closer Look at Robustness to L-infinity and Spatial Perturbations and their Composition
Luke Rowe
Benjamin Thérien
Krzysztof Czarnecki
Hongyang R. Zhang
OOD
58
0
0
05 Oct 2022
FQDet: Fast-converging Query-based Detector
Cédric Picron
Punarjay Chakravarty
Tinne Tuytelaars
ObjD
108
2
0
05 Oct 2022
Long-Term Localization using Semantic Cues in Floor Plan Maps
Nicky Zimmerman
Tiziano Guadagnino
Xieyuanli Chen
Jens Behley
C. Stachniss
64
25
0
04 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng Zhang
Chao Zhang
Hanhua Hu
117
31
0
03 Oct 2022
Physical Adversarial Attack meets Computer Vision: A Decade Survey
Hui Wei
Hao Tang
Xuemei Jia
Zhixiang Wang
Han-Bing Yu
Zhubo Li
Shiníchi Satoh
Luc Van Gool
Zheng Wang
AAML
150
56
0
30 Sep 2022
Motion Transformer with Global Intention Localization and Local Movement Refinement
Shaoshuai Shi
Li Jiang
Dengxin Dai
Bernt Schiele
108
244
0
27 Sep 2022
CenterFormer: Center-based Transformer for 3D Object Detection
Zixiang Zhou
Xian Zhao
Yu Wang
Panqu Wang
H. Foroosh
3DPC
ViT
100
141
0
12 Sep 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
176
151
0
12 Sep 2022
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition
Thanh-Dat Truong
C. Duong
Ngan Le
Marios Savvides
Khoa Luu
CVBM
105
9
0
11 Sep 2022
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang
Zhipeng Luo
Zichen Tian
Yingchen Yu
Jingyi Zhang
Shijian Lu
101
29
0
24 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLM
VLM
ViT
157
647
0
22 Aug 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
ViT
117
256
0
28 Jul 2022
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion
Gongjie Zhang
Zhipeng Luo
Jiaxing Huang
Shijian Lu
Eric Xing
ViT
98
21
0
28 Jul 2022
PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations
Á. Utasi
49
0
0
28 Jul 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
145
135
0
26 Jul 2022
DETRs with Hybrid Matching
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
69
200
0
26 Jul 2022
Efficient Graph-Friendly COCO Metric Computation for Train-Time Model Evaluation
Luke Wood
François Chollet
28
7
0
21 Jul 2022
Focused Decoding Enables 3D Anatomical Detection by Transformers
Bastian Wittmann
Fernando Navarro
Suprosanna Shit
Bjoern Menze
ViT
MedIm
76
10
0
21 Jul 2022
Exploring Contextual Relationships for Cervical Abnormal Cell Detection
Yixiong Liang
Shuo Feng
Qing Liu
Hulin Kuang
Jianfeng Liu
Liyan Liao
Yun Du
Jianxin Wang
80
13
0
11 Jul 2022
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Chien-Yao Wang
Alexey Bochkovskiy
H. Liao
ObjD
181
6,668
0
06 Jul 2022
Global Context Vision Transformers
Ali Hatamizadeh
Hongxu Yin
Greg Heinrich
Jan Kautz
Pavlo Molchanov
ViT
82
129
0
20 Jun 2022
A Multi-task Framework for Infrared Small Target Detection and Segmentation
Yuhang Chen
Liyuan Li
Xin Liu
Xiaofeng Su
Fansheng Chen
60
48
0
14 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
152
394
0
06 Jun 2022
Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token Alignment
Jinhong Deng
Xiaoyue Zhang
Wen Li
Lixin Duan
92
9
0
01 Jun 2022
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei
Han Hu
Zhenda Xie
Zheng Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
CLIP
158
128
0
27 May 2022
An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers
Gokul Karthik Kumar
Sahal Shaji Mullappilly
Abhishek Singh Gehlot
ViT
54
1
0
11 May 2022
Improving Transferability for Domain Adaptive Detection Transformers
Kaixiong Gong
Shuang Li
Shugang Li
Rui Zhang
Chi Harold Liu
Qiang Chen
135
36
0
29 Apr 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
111
281
0
22 Mar 2022
A Quality Index Metric and Method for Online Self-Assessment of Autonomous Vehicles Sensory Perception
Ce Zhang
A. Eskandarian
78
8
0
04 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
Feng Li
Hao Zhang
Shi-guang Liu
Jian Guo
L. Ni
Lei Zhang
ViT
199
692
0
02 Mar 2022
Context Autoencoder for Self-Supervised Representation Learning
Xiaokang Chen
Mingyu Ding
Xiaodi Wang
Ying Xin
Shentong Mo
Yunhao Wang
Shumin Han
Ping Luo
Gang Zeng
Jingdong Wang
SSL
206
402
0
07 Feb 2022
CPPE-5: Medical Personal Protective Equipment Dataset
Rishit Dagli
A. Shaikh
89
12
0
15 Dec 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
195
356
0
11 Nov 2021
YOLO9000: Better, Faster, Stronger
Joseph Redmon
Ali Farhadi
VLM
ObjD
342
15,695
0
25 Dec 2016
Previous
1
2
3
...
13
14
15