Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.03605
Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
50 / 720 papers shown
Title
Position-Aware Contrastive Alignment for Referring Image Segmentation
Bo Chen
Zhiwei Hu
Zhilong Ji
Jinfeng Bai
W. Zuo
33
8
0
27 Dec 2022
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
31
53
0
22 Dec 2022
Analysis and application of multispectral data for water segmentation using machine learning
Shubham Gupta
D. Uma
R. Hebbar
20
0
0
16 Dec 2022
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
Benjin Zhu
Zhe Wang
Shaoshuai Shi
Hang Xu
Lanqing Hong
Hongsheng Li
16
24
0
14 Dec 2022
NMS Strikes Back
Jeffrey Ouyang-Zhang
Jang Hyun Cho
Xingyi Zhou
Philipp Krahenbuhl
35
38
0
12 Dec 2022
REAP: A Large-Scale Realistic Adversarial Patch Benchmark
Nabeel Hingun
Chawin Sitawarin
Jerry Li
David Wagner
AAML
31
14
0
12 Dec 2022
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Jishnu Mukhoti
Tsung-Yu Lin
Omid Poursaeed
Rui Wang
Ashish Shah
Philip Torr
Ser-Nam Lim
VLM
35
80
0
09 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
29
2
0
06 Dec 2022
GRiT: A Generative Region-to-text Transformer for Object Understanding
Jialian Wu
Jianfeng Wang
Zhengyuan Yang
Zhe Gan
Zicheng Liu
Junsong Yuan
Lijuan Wang
ObjD
VLM
22
112
0
01 Dec 2022
How to Train an Accurate and Efficient Object Detection Model on Any Dataset
Galina Zalesskaya
B. Bylicka
Eugene Liu
3DH
34
3
0
30 Nov 2022
Analysis of Training Object Detection Models with Synthetic Data
Bram Vanherle
Steven Moonen
F. Reeth
Nick Michiels
19
14
0
29 Nov 2022
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
Siyi Liu
Yaoyuan Liang
Feng Li
Shijia Huang
Hao Zhang
Hang Su
Jun Zhu
Lei Zhang
ObjD
50
25
0
28 Nov 2022
Learning Object-Language Alignments for Open-Vocabulary Object Detection
Chuang Lin
Pei Sun
Yi-Xin Jiang
Ping Luo
Lizhen Qu
Gholamreza Haffari
Zehuan Yuan
Jianfei Cai
VLM
ObjD
29
95
0
27 Nov 2022
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers
Changyong Shu
Jiajun Deng
Feng Yu
Yifan Liu
3DPC
27
10
0
27 Nov 2022
Self-Supervised Learning based on Heat Equation
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
29
4
0
23 Nov 2022
DETRs with Collaborative Hybrid Assignments Training
Zhuofan Zong
Guanglu Song
Yu Liu
ViT
57
306
0
22 Nov 2022
Teach-DETR: Better Training DETR with Teachers
Linjiang Huang
Kaixin Lu
Guanglu Song
Liang Wang
Siyu Liu
Yu Liu
Hongsheng Li
35
9
0
22 Nov 2022
Plug and Play Active Learning for Object Detection
Chenhongyi Yang
Lichao Huang
Elliot J. Crowley
ObjD
25
16
0
21 Nov 2022
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
44
71
0
19 Nov 2022
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li
Jinguo Zhu
Xiaohu Jiang
Xizhou Zhu
Hongsheng Li
...
Xiaohua Wang
Yu Qiao
Xiaogang Wang
Wenhai Wang
Jifeng Dai
MLLM
26
55
0
17 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
42
41
0
17 Nov 2022
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
68
443
0
17 Nov 2022
D
3
^3
3
ETR: Decoder Distillation for Detection Transformer
Xiaokang Chen
Jiahui Chen
Yong-Jin Liu
Gang Zeng
42
16
0
17 Nov 2022
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Guo Chen
Sen Xing
Zhe Chen
Yi Wang
Kunchang Li
...
Hongjie Zhang
Tong Lu
Yali Wang
Liming Wang
Yu Qiao
41
46
0
17 Nov 2022
DETRDistill: A Universal Knowledge Distillation Framework for DETR-families
Jiahao Chang
Shuo Wang
Guangkai Xu
Zehui Chen
Chenhongyi Yang
Fengshang Zhao
34
29
0
17 Nov 2022
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
Yu Wang
Xin Li
Shengzhao Wen
Fu-En Yang
Wanping Zhang
Gang Zhang
Haocheng Feng
Junyu Han
Errui Ding
45
5
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
87
679
0
14 Nov 2022
DEYO: DETR with YOLO for Step-by-Step Object Detection
Hao Ouyang
38
7
0
12 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
41
660
0
10 Nov 2022
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
Qiang Chen
Jian Wang
Chuchu Han
Shangang Zhang
Zexian Li
...
Haocheng Feng
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
ViT
VLM
42
45
0
07 Nov 2022
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency
Yang Liu
Yao Zhang
Yixin Wang
Yang Zhang
Jiang Tian
Zhongchao Shi
Jianping Fan
Zhiqiang He
42
14
0
03 Nov 2022
State-of-the-art Models for Object Detection in Various Fields of Application
S. A. G. Naqvi
Syed Shahnawaz Ali
ObjD
OOD
38
0
0
01 Nov 2022
Transformers For Recognition In Overhead Imagery: A Reality Check
Francesco Luzi
Aneesh Gupta
L. Collins
Kyle Bradbury
Jordan M. Malof
ViT
35
4
0
23 Oct 2022
RLM-Tracking: Online Multi-Pedestrian Tracking Supported by Relative Location Mapping
Kai Ren
Chuanping Hu
27
2
0
19 Oct 2022
1st Place Solutions for the UVO Challenge 2022
Jiajun Zhang
Boyu Chen
Zhilong Ji
Jinfeng Bai
Zonghai Hu
39
1
0
18 Oct 2022
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu
Yixing Lao
Li Jiang
Xihui Liu
Hengshuang Zhao
3DPC
ViT
32
367
0
11 Oct 2022
A Closer Look at Robustness to L-infinity and Spatial Perturbations and their Composition
Luke Rowe
Benjamin Thérien
Krzysztof Czarnecki
Hongyang R. Zhang
OOD
27
0
0
05 Oct 2022
FQDet: Fast-converging Query-based Detector
Cédric Picron
Punarjay Chakravarty
Tinne Tuytelaars
ObjD
41
2
0
05 Oct 2022
Long-Term Localization using Semantic Cues in Floor Plan Maps
Nicky Zimmerman
Tiziano Guadagnino
Xieyuanli Chen
Jens Behley
C. Stachniss
39
23
0
04 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
35
25
0
03 Oct 2022
Physical Adversarial Attack meets Computer Vision: A Decade Survey
Hui Wei
Hao Tang
Xuemei Jia
Zhixiang Wang
Han-Bing Yu
Zhubo Li
Shiníchi Satoh
Luc Van Gool
Zheng Wang
AAML
33
43
0
30 Sep 2022
Motion Transformer with Global Intention Localization and Local Movement Refinement
Shaoshuai Shi
Li Jiang
Dengxin Dai
Bernt Schiele
36
220
0
27 Sep 2022
CenterFormer: Center-based Transformer for 3D Object Detection
Zixiang Zhou
Xian Zhao
Yu Wang
Panqu Wang
H. Foroosh
3DPC
ViT
27
135
0
12 Sep 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
98
139
0
12 Sep 2022
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition
Thanh-Dat Truong
C. Duong
Ngan Le
Marios Savvides
Khoa Luu
CVBM
72
9
0
11 Sep 2022
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang
Zhipeng Luo
Zichen Tian
Yingchen Yu
Jingyi Zhang
Shijian Lu
32
26
0
24 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLM
VLM
ViT
54
629
0
22 Aug 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
ViT
22
251
0
28 Jul 2022
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion
Gongjie Zhang
Zhipeng Luo
Jiaxing Huang
Shijian Lu
Eric Xing
ViT
41
20
0
28 Jul 2022
PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations
Á. Utasi
35
0
0
28 Jul 2022
Previous
1
2
3
...
13
14
15
Next