2203.03605
Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
50 / 720 papers shown
Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
Yifan Zhang
Zhiyu Zhu
Junhui Hou
Dapeng Wu
31
7
0
01 Jul 2023
MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying
Shaoshuai Shi
Li Jiang
Dengxin Dai
Bernt Schiele
29
114
0
30 Jun 2023
Integrating Large Pre-trained Models into Multimodal Named Entity Recognition with Evidential Fusion
Weide Liu
Xiaoyang Zhong
Jingwen Hou
Shaohua Li
Haozhe Huang
Yuming Fang
EDL
35
5
0
29 Jun 2023
The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot
L. Osco
Qiusheng Wu
Eduardo Lopes de Lemos
W. Gonçalves
A. P. Ramos
Jonathan Li
J. M. Junior
VLM
18
181
0
29 Jun 2023
Taming Detection Transformers for Medical Object Detection
Marc K. Ickler
Michael Baumgartner
Saikat Roy
Tassilo Wald
Klaus H. Maier-Hein
ViT
MedIm
23
6
0
27 Jun 2023
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
62
562
0
23 Jun 2023
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images
Tahira Shehzadi
K. Hashmi
D. Stricker
Marcus Liwicki
Muhammad Zeshan Afzal
29
7
0
23 Jun 2023
CrossKD: Cross-Head Knowledge Distillation for Object Detection
Jiabao Wang
Yuming Chen
Zhaohui Zheng
Xiang Li
Ming-Ming Cheng
Qibin Hou
48
33
0
20 Jun 2023
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection
Hao Ouyang
42
4
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
41
7
0
14 Jun 2023
detrex: Benchmarking Detection Transformers
Tianhe Ren
Siyi Liu
Feng Li
Hao Zhang
Ailing Zeng
...
Zhaoyang Zeng
Xianbiao Qi
Yuhui Yuan
Jianwei Yang
Lei Zhang
42
13
0
12 Jun 2023
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
Greg Heinrich
Hongxu Yin
Andrew Tao
J. Álvarez
Jan Kautz
Pavlo Molchanov
ViT
28
68
0
09 Jun 2023
Image Blending Algorithm with Automatic Mask Generation
Haochen Xue
Min Jin
Chong Zhang
Yuxuan Huang
Q. Weng
Xiaobo Jin
19
0
0
08 Jun 2023
RefineVIS: Video Instance Segmentation with Temporal Attention Refinement
Andre Abrantes
Jiang Wang
Peng Chu
Quanzeng You
Zicheng Liu
VOS
29
0
0
07 Jun 2023
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
26
28
0
07 Jun 2023
YONA: You Only Need One Adjacent Reference-frame for Accurate and Fast Video Polyp Detection
Yuncheng Jiang
Zixun Zhang
Ruimao Zhang
Guanbin Li
Shuguang Cui
Zerui Li
26
3
0
06 Jun 2023
Recognize Anything: A Strong Image Tagging Model
Youcai Zhang
Xinyu Huang
Jinyu Ma
Zhaoyang Li
Zhaochuan Luo
...
Tong Luo
Yaqian Li
Siyi Liu
Yandong Guo
Lei Zhang
VLM
47
225
0
06 Jun 2023
OCBEV: Object-Centric BEV Transformer for Multi-View 3D Object Detection
Zhangyang Qi
Jiaqi Wang
Xiaoyang Wu
Hengshuang Zhao
43
11
0
02 Jun 2023
Segment Anything in High Quality
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
VLM
43
311
0
02 Jun 2023
Multi-modal Queried Object Detection in the Wild
Yifan Xu
Mengdan Zhang
Chaoyou Fu
Peixian Chen
Xiaoshan Yang
Ke Li
Changsheng Xu
ObjD
VLM
38
30
0
30 May 2023
MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction
J. Wang
Aixin Sun
Hao Zhang
Xiaoli Li
ViT
21
13
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
41
78
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
30
31
0
29 May 2023
Image Quality Is Not All You Want: Task-Driven Lens Design for Image Classification
Xinge Yang
Qiang Fu
Yunfeng Nie
Wolfgang Heidrich
VLM
29
7
0
26 May 2023
TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection
Xue Zhang
Xiaohan Zhang
Jiacheng Ying
Zehua Sheng
Heng Yu
Chunguang Li
Hui-Liang Shen
ViT
24
8
0
26 May 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
33
2
0
25 May 2023
Thinking Twice: Clinical-Inspired Thyroid Ultrasound Lesion Detection Based on Feature Feedback
Lingtao Wang
Jianrui Ding
Fenghe Tang
C. Ning
36
1
0
24 May 2023
ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents
Christoph Auer
A. Nassar
Maksym Lysak
Michele Dolfi
Nikolaos Livathinos
Peter W. J. Staar
OOD
3DV
35
6
0
24 May 2023
Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking
Feng Yan
Weihua Luo
Yujie Zhong
Yiyang Gan
Lin Ma
VOT
43
15
0
22 May 2023
Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model
Jie Yang
Bing Li
Fengyu Yang
Ailing Zeng
Lei Zhang
Ruimao Zhang
VLM
DiffM
32
17
0
20 May 2023
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer
Hakjin Lee
Minki Song
Jamyoung Koo
Junghoon Seo
37
7
0
12 May 2023
Segment and Track Anything
Yangming Cheng
Liulei Li
Yuanyou Xu
Xiaodi Li
Zongxin Yang
Wenguan Wang
Yi Yang
VOS
30
193
0
11 May 2023
WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents
Mingliang Zhang
Zhen Cao
Juntao Liu
Liqiang Niu
Fandong Meng
Jie Zhou
48
6
0
11 May 2023
Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving
Xiaosong Jia
Peng Wu
Li Chen
Jiangwei Xie
Conghui He
Junchi Yan
Hongyang Li
53
94
0
10 May 2023
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Ayan Banerjee
Sanket Biswas
Josep Lladós
Umapada Pal
ViT
20
16
0
08 May 2023
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN
YuXuan Liu
Nikhil Mishra
Pieter Abbeel
Xi Chen
ISeg
UQCV
21
4
0
03 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
56
33
0
29 Apr 2023
A Strong and Reproducible Object Detector with Only Public Datasets
Tianhe Ren
Jianwei Yang
Siyi Liu
Ailing Zeng
Feng Li
Hao Zhang
Hongyang Li
Zhaoyang Zeng
Lei Zhang
ObjD
43
11
0
25 Apr 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
42
13
0
24 Apr 2023
OmniLabel: A Challenging Benchmark for Language-Based Object Detection
S. Schulter
G. VijayKumarB.
Yumin Suh
Konstantinos M. Dafnis
Zhixing Zhang
Shiyu Zhao
Dimitris N. Metaxas
ObjD
35
12
0
22 Apr 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi
Jianan Wang
Yihao Chen
Yukai Shi
Lei Zhang
46
16
0
19 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
42
132
0
19 Apr 2023
MMDR: A Result Feature Fusion Object Detection Approach for Autonomous System
Wendong Zhang
27
0
0
19 Apr 2023
DETRs Beat YOLOs on Real-time Object Detection
Yian Zhao
Wenyu Lv
Shangliang Xu
Jinman Wei
Guanzhong Wang
Qingqing Dang
Yi Liu
Cheng Cui
35
838
0
17 Apr 2023
Permutation Equivariance of Transformers and Its Applications
Hengyuan Xu
Liyao Xiang
Hang Ye
Dixi Yao
Pengzhi Chu
Baochun Li
19
13
0
16 Apr 2023
Align-DETR: Improving DETR with Simple IoU-aware BCE loss
Zhi Cai
Songtao Liu
Guodong Wang
Zheng Ge
Xiangyu Zhang
Di Huang
34
3
0
15 Apr 2023
CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction
Hongbo Tian
Yulong Li
Linzhi Huang
Xu Ling
Yue Yang
Weihong Deng
3DV
19
0
0
14 Apr 2023
Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation
Yifeng Shi
Feng Lv
Xinliang Wang
Chunlong Xia
Shaojie Li
Shu-Zhen Yang
Teng Xi
Gang Zhang
VLM
46
13
0
12 Apr 2023
StageInteractor: Query-based Object Detector with Cross-stage Interaction
Yao Teng
Haisong Liu
Sheng Guo
Limin Wang
ObjD
36
8
0
11 Apr 2023
Detection Transformer with Stable Matching
Siyi Liu
Tianhe Ren
Jia-Yu Chen
Zhaoyang Zeng
Hao Zhang
...
Hongyang Li
Jun Huang
Hang Su
Jun Zhu
Lei Zhang
33
34
0
10 Apr 2023