DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 720 papers shown
Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
Yifan Zhang
Zhiyu Zhu
Junhui Hou
Dapeng Wu
31
7
0
01 Jul 2023
MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying
Shaoshuai Shi
Li Jiang
Dengxin Dai
Bernt Schiele
29
114
0
30 Jun 2023
Integrating Large Pre-trained Models into Multimodal Named Entity Recognition with Evidential Fusion
Weide Liu
Xiaoyang Zhong
Jingwen Hou
Shaohua Li
Haozhe Huang
Yuming Fang
EDL
35
5
0
29 Jun 2023
The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot
L. Osco
Qiusheng Wu
Eduardo Lopes de Lemos
W. Gonçalves
A. P. Ramos
Jonathan Li
J. M. Junior
VLM
18
181
0
29 Jun 2023
Taming Detection Transformers for Medical Object Detection
Marc K. Ickler
Michael Baumgartner
Saikat Roy
Tassilo Wald
Klaus H. Maier-Hein
ViT
MedIm
23
6
0
27 Jun 2023
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
62
562
0
23 Jun 2023
Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images
Tahira Shehzadi
K. Hashmi
D. Stricker
Marcus Liwicki
Muhammad Zeshan Afzal
29
7
0
23 Jun 2023
CrossKD: Cross-Head Knowledge Distillation for Object Detection
Jiabao Wang
Yuming Chen
Zhaohui Zheng
Xiang Li
Ming-Ming Cheng
Qibin Hou
48
33
0
20 Jun 2023
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection
Hao Ouyang
42
4
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
41
7
0
14 Jun 2023
detrex: Benchmarking Detection Transformers
Tianhe Ren
Siyi Liu
Feng Li
Hao Zhang
Ailing Zeng
...
Zhaoyang Zeng
Xianbiao Qi
Yuhui Yuan
Jianwei Yang
Lei Zhang
42
13
0
12 Jun 2023
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
Greg Heinrich
Hongxu Yin
Andrew Tao
J. Álvarez
Jan Kautz
Pavlo Molchanov
ViT
28
68
0
09 Jun 2023
Image Blending Algorithm with Automatic Mask Generation
Haochen Xue
Min Jin
Chong Zhang
Yuxuan Huang
Q. Weng
Xiaobo Jin
19
0
0
08 Jun 2023
RefineVIS: Video Instance Segmentation with Temporal Attention Refinement
Andre Abrantes
Jiang Wang
Peng Chu
Quanzeng You
Zicheng Liu
VOS
29
0
0
07 Jun 2023
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
26
28
0
07 Jun 2023
YONA: You Only Need One Adjacent Reference-frame for Accurate and Fast Video Polyp Detection
Yuncheng Jiang
Zixun Zhang
Ruimao Zhang
Guanbin Li
Shuguang Cui
Zerui Li
26
3
0
06 Jun 2023
Recognize Anything: A Strong Image Tagging Model
Youcai Zhang
Xinyu Huang
Jinyu Ma
Zhaoyang Li
Zhaochuan Luo
...
Tong Luo
Yaqian Li
Siyi Liu
Yandong Guo
Lei Zhang
VLM
47
225
0
06 Jun 2023
OCBEV: Object-Centric BEV Transformer for Multi-View 3D Object Detection
Zhangyang Qi
Jiaqi Wang
Xiaoyang Wu
Hengshuang Zhao
43
11
0
02 Jun 2023
Segment Anything in High Quality
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
VLM
43
311
0
02 Jun 2023
Multi-modal Queried Object Detection in the Wild
Yifan Xu
Mengdan Zhang
Chaoyou Fu
Peixian Chen
Xiaoshan Yang
Ke Li
Changsheng Xu
ObjD
VLM
38
30
0
30 May 2023
MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction
J. Wang
Aixin Sun
Hao Zhang
Xiaoli Li
ViT
21
13
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
41
78
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
30
31
0
29 May 2023
Image Quality Is Not All You Want: Task-Driven Lens Design for Image Classification
Xinge Yang
Qiang Fu
Yunfeng Nie
Wolfgang Heidrich
VLM
29
7
0
26 May 2023
TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection
Xue Zhang
Xiaohan Zhang
Jiacheng Ying
Zehua Sheng
Heng Yu
Chunguang Li
Hui-Liang Shen
ViT
24
8
0
26 May 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
33
2
0
25 May 2023
Thinking Twice: Clinical-Inspired Thyroid Ultrasound Lesion Detection Based on Feature Feedback
Lingtao Wang
Jianrui Ding
Fenghe Tang
C. Ning
36
1
0
24 May 2023
ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents
Christoph Auer
A. Nassar
Maksym Lysak
Michele Dolfi
Nikolaos Livathinos
Peter W. J. Staar
OOD
3DV
35
6
0
24 May 2023
Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking
Feng Yan
Weihua Luo
Yujie Zhong
Yiyang Gan
Lin Ma
VOT
43
15
0
22 May 2023
Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model
Jie Yang
Bing Li
Fengyu Yang
Ailing Zeng
Lei Zhang
Ruimao Zhang
VLM
DiffM
32
17
0
20 May 2023
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer
Hakjin Lee
Minki Song
Jamyoung Koo
Junghoon Seo
37
7
0
12 May 2023
Segment and Track Anything
Yangming Cheng
Liulei Li
Yuanyou Xu
Xiaodi Li
Zongxin Yang
Wenguan Wang
Yi Yang
VOS
30
193
0
11 May 2023
WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents
Mingliang Zhang
Zhen Cao
Juntao Liu
Liqiang Niu
Fandong Meng
Jie Zhou
48
6
0
11 May 2023
Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving
Xiaosong Jia
Peng Wu
Li Chen
Jiangwei Xie
Conghui He
Junchi Yan
Hongyang Li
53
94
0
10 May 2023
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Ayan Banerjee
Sanket Biswas
Josep Lladós
Umapada Pal
ViT
20
16
0
08 May 2023
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN
YuXuan Liu
Nikhil Mishra
Pieter Abbeel
Xi Chen
ISeg
UQCV
21
4
0
03 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
56
33
0
29 Apr 2023
A Strong and Reproducible Object Detector with Only Public Datasets
Tianhe Ren
Jianwei Yang
Siyi Liu
Ailing Zeng
Feng Li
Hao Zhang
Hongyang Li
Zhaoyang Zeng
Lei Zhang
ObjD
43
11
0
25 Apr 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
42
13
0
24 Apr 2023
OmniLabel: A Challenging Benchmark for Language-Based Object Detection
S. Schulter
G. VijayKumarB.
Yumin Suh
Konstantinos M. Dafnis
Zhixing Zhang
Shiyu Zhao
Dimitris N. Metaxas
ObjD
35
12
0
22 Apr 2023
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi
Jianan Wang
Yihao Chen
Yukai Shi
Lei Zhang
46
16
0
19 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
42
132
0
19 Apr 2023
MMDR: A Result Feature Fusion Object Detection Approach for Autonomous System
Wendong Zhang
27
0
0
19 Apr 2023
DETRs Beat YOLOs on Real-time Object Detection
Yian Zhao
Wenyu Lv
Shangliang Xu
Jinman Wei
Guanzhong Wang
Qingqing Dang
Yi Liu
Cheng Cui
35
838
0
17 Apr 2023
Permutation Equivariance of Transformers and Its Applications
Hengyuan Xu
Liyao Xiang
Hang Ye
Dixi Yao
Pengzhi Chu
Baochun Li
19
13
0
16 Apr 2023
Align-DETR: Improving DETR with Simple IoU-aware BCE loss
Zhi Cai
Songtao Liu
Guodong Wang
Zheng Ge
Xiangyu Zhang
Di Huang
34
3
0
15 Apr 2023
CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction
Hongbo Tian
Yulong Li
Linzhi Huang
Xu Ling
Yue Yang
Weihong Deng
3DV
19
0
0
14 Apr 2023
Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation
Yifeng Shi
Feng Lv
Xinliang Wang
Chunlong Xia
Shaojie Li
Shu-Zhen Yang
Teng Xi
Gang Zhang
VLM
46
13
0
12 Apr 2023
StageInteractor: Query-based Object Detector with Cross-stage Interaction
Yao Teng
Haisong Liu
Sheng Guo
Limin Wang
ObjD
36
8
0
11 Apr 2023
Detection Transformer with Stable Matching
Siyi Liu
Tianhe Ren
Jia-Yu Chen
Zhaoyang Zeng
Hao Zhang
...
Hongyang Li
Jun Huang
Hang Su
Jun Zhu
Lei Zhang
33
34
0
10 Apr 2023