ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection
v1v2v3v4 (latest)

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXiv (abs)PDFHTMLGithub (2506★)

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 742 papers shown
Title
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
452
0
0
19 Sep 2024
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road
  Topology Problem
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. E. Kalfaoglu
H. Öztürk
Ozsel Kilinc
A. Temi̇zel
3DPC
86
2
0
17 Sep 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
130
4
0
16 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
81
15
0
14 Sep 2024
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary
  Detection
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
Haoxuan Wang
Qu He
Jinlong Peng
Hao Yang
Mingmin Chi
Yabiao Wang
Mamba
104
2
0
13 Sep 2024
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense
  Positive Supervision
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Shuo Wang
Chunlong Xia
Feng Lv
Yifeng Shi
PINNViTMU
105
10
0
13 Sep 2024
From COCO to COCO-FP: A Deep Dive into Background False Positives for
  COCO Detectors
From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors
Longfei Liu
Wen Guo
Shijie Huang
Cheng Li
Xi Shen
ObjD
93
0
0
12 Sep 2024
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
Giorgos Savathrakis
Antonis Argyros
ViT
46
0
0
11 Sep 2024
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor
  Manufacturing for Advanced IC Nodes
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes
Bappaditya Dey
Matthias Monden
Victor Blanco
Sandip Halder
S. de Gendt
58
0
0
06 Sep 2024
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic
  Pituitary Surgery
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
Adrito Das
Danyal Z. Khan
Dimitrios Psychogyios
Yitong Zhang
John G. Hanrahan
...
Santiago Rodriguez
Pablo Arbelaez
Danail Stoyanov
Hani J. Marcus
Sophia Bano
68
6
0
02 Sep 2024
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for
  lesion detection of CT images
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for lesion detection of CT images
Q. Guan
Mengjie Pan
Feng Chen
Zhiqiang Yang
Zhongwen Yu
Qianwei Zhou
Haigen Hu
66
0
0
01 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Shaorong Sun
Shuchao Pang
Yazhou Yao
Xiaoshui Huang
71
1
0
01 Sep 2024
LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in
  Vision-Language Models
LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in Vision-Language Models
Jingyi Wang
Jianzhong Ju
Jian Luan
Zhidong Deng
VLM
94
2
0
29 Aug 2024
Center Direction Network for Grasping Point Localization on Cloths
Center Direction Network for Grasping Point Localization on Cloths
Domen Tabernik
Jon Muhovič
Matej Urbas
Danijel Skočaj
3DPC
61
1
0
26 Aug 2024
LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection
LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection
Zhongwen Yu
Q. Guan
Jianmin Yang
Zhiqiang Yang
Qianwei Zhou
Yang Chen
Feng Chen
94
1
0
26 Aug 2024
CustomCrafter: Customized Video Generation with Preserving Motion and
  Concept Composition Abilities
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu
Yong Zhang
Xintao Wang
Xianpan Zhou
Guangcong Zheng
Zhongang Qi
Ying Shan
Xi Li
VGenDiffM
74
30
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation
  Models
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
93
1
0
23 Aug 2024
RT-OVAD: Real-Time Open-Vocabulary Aerial Object Detection via Image-Text Collaboration
RT-OVAD: Real-Time Open-Vocabulary Aerial Object Detection via Image-Text Collaboration
Guoting Wei
Xia Yuan
Yu Liu
Zhenhao Shang
Kelu Yao
Peng Wang
Qingsen Yan
Chunxia Zhao
Haokui Zhang
Rong Xiao
ObjDVLM
91
1
0
22 Aug 2024
On the Potential of Open-Vocabulary Models for Object Detection in
  Unusual Street Scenes
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes
Sadia Ilyas
Ido Freeman
Matthias Rottmann
ObjD
107
3
0
20 Aug 2024
PADetBench: Towards Benchmarking Physical Attacks against Object Detection
PADetBench: Towards Benchmarking Physical Attacks against Object Detection
Jiawei Lian
Jianhong Pan
L. Wang
Yi Wang
Lap-Pui Chau
Shaohui Mei
AAML
106
0
0
17 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Jiancheng Pan
Yanxing Liu
Yuqian Fu
Muyuan Ma
Jiaohao Li
D. Paudel
Luc Van Gool
Xiaomeng Huang
ObjD
136
9
0
17 Aug 2024
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible
  Object Detection
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection
Junjie Guo
Chenqiang Gao
Fangcen Liu
Deyu Meng
ViT
109
3
0
12 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
124
6
0
12 Aug 2024
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved
  DeNoising Training
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu
Bo Wang
Ye Li
ViT
72
0
0
11 Aug 2024
Embodied Uncertainty-Aware Object Segmentation
Embodied Uncertainty-Aware Object Segmentation
Xiaolin Fang
Leslie Pack Kaelbling
Tomás Lozano-Pérez
58
6
0
08 Aug 2024
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language
  Models
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Qirui Jiao
Daoyuan Chen
Yilun Huang
Yaliang Li
Ying Shen
VLM
116
8
0
08 Aug 2024
Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial
  Conditional Diffusion Model
Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model
Guoqing Zhu
Honghu Pan
Qiang Wang
Chao Tian
Chao Yang
Zhenyu He
72
1
0
07 Aug 2024
Contrastive Learning for Image Complexity Representation
Contrastive Learning for Image Complexity Representation
Shipeng Liu
Liang Zhao
Dengfeng Chen
Zhanping Song
59
2
0
06 Aug 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
Jiaqing Fan
Ziqiang Cao
Larry Head
DiffM
109
9
0
01 Aug 2024
Practical Video Object Detection via Feature Selection and Aggregation
Practical Video Object Detection via Feature Selection and Aggregation
Yuheng Shi
Tong Zhang
Xiaojie Guo
ObjD
114
3
0
29 Jul 2024
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text
  Recognition
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition
Chang Liu
Simon Corbillé
Elisa H Barney Smith
57
0
0
26 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
101
8
0
26 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
100
0
0
24 Jul 2024
Dynamic Retraining-Updating Mean Teacher for Source-Free Object
  Detection
Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection
Trinh Le Ba Khanh
Huy-Hung Nguyen
L. Pham
Duong Nguyen-Ngoc Tran
Jae Wook Jeon
122
3
0
23 Jul 2024
ESOD: Efficient Small Object Detection on High-Resolution Images
ESOD: Efficient Small Object Detection on High-Resolution Images
Kai-Chun Liu
Zhihang Fu
Sheng Jin
Ze Chen
Fan Zhou
Rongxin Jiang
Yao-Shen Chen
Jieping Ye
ObjD
99
4
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
200
1
0
23 Jul 2024
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object
  Detection
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection
Zhourui Zhang
Jun Li
Zhijian Wu
Jifeng Shen
Jianhua Xu
71
0
0
18 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
102
0
0
18 Jul 2024
Relation DETR: Exploring Explicit Position Relation Prior for Object
  Detection
Relation DETR: Exploring Explicit Position Relation Prior for Object Detection
Xiuquan Hou
Mei-qin Liu
Senlin Zhang
Ping Wei
Badong Chen
Xuguang Lan
ViT
100
17
0
16 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded
  Scenes
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
97
6
0
16 Jul 2024
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction
Penghui Du
Yu Wang
Yifan Sun
Luting Wang
Yue Liao
Gang Zhang
Errui Ding
Yan Wang
Jingdong Wang
Si Liu
VLMObjD
127
1
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
77
5
0
16 Jul 2024
SEED: A Simple and Effective 3D DETR in Point Clouds
SEED: A Simple and Effective 3D DETR in Point Clouds
Yanfeng Guo
Jinghua Hou
Xiaoqing Ye
Tong Wang
Jingdong Wang
Xiang Bai
3DPC
89
8
0
15 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model
  and Benchmark Dataset
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
80
6
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjDVLM
89
2
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object
  Detection
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
134
1
0
13 Jul 2024
FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and
  Dehiscence Detection from Intraoral Images
FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images
Marawan Elbatel
Keyuan Liu
Yanqi Yang
Xuelong Li
58
0
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized
  Segmentation
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
134
5
0
12 Jul 2024
Global-Local Collaborative Inference with LLM for Lidar-Based
  Open-Vocabulary Detection
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng
Yan Bai
Chen Gao
Lirong Yang
Fei Xia
Beipeng Mu
Xiaofei Wang
Si Liu
ObjD
78
3
0
12 Jul 2024
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation
  in Breast Cancer Detection from Mammograms
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
Tajamul Ashraf
K. Rangarajan
Mohit Gambhir
Richa Gabha
Chetan Arora
MedIm
116
2
0
09 Jul 2024
Previous
123456...131415
Next