ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXivPDFHTML

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 720 papers shown
Title
YOLO-World: Real-Time Open-Vocabulary Object Detection
YOLO-World: Real-Time Open-Vocabulary Object Detection
Tianheng Cheng
Lin Song
Yixiao Ge
Wenyu Liu
Xinggang Wang
Ying Shan
VLM
ObjD
38
251
0
30 Jan 2024
Characterization of Magnetic Labyrinthine Structures Through Junctions
  and Terminals Detection Using Template Matching and CNN
Characterization of Magnetic Labyrinthine Structures Through Junctions and Terminals Detection Using Template Matching and CNN
YU Okubo
Kotaro Shimizu
B. S. Shivaram
Hae Yong Kim
23
1
0
30 Jan 2024
Computer Vision for Primate Behavior Analysis in the Wild
Computer Vision for Primate Behavior Analysis in the Wild
Richard Vogg
Timo Lüddecke
Jonathan Henrich
Sharmita Dey
Matthias Nuske
...
Alexander Gail
Stefan Treue
H. Scherberger
Florentin Wörgötter
Alexander S. Ecker
43
3
0
29 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
51
385
0
25 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL
LRM
56
182
0
24 Jan 2024
ChatterBox: Multi-round Multimodal Referring and Grounding
ChatterBox: Multi-round Multimodal Referring and Grounding
Yunjie Tian
Tianren Ma
Lingxi Xie
Jihao Qiu
Xi Tang
Yuan Zhang
Jianbin Jiao
Qi Tian
Qixiang Ye
33
14
0
24 Jan 2024
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie
Bochen Guan
Weihao Jiang
Muyang Yi
Yue Ding
Hongtao Lu
Lei Zhang
VLM
41
13
0
23 Jan 2024
Detect-Order-Construct: A Tree Construction based Approach for
  Hierarchical Document Structure Analysis
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Zhuoyao Zhong
Lei-huan Sun
Qiang Huo
35
6
0
22 Jan 2024
Pixel-Wise Recognition for Holistic Surgical Scene Understanding
Pixel-Wise Recognition for Holistic Surgical Scene Understanding
Nicolás Ayobi
Santiago Rodríguez
Alejandra Pérez
Isabela Hernández
Nicolás Aparicio
...
Sebastián Pena
J. Santander
J. Caicedo
Nicolás Fernández
Pablo Arbelaez
ViT
MedIm
39
9
0
20 Jan 2024
Symbol as Points: Panoptic Symbol Spotting via Point-based
  Representation
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
Wenlong Liu
Tianyu Yang
Yuhan Wang
Qizhi Yu
Lei Zhang
3DPC
27
5
0
19 Jan 2024
Stream Query Denoising for Vectorized HD Map Construction
Stream Query Denoising for Vectorized HD Map Construction
Shuo Wang
Fan Jia
Yingfei Liu
Yucheng Zhao
Zehui Chen
Tiancai Wang
Chi Zhang
Xiangyu Zhang
Feng Zhao
36
20
0
17 Jan 2024
Small Object Detection by DETR via Information Augmentation and Adaptive
  Feature Fusion
Small Object Detection by DETR via Information Augmentation and Adaptive Feature Fusion
Ji Huang
Hui Wang
ViT
27
5
0
16 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
77
1
0
15 Jan 2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator
  for Vision Applications
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
Yuwen Xiong
Zhiqi Li
Yuntao Chen
Feng Wang
Xizhou Zhu
...
Hongsheng Li
Yu Qiao
Lewei Lu
Jie Zhou
Jifeng Dai
36
51
0
11 Jan 2024
Wasserstein Distance-based Expansion of Low-Density Latent Regions for
  Unknown Class Detection
Wasserstein Distance-based Expansion of Low-Density Latent Regions for Unknown Class Detection
Prakash Mallick
Feras Dayoub
Jamie Sherrah
24
1
0
10 Jan 2024
ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic
  Polyp Detection
ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp Detection
Yuncheng Jiang
Zixun Zhang
Yiwen Hu
Guanbin Li
Xiang Wan
Song Wu
Shuguang Cui
Silin Huang
Zhen Li
29
3
0
10 Jan 2024
Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for
  Memory-Efficient Finetuning
Dr2^22Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
Chen Zhao
Shuming Liu
K. Mangalam
Guocheng Qian
Fatimah Zohra
Abdulmohsen Alghannam
Jitendra Malik
Guohao Li
54
3
0
08 Jan 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
MS-DETR: Efficient DETR Training with Mixed Supervision
Chuyang Zhao
Yifan Sun
Wenhao Wang
Qiang Chen
Errui Ding
Yi Yang
Jingdong Wang
MU
41
20
0
08 Jan 2024
Exploiting Polarized Material Cues for Robust Car Detection
Exploiting Polarized Material Cues for Robust Car Detection
Wen Dong
Haiyang Mei
Ziqi Wei
Ao Jin
Sen Qiu
Qiang Zhang
Xin Yang
22
1
0
05 Jan 2024
An Open and Comprehensive Pipeline for Unified Object Grounding and
  Detection
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection
Xiangyu Zhao
Yicheng Chen
Shilin Xu
Xiangtai Li
Xinjiang Wang
Yining Li
Haian Huang
ObjD
AI4CE
45
29
0
04 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Hefei Ling
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
43
14
0
04 Jan 2024
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video
  Grounding
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
31
12
0
31 Dec 2023
HEAP: Unsupervised Object Discovery and Localization with Contrastive
  Grouping
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
Xin Zhang
Jinheng Xie
Yuan. Yuan
Michael Bi Mi
Robby T. Tan
VOS
OCL
VLM
65
2
0
29 Dec 2023
Transformer-Based Multi-Object Smoothing with Decoupled Data Association
  and Smoothing
Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing
Juliano Pinto
Georg Hess
Yuxuan Xia
H. Wymeersch
Lennart Svensson
VOT
32
3
0
22 Dec 2023
Universal Noise Annotation: Unveiling the Impact of Noisy annotation on
  Object Detection
Universal Noise Annotation: Unveiling the Impact of Noisy annotation on Object Detection
Kwang-seok Ryoo
Yeonsik Jo
Seungjun Lee
Mira Kim
Ahra Jo
S. Kim
Seungryong Kim
Soonyoung Lee
NoLa
34
1
0
21 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
40
3
0
18 Dec 2023
MatchDet: A Collaborative Framework for Image Matching and Object
  Detection
MatchDet: A Collaborative Framework for Image Matching and Object Detection
Jinxiang Lai
Wenlong Wu
Bin-Bin Gao
Jun Liu
Jiawei Zhan
Congchong Nie
Yi Zeng
Chengjie Wang
VLM
30
0
0
18 Dec 2023
DETER: Detecting Edited Regions for Deterring Generative Manipulations
DETER: Detecting Edited Regions for Deterring Generative Manipulations
Sai Wang
Ye Zhu
Ruoyu Wang
Amaya Dharmasiri
Olga Russakovsky
Yu Wu
43
2
0
16 Dec 2023
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for
  Open-Vocabulary Object Detection
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection
Joonhyun Jeong
Geondo Park
Jayeon Yoo
Hyungsik Jung
Heesu Kim
VLM
ObjD
41
10
0
12 Dec 2023
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object
  Detection
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Hu Zhang
Jianhua Xu
Tao Tang
Haiyang Sun
Xin Yu
Zi Huang
Kaicheng Yu
ObjD
3DPC
46
12
0
12 Dec 2023
Mixed Pseudo Labels for Semi-Supervised Object Detection
Mixed Pseudo Labels for Semi-Supervised Object Detection
Ze-Yi Chen
Wenwei Zhang
Xinjiang Wang
Kai Chen
Zhi Wang
ObjD
40
10
0
12 Dec 2023
A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host
  Detection
A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host Detection
N. Gupta
Zeeshan Hayder
Ray P. Norris
Minh Huynh
Lars Petersson
11
3
0
11 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
31
5
0
11 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage
  Multi-Person Multi-Task Human-Centric Perception
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
42
5
0
09 Dec 2023
Vision-based Learning for Drones: A Survey
Vision-based Learning for Drones: A Survey
Jiaping Xiao
Rangya Zhang
Yuhang Zhang
Mir Feroskhan
34
4
0
08 Dec 2023
Lyrics: Boosting Fine-grained Language-Vision Alignment and
  Comprehension via Semantic-aware Visual Objects
Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects
Junyu Lu
Ruyi Gan
Di Zhang
Xiaojun Wu
Ziwei Wu
Renliang Sun
Jiaxing Zhang
Pingjian Zhang
Yan Song
MLLM
VLM
31
15
0
08 Dec 2023
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Hao Zhang
Hongyang Li
Feng Li
Tianhe Ren
Xueyan Zou
...
Shijia Huang
Jianfeng Gao
Lei Zhang
Chun-yue Li
Jianwei Yang
91
68
0
05 Dec 2023
Lenna: Language Enhanced Reasoning Detection Assistant
Lenna: Language Enhanced Reasoning Detection Assistant
Fei Wei
Xinyu Zhang
Ailing Zhang
Bo Zhang
Xiangxiang Chu
MLLM
LRM
29
23
0
05 Dec 2023
MobileUtr: Revisiting the relationship between light-weight CNN and
  Transformer for efficient medical image segmentation
MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation
Fenghe Tang
Bingkun Nian
Jianrui Ding
Quan Quan
Jie Yang
Wei Liu
S.Kevin Zhou
ViT
MedIm
23
3
0
04 Dec 2023
Learning Efficient Unsupervised Satellite Image-based Building Damage
  Detection
Learning Efficient Unsupervised Satellite Image-based Building Damage Detection
Yiyun Zhang
Zijian Wang
Yadan Luo
Xin Yu
Zi Huang
26
4
0
04 Dec 2023
DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding
DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding
Uy Dieu Tran
Minh Luu
P. Nguyen
K. Nguyen
Binh-Son Hua
40
1
0
02 Dec 2023
Segment and Caption Anything
Segment and Caption Anything
Xiaoke Huang
Jianfeng Wang
Yansong Tang
Zheng Zhang
Han Hu
Jiwen Lu
Lijuan Wang
Zicheng Liu
MLLM
VLM
34
18
0
01 Dec 2023
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal
  Sentence Grounding in Videos
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Pilhyeon Lee
Hyeran Byun
19
10
0
30 Nov 2023
Language-conditioned Detection Transformer
Language-conditioned Detection Transformer
Jang Hyun Cho
Philipp Krahenbuhl
VLM
ObjD
47
1
0
29 Nov 2023
A Graph-Based Approach for Category-Agnostic Pose Estimation
A Graph-Based Approach for Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
42
10
0
29 Nov 2023
PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with
  Confidence-Level Prediction and Pose Tokens
PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens
Sebastian Stapf
Tobias Bauernfeind
Marco Riboldi
ViT
25
1
0
29 Nov 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
Dai Shi
ViT
23
77
0
28 Nov 2023
Stable Segment Anything Model
Stable Segment Anything Model
Qi Fan
Xin Tao
Lei Ke
Mingqiao Ye
Yuanhui Zhang
Pengfei Wan
Zhong-ming Wang
Yu-Wing Tai
Chi-Keung Tang
VLM
28
6
0
27 Nov 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large
  Language Models
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
E. Goles
Jinqiao Wang
ObjD
52
15
0
24 Nov 2023
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
Maksim Kolodiazhnyi
Anna Vorontsova
Anton Konushin
D. Rukhovich
ViT
38
41
0
24 Nov 2023
Previous
123...8910...131415
Next