ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection
v1v2v3v4 (latest)

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXiv (abs)PDFHTMLGithub (2506★)

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 742 papers shown
Title
DLAFormer: An End-to-End Transformer For Document Layout Analysis
DLAFormer: An End-to-End Transformer For Document Layout Analysis
Jiawei Wang
Kai Hu
Qiang Huo
3DVViT
73
3
0
20 May 2024
Track Anything Rapter(TAR)
Track Anything Rapter(TAR)
Tharun V. Puthanveettil
Fnu Obaid ur Rahman
74
0
0
19 May 2024
Visible and Clear: Finding Tiny Objects in Difference Map
Visible and Clear: Finding Tiny Objects in Difference Map
Bing Cao
Haiyu Yao
Pengfei Zhu
Qinghua Hu
ObjD
92
8
0
18 May 2024
Open-Vocabulary Spatio-Temporal Action Detection
Open-Vocabulary Spatio-Temporal Action Detection
Tao Wu
Shuqiu Ge
Jie Qin
Gangshan Wu
Limin Wang
ObjD
75
7
0
17 May 2024
A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells
  Detection with Morphological Attributes for Explainability
A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability
Abdul Rehman
Talha Meraj
A. Minhas
Ayisha Imran
Mohsen Ali
Waqas Sultani
97
2
0
17 May 2024
Better Sampling, towards Better End-to-end Small Object Detection
Better Sampling, towards Better End-to-end Small Object Detection
Zile Huang
Chong Zhang
Mingyu Jin
Fangyu Wu
Chengzhi Liu
Xiaobo Jin
ObjD
117
1
0
17 May 2024
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Tianhe Ren
Qing Jiang
Shilong Liu
Zhaoyang Zeng
Wenlong Liu
...
Hao Zhang
Feng Li
Peijun Tang
Kent Yu
Lei Zhang
ObjDVLM
133
38
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks
  via Multi-modal Large Language Models
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
130
21
0
16 May 2024
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network
Zhaoxu Li
Wei An
Gaowei Guo
Longguang Wang
Yingqian Wang
Zaiping Lin
ViT
209
0
0
16 May 2024
Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal
  Candidiasis Screening
Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening
Yan Kong
Sheng Wang
Jiangdong Cai
Zihao Zhao
Zhenrong Shen
Yonghao Li
Manman Fei
Qian Wang
93
4
0
15 May 2024
MetaFruit Meets Foundation Models: Leveraging a Comprehensive
  Multi-Fruit Dataset for Advancing Agricultural Foundation Models
MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models
Jiajia Li
Kyle Lammers
Xunyuan Yin
Xiang Yin
Long He
Renfu Lu
Zhaojian Li
98
3
0
14 May 2024
Wild Berry image dataset collected in Finnish forests and peatlands using drones
Wild Berry image dataset collected in Finnish forests and peatlands using drones
Luigi Riz
Sergio Povoli
Andrea Caraffa
Davide Boscaini
M. L. Mekhalfi
...
Elisa Castelli
Giacomo Piccinini
L. Marchesotti
Micael S. Couceiro
Fabio Poiesi
124
1
0
13 May 2024
Replication Study and Benchmarking of Real-Time Object Detection Models
Replication Study and Benchmarking of Real-Time Object Detection Models
Pierre-Luc Asselin
Vincent Coulombe
William Guimont-Martin
William Larrivée-Hardy
88
0
0
11 May 2024
How to Augment for Atmospheric Turbulence Effects on Thermal Adapted
  Object Detection Models?
How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?
Engin Uzun
Erdem Akagündüz
70
0
0
10 May 2024
Prompt When the Animal is: Temporal Animal Behavior Grounding with
  Positional Recovery Training
Prompt When the Animal is: Temporal Animal Behavior Grounding with Positional Recovery Training
Sheng Yan
Xin Du
Zongying Li
Yi Wang
Hongcang Jin
Mengyuan Liu
OODVLM
57
0
0
09 May 2024
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via
  Editable Gaussian Splatting
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
O. Shorinwa
Johnathan Tucker
Aliyah Smith
Aiden Swann
Timothy Chen
Roya Firoozi
Monroe Kennedy
Mac Schwager
145
25
0
07 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World
  Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGenLM&Ro
176
48
0
06 May 2024
Enhancing DETRs Variants through Improved Content Query and Similar
  Query Aggregation
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
81
3
0
06 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
PTQ4SAM: Post-Training Quantization for Segment Anything
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLMMQ
85
16
0
06 May 2024
Multi-method Integration with Confidence-based Weighting for Zero-shot
  Image Classification
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Siqi Yin
Lifan Jiang
47
0
0
03 May 2024
Towards Consistent Object Detection via LiDAR-Camera Synergy
Towards Consistent Object Detection via LiDAR-Camera Synergy
Kai Luo
Hao Wu
Kefu Yi
Kailun Yang
Wei Hao
Rongdong Hu
62
1
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
98
8
0
02 May 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu
Yiming Hao
Manyuan Zhang
Keqiang Sun
Zhaoyang Huang
Guanglu Song
Yu Liu
Hongsheng Li
EGVM
127
25
0
01 May 2024
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target
  Identification with Large Multimodal Models
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
Hongzhan Lin
Zixin Chen
Ziyang Luo
Mingfei Cheng
Jing Ma
Guang Chen
91
6
0
01 May 2024
Towards End-to-End Semi-Supervised Table Detection with Semantic Aligned
  Matching Transformer
Towards End-to-End Semi-Supervised Table Detection with Semantic Aligned Matching Transformer
Tahira Shehzadi
Shalini Sarode
Didier Stricker
Muhammad Zeshan Afzal
LMTD
108
4
0
30 Apr 2024
UniFS: Universal Few-shot Instance Perception with Point Representations
UniFS: Universal Few-shot Instance Perception with Point Representations
Sheng Jin
Ruijie Yao
Lumin Xu
Wentao Liu
Chao Qian
Ji Wu
Ping Luo
119
2
0
30 Apr 2024
MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
H. R. Medeiros
David Latortue
Fidel Alejandro Guerrero Peña
Eric Granger
M. Pedersoli
58
1
0
29 Apr 2024
A Hybrid Approach for Document Layout Analysis in Document images
A Hybrid Approach for Document Layout Analysis in Document images
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
69
5
0
27 Apr 2024
Sparse Reconstruction of Optical Doppler Tomography with Alternative State Space Model and Attention
Sparse Reconstruction of Optical Doppler Tomography with Alternative State Space Model and Attention
Zhenghong Li
Jiaxiang Ren
Wensheng Cheng
C. Du
Yingtian Pan
Haibin Ling
66
0
0
26 Apr 2024
AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with
  Foundation Models
AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Zhiqiang Tang
Haoyang Fang
Su Zhou
Taojiannan Yang
Zihan Zhong
Tony Hu
Katrin Kirchhoff
George Karypis
109
14
0
24 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
165
0
0
23 Apr 2024
On-the-Fly Point Annotation for Fast Medical Video Labeling
On-the-Fly Point Annotation for Fast Medical Video Labeling
A. Meyer
J. Mazellier
Jérémy Dana
Nicolas Padoy
70
0
0
22 Apr 2024
MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed
  3D Human Motions
MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions
Sheng Yan
Mengyuan Liu
Yong Wang
Yang Liu
Chong Chen
Hong Liu
94
2
0
21 Apr 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
Zhuofan Zong
Bingqi Ma
Dazhong Shen
Guanglu Song
Hao Shao
Dongzhi Jiang
Hongsheng Li
Yu Liu
MoE
108
51
0
19 Apr 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large
  Language Models
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Chuofan Ma
Yi Jiang
Jiannan Wu
Zehuan Yuan
Xiaojuan Qi
VLMObjD
113
65
0
19 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image
  Segmentation
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
91
9
0
18 Apr 2024
The 8th AI City Challenge
The 8th AI City Challenge
Shuo Wang
D. Anastasiu
Zhenghang Tang
Ming-Ching Chang
Yue Yao
...
Xunlei Wu
S. Pusegaonkar
Yizhou Wang
Sujit Biswas
Rama Chellappa
117
32
0
15 Apr 2024
Arena: A Patch-of-Interest ViT Inference Acceleration System for
  Edge-Assisted Video Analytics
Arena: A Patch-of-Interest ViT Inference Acceleration System for Edge-Assisted Video Analytics
Haosong Peng
Wei Feng
Hao Li
Yufeng Zhan
Qihua Zhou
Yuanqing Xia
56
3
0
14 Apr 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
Lewei Yao
Renjie Pi
Jianhua Han
Xiaodan Liang
Hang Xu
Wei Zhang
Zhenguo Li
Dan Xu
VLMObjD
96
26
0
14 Apr 2024
Enhancing Mobile "How-to" Queries with Automated Search Results
  Verification and Reranking
Enhancing Mobile "How-to" Queries with Automated Search Results Verification and Reranking
Lei Ding
Jeshwanth Bheemanpally
Yi Zhang
85
1
0
13 Apr 2024
COCONut: Modernizing COCO Segmentation
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
84
17
0
12 Apr 2024
Lightweight Deep Learning for Resource-Constrained Environments: A
  Survey
Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu
Marco Galindo
Hongxia Xie
Lai-Kuan Wong
Hong-Han Shuai
Yung-Hui Li
Wen-Huang Cheng
134
67
0
08 Apr 2024
LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
Shiyi Zhang
Wen-Dao Dai
Sujia Wang
Xiangwei Shen
Jiwen Lu
Jie Zhou
Yansong Tang
111
28
0
07 Apr 2024
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Fanjie Kong
Yanbei Chen
Jiarui Cai
Davide Modolo
VLMObjD
67
7
0
07 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
121
2
0
06 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
104
19
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
137
2
0
04 Apr 2024
TE-TAD: Towards Full End-to-End Temporal Action Detection via
  Time-Aligned Coordinate Expression
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
Ho-Joong Kim
Jung-Ho Hong
Heejo Kong
Seong-Whan Lee
80
5
0
03 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object
  Detection
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
90
13
0
02 Apr 2024
Roadside Monocular 3D Detection via 2D Detection Prompting
Roadside Monocular 3D Detection via 2D Detection Prompting
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
Yanan Li
Shu Kong
93
0
0
01 Apr 2024
Previous
123...678...131415
Next