2505.21868
Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection
28 May 2025
Guiping Cao
Wenjian Huang
X. Lan
Jianguo Zhang
D. Jiang
Yaowei Wang
ViT
Papers citing
"Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection"
22 / 22 papers shown
Title
Visible and Clear: Finding Tiny Objects in Difference Map
Bing Cao
Haiyu Yao
Pengfei Zhu
Qinghua Hu
ObjD
59
5
0
18 May 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
61
16
0
04 Apr 2024
Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning
Xiang Yuan
Gong Cheng
Ke Yan
Qinghua Zeng
Junwei Han
ObjD
52
52
0
18 Aug 2023
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
72
450
0
17 Nov 2022
Towards Large-Scale Small Object Detection: Survey and Benchmarks
Gong Cheng
Xiang Yuan
Xiwen Yao
Ke Yan
Qinghua Zeng
Xingxing Xie
Junwei Han
ObjD
58
311
0
28 Jul 2022
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
110
1,399
0
07 Mar 2022
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
Feng Li
Hao Zhang
Shilong Liu
Jian Guo
L. Ni
Lei Zhang
ViT
85
660
0
02 Mar 2022
Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity
Byungseok Roh
Jaewoong Shin
Wuhyun Shin
Saehoon Kim
ViT
31
145
0
29 Nov 2021
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Hai-Tao Zheng
Li Tao
Dun Liang
Haitao Zheng
102
98
0
07 Nov 2021
Conditional DETR for Fast Training Convergence
Depu Meng
Xiaokang Chen
Zejia Fan
Gang Zeng
Houqiang Li
Yuhui Yuan
Lei Sun
Jingdong Wang
ViT
29
606
0
13 Aug 2021
Dynamic Head: Unifying Object Detection Heads with Attentions
Xiyang Dai
Yinpeng Chen
Bin Xiao
Dongdong Chen
Mengchen Liu
Lu Yuan
Lei Zhang
31
566
0
15 Jun 2021
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Chun-Fu Chen
Quanfu Fan
Yikang Shen
ViT
37
1,450
0
27 Mar 2021
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
197
6,657
0
23 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
118
40,217
0
22 Oct 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
126
4,993
0
08 Oct 2020
WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild
Shifeng Zhang
Yiliang Xie
Jun Wan
Hansheng Xia
Stan Z. Li
G. Guo
32
135
0
25 Sep 2019
CenterNet: Keypoint Triplets for Object Detection
Kaiwen Duan
S. Bai
Lingxi Xie
H. Qi
Qingming Huang
Q. Tian
NoLa
84
2,663
0
17 Apr 2019
SNIPER: Efficient Multi-Scale Training
Bharat Singh
Mahyar Najibi
L. Davis
ObjD
44
483
0
23 May 2018
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSL
SSeg
FAtt
84
9,266
0
14 Dec 2015
Multi-Scale Context Aggregation by Dilated Convolutions
Fisher Yu
V. Koltun
SSeg
127
8,421
0
23 Nov 2015
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
465
36,643
0
08 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
317
61,900
0
04 Jun 2015