ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.02404
  4. Cited By
Fast Convergence of DETR with Spatially Modulated Co-Attention

Fast Convergence of DETR with Spatially Modulated Co-Attention

5 August 2021
Peng Gao
Minghang Zheng
Xiaogang Wang
Jifeng Dai
Hongsheng Li
    ViT
ArXivPDFHTML

Papers citing "Fast Convergence of DETR with Spatially Modulated Co-Attention"

50 / 79 papers shown
Title
A 2D Semantic-Aware Position Encoding for Vision Transformers
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yujie Zhang
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
31
0
0
14 May 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
45
1
0
18 Jan 2025
DEIM: DETR with Improved Matching for Fast Convergence
DEIM: DETR with Improved Matching for Fast Convergence
Shihua Huang
Zhichao Lu
Xiaodong Cun
Yongjun Yu
Xiao Zhou
Xi Shen
VLM
231
2
0
05 Dec 2024
Cross Resolution Encoding-Decoding For Detection Transformers
Cross Resolution Encoding-Decoding For Detection Transformers
Ashish Kumar
Jaesik Park
ViT
38
0
0
05 Oct 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
56
3
0
16 Sep 2024
SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based
  3D Object Detection
SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection
Lin Liu
Ziying Song
Qiming Xia
Feiyang Jia
Caiyan Jia
Lei Yang
Hongyu Pan
3DPC
58
6
0
16 Jun 2024
A Hybrid Approach for Document Layout Analysis in Document images
A Hybrid Approach for Document Layout Analysis in Document images
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
37
5
0
27 Apr 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network
  Stacking
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
38
4
0
07 Mar 2024
Deployment Prior Injection for Run-time Calibratable Object Detection
Deployment Prior Injection for Run-time Calibratable Object Detection
Mo Zhou
Yiding Yang
Haoxiang Li
Vishal M. Patel
Gang Hua
44
0
0
27 Feb 2024
A Graph-Based Approach for Category-Agnostic Pose Estimation
A Graph-Based Approach for Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
31
10
0
29 Nov 2023
DETR Doesn't Need Multi-Scale or Locality Design
DETR Doesn't Need Multi-Scale or Locality Design
Yutong Lin
Yuhui Yuan
Zheng-Wei Zhang
Chen Li
Nanning Zheng
Han Hu
37
5
0
03 Aug 2023
Box-DETR: Understanding and Boxing Conditional Spatial Queries
Box-DETR: Understanding and Boxing Conditional Spatial Queries
Wenze Liu
Hao Lu
Yuliang Liu
Zhiguo Cao
ViT
31
2
0
17 Jul 2023
PM-DETR: Domain Adaptive Prompt Memory for Object Detection with
  Transformers
PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers
Peidong Jia
Jiaming Liu
Senqiao Yang
Jiarui Wu
Xiaodong Xie
Shanghang Zhang
VLM
45
2
0
01 Jul 2023
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object
  Detection
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection
Hao Ouyang
39
4
0
15 Jun 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
22
30
0
25 May 2023
A Strong and Reproducible Object Detector with Only Public Datasets
A Strong and Reproducible Object Detector with Only Public Datasets
Tianhe Ren
Jianwei Yang
Siyi Liu
Ailing Zeng
Feng Li
Hao Zhang
Hongyang Li
Zhaoyang Zeng
Lei Zhang
ObjD
41
11
0
25 Apr 2023
StageInteractor: Query-based Object Detector with Cross-stage
  Interaction
StageInteractor: Query-based Object Detector with Cross-stage Interaction
Yao Teng
Haisong Liu
Sheng Guo
Limin Wang
ObjD
34
8
0
11 Apr 2023
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Sheng Xu
Yanjing Li
Mingbao Lin
Penglei Gao
Guodong Guo
Jinhu Lu
Baochang Zhang
MQ
29
23
0
01 Apr 2023
One-to-Few Label Assignment for End-to-End Dense Detection
One-to-Few Label Assignment for End-to-End Dense Detection
Shuai Li
Minghan Li
Ruihuang Li
Chenhang He
Lei Zhang
38
19
0
21 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set
  Object Detection
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
109
1,823
0
09 Mar 2023
KS-DETR: Knowledge Sharing in Attention Learning for Detection
  Transformer
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer
Kaikai Zhao
Norimichi Ukita
MU
43
1
0
22 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
Qifeng Yu
ObjD
89
9
0
21 Feb 2023
Transformadores: Fundamentos teoricos y Aplicaciones
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
78
0
0
18 Feb 2023
Text to Point Cloud Localization with Relation-Enhanced Transformer
Text to Point Cloud Localization with Relation-Enhanced Transformer
Guangzhi Wang
Hehe Fan
Mohan S. Kankanhalli
3DPC
33
14
0
13 Jan 2023
End-to-End 3D Dense Captioning with Vote2Cap-DETR
End-to-End 3D Dense Captioning with Vote2Cap-DETR
Sijin Chen
Erik Cambria
Xin Chen
Yinjie Lei
Tao Chen
YU Gang
ViT
21
52
0
06 Jan 2023
GPTR: Gestalt-Perception Transformer for Diagram Object Detection
GPTR: Gestalt-Perception Transformer for Diagram Object Detection
Xin Hu
Lingling Zhang
Jun Liu
Jinfu Fan
Yang You
Yaqiang Wu
ViT
37
5
0
29 Dec 2022
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and
  Grounding
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
Siyi Liu
Yaoyuan Liang
Feng Li
Shijia Huang
Hao Zhang
Hang Su
Jun Zhu
Lei Zhang
ObjD
50
25
0
28 Nov 2022
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection
  Transformers
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers
Changyong Shu
Jiajun Deng
Feng Yu
Yifan Liu
3DPC
27
10
0
27 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
24
23
0
21 Nov 2022
DiffusionDet: Diffusion Model for Object Detection
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
65
443
0
17 Nov 2022
Knowledge Distillation for Detection Transformer with Consistent
  Distillation Points Sampling
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
Yu Wang
Xin Li
Shengzhao Wen
Fu-En Yang
Wanping Zhang
Gang Zhang
Haocheng Feng
Junyu Han
Errui Ding
45
5
0
15 Nov 2022
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based
  Transformer Detector for Fast Model Convergency
SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency
Yang Liu
Yao Zhang
Yixin Wang
Yang Zhang
Jiang Tian
Zhongchao Shi
Jianping Fan
Zhiqiang He
42
14
0
03 Nov 2022
FQDet: Fast-converging Query-based Detector
FQDet: Fast-converging Query-based Detector
Cédric Picron
Punarjay Chakravarty
Tinne Tuytelaars
ObjD
41
2
0
05 Oct 2022
IoU-Enhanced Attention for End-to-End Task Specific Object Detection
IoU-Enhanced Attention for End-to-End Task Specific Object Detection
Jing Zhao
Shengjian Wu
Li Sun
Qingli Li
33
6
0
21 Sep 2022
Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation
  Exploitation
Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation
Gongjie Zhang
Zhipeng Luo
Kaiwen Cui
Shijian Lu
Eric P. Xing
ViT
44
93
0
30 Jul 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
49
120
0
26 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
30
33
0
18 Jul 2022
Polar Parametrization for Vision-based Surround-View 3D Detection
Polar Parametrization for Vision-based Surround-View 3D Detection
Shaoyu Chen
Xinggang Wang
Tianheng Cheng
Qian Zhang
Chang Huang
Wenyu Liu
3DPC
32
68
0
22 Jun 2022
Featurized Query R-CNN
Featurized Query R-CNN
Wenqiang Zhang
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Qian Zhang
Wenyu Liu
ObjD
27
5
0
13 Jun 2022
VITA: Video Instance Segmentation via Object Token Association
VITA: Video Instance Segmentation via Object Token Association
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VOS
23
88
0
09 Jun 2022
What Are Expected Queries in End-to-End Object Detection?
What Are Expected Queries in End-to-End Object Detection?
Shilong Zhang
Xinjiang Wang
Jiaqi Wang
Jiangmiao Pang
Kai-xiang Chen
25
5
0
02 Jun 2022
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud
  Pre-training
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang
Ziyu Guo
Rongyao Fang
Bingyan Zhao
Dong Wang
Yu Qiao
Hongsheng Li
Peng Gao
3DPC
184
245
0
28 May 2022
ConvMAE: Masked Convolution Meets Masked Autoencoders
ConvMAE: Masked Convolution Meets Masked Autoencoders
Peng Gao
Teli Ma
Hongsheng Li
Ziyi Lin
Jifeng Dai
Yu Qiao
ViT
19
121
0
08 May 2022
BatchFormerV2: Exploring Sample Relationships for Dense Representation
  Learning
BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning
Zhi Hou
Baosheng Yu
Chaoyue Wang
Yibing Zhan
Dacheng Tao
ViT
32
11
0
04 Apr 2022
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Haoyu He
Jianfei Cai
Zizheng Pan
Jing Liu
Jing Zhang
Dacheng Tao
Bohan Zhuang
34
17
0
04 Apr 2022
POS-BERT: Point Cloud One-Stage BERT Pre-Training
POS-BERT: Point Cloud One-Stage BERT Pre-Training
Kexue Fu
Peng Gao
Shaolei Liu
Renrui Zhang
Yu Qiao
Manning Wang
3DPC
30
18
0
03 Apr 2022
AdaMixer: A Fast-Converging Query-Based Object Detector
AdaMixer: A Fast-Converging Query-Based Object Detector
Ziteng Gao
Limin Wang
Bing Han
Sheng Guo
ObjD
39
105
0
30 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
27
28
0
24 Mar 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Peng Gao
Hongsheng Li
Hongsheng Li
ViT
MDE
56
82
0
24 Mar 2022
Open-Vocabulary DETR with Conditional Matching
Open-Vocabulary DETR with Conditional Matching
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
ObjD
VLM
41
197
0
22 Mar 2022
12
Next