DETRs with Collaborative Hybrid Assignments Training

22 November 2022
Zhuofan Zong
Guanglu Song
Yu Liu
    ViT

Papers citing "DETRs with Collaborative Hybrid Assignments Training"

50 / 157 papers shown
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
38
2
0
22 Mar 2024
VRSO: Visual-Centric Reconstruction for Static Object Annotation
Chenyao Yu
Yingfeng Cai
Jiaxin Zhang
Hui Kong
Wei Sui
Cong Yang
3DPC
36
0
0
22 Mar 2024
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Yufan Chen
Jiaming Zhang
Kunyu Peng
Junwei Zheng
Ruiping Liu
Philip H. S. Torr
Rainer Stiefelhagen
OOD
29
5
0
21 Mar 2024
Pruning for Improved ADC Efficiency in Crossbar-based Analog In-memory Accelerators
Timur Ibrayev
Isha Garg
I. Chakraborty
Kaushik Roy
25
0
0
19 Mar 2024
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
YiXuan Wu
Yizhou Wang
Shixiang Tang
Wenhao Wu
Tong He
Wanli Ouyang
Jian Wu
Philip H. S. Torr
ObjD
VLM
32
18
0
19 Mar 2024
SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras
Yingqi Tang
Zhaotie Meng
Guoliang Chen
Erkang Cheng
3DPC
24
1
0
15 Mar 2024
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Byung-Kwan Lee
Beomchan Park
Chae Won Kim
Yonghyun Ro
MLLM
VLM
45
20
0
12 Mar 2024
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head
Tiancheng Zhao
Peng Liu
Xuan He
Lu Zhang
Kyusong Lee
ObjD
43
8
0
11 Mar 2024
AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
Mingyuan Li
Tong Jia
Hao Wang
Bowen Ma
Shuyang Lin
Da Cai
Dongyue Chen
ViT
39
17
0
07 Mar 2024
RegionGPT: Towards Region Understanding Vision Language Model
Qiushan Guo
Shalini De Mello
Hongxu Yin
Wonmin Byeon
Ka Chun Cheung
Yizhou Yu
Ping Luo
Sifei Liu
VLM
41
34
0
04 Mar 2024
Semi-supervised Open-World Object Detection
Sahal Shaji Mullappilly
Abhishek Singh Gehlot
Rao Muhammad Anwer
Fahad Shahbaz Khan
Hisham Cholakkal
37
4
0
25 Feb 2024
YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Chien-Yao Wang
I-Hau Yeh
Hongpeng Liao
54
1,151
0
21 Feb 2024
EEND-M2F: Masked-attention mask transformers for speaker diarization
Marc Härkönen
Samuel J. Broughton
Lahiru Samarakoon
22
7
0
23 Jan 2024
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
Katherine Crowson
Stefan Andreas Baumann
Alex Birch
Tanishq Mathew Abraham
Daniel Z. Kaplan
Enrico Shippole
26
48
0
21 Jan 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
Chuyang Zhao
Yifan Sun
Wenhao Wang
Qiang Chen
Errui Ding
Yi Yang
Jingdong Wang
MU
33
20
0
08 Jan 2024
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection
Joonhyun Jeong
Geondo Park
Jayeon Yoo
Hyungsik Jung
Heesu Kim
VLM
ObjD
39
10
0
12 Dec 2023
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
Benjamin Kiefer
Lojze Žust
Matej Kristan
J. Pers
Matija Tersek
...
Magdalena Šumunec
Nadir Kapetanović
A. Michel
Wolfgang Gross
Martin Weinmann
22
4
0
23 Nov 2023
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
16
11
0
22 Nov 2023
Sparse4D v3: Advancing End-to-End 3D Detection and Tracking
Xuewu Lin
Zi-Hui Pei
Tianwei Lin
Lichao Huang
Zhizhong Su
23
34
0
20 Nov 2023
GLaMM: Pixel Grounding Large Multimodal Model
H. Rasheed
Muhammad Maaz
Sahal Shaji Mullappilly
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakkal
Rao M. Anwer
Eric Xing
Ming-Hsuan Yang
Fahad S. Khan
MLLM
VLM
38
201
0
06 Nov 2023
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network
Soroush Mehraban
Vida Adeli
Babak Taati
ViT
32
42
0
25 Oct 2023
Multimodal Object Query Initialization for 3D Object Detection
Mathijs R. van Geerenstein
Felicia Ruppel
Klaus C. J. Dietmayer
D. Gavrila
3DPC
30
2
0
16 Oct 2023
Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook
Ming Jin
Qingsong Wen
Yuxuan Liang
Chaoli Zhang
Siqiao Xue
...
Shirui Pan
Vincent S. Tseng
Yu Zheng
Lei Chen
Hui Xiong
AI4TS
SyDa
35
117
0
16 Oct 2023
Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection
Yi Dai
Hao Lang
Kaisheng Zeng
Fei Huang
Yongbin Li
OODD
26
10
0
12 Oct 2023
ViT-A*: Legged Robot Path Planning using Vision Transformer A*
Jianwei Liu
Shirui Lyu
Denis Hadjivelichkov
Valerio Modugno
Dimitrios Kanoulas
32
8
0
11 Oct 2023
Care3D: An Active 3D Object Detection Dataset of Real Robotic-Care Environments
Michael G. Adam
Sebastian Eger
Martin Piccolrovazzi
Maged Iskandar
Joern Vogel
...
Abdeldjallil Naceri
Eckehard G. Steinbach
Alin Albu-Schaeffer
Sami Haddadin
Wolfram Burgard
6
1
0
09 Oct 2023
MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection
Kemal Oksuz
Selim Kuzucu
Tom Joy
P. Dokania
MoE
22
5
0
26 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
69
35
0
22 Sep 2023
Traveling Words: A Geometric Interpretation of Transformers
Raul Molina
22
4
0
13 Sep 2023
Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art
Aref Miri Rekavandi
Shima Rashidi
F. Boussaïd
Stephen Hoefs
Emre Akbas
Bennamoun
ViT
46
23
0
10 Sep 2023
Object Detection for Caries or Pit and Fissure Sealing Requirement in Children's First Permanent Molars
Chenyao Jiang
Shiyao Zhai
Hengrui Song
Yuqing Ma
Yachen Fan
...
Sanyang Han
Runming Wang
Yong Liu
Jianbo Li
Peiwu Qin
22
0
0
31 Aug 2023
Semi-Supervised Semantic Segmentation via Marginal Contextual Information
Moshe Kimhi
Shai Kimhi
Evgenii Zheltonozhskii
Or Litany
Chaim Baskin
26
10
0
26 Aug 2023
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
Sheng-Hsiang Fu
Junkai Yan
Yipeng Gao
Xiaohua Xie
Wei-Shi Zheng
28
6
0
18 Aug 2023
MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition
Y. Pan
Yuguang Yang
Yuheng Huang
Jixun Yao
Jingjing Yin
Yanni Hu
Heng Lu
Lei Ma
Jianjun Zhao
32
5
0
08 Aug 2023
Revisiting DETR Pre-training for Object Detection
Yan Ma
Weicong Liang
Bo-Ying Chen
Yiduo Hao
Bojian Hou
Xiangyu Yue
Chao Zhang
Yuhui Yuan
VLM
ViT
35
4
0
02 Aug 2023
Enhancing Your Trained DETRs with Box Refinement
Yiqun Chen
Qiang Chen
Pei Sun
Shoufa Chen
Jingdong Wang
Jian Cheng
30
2
0
21 Jul 2023
Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions
Dongshuo Yin
Xueting Han
Bin Li
Hao Feng
Jinghua Bai
VPVLM
26
17
0
16 Jun 2023
DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection
Hao Ouyang
34
4
0
15 Jun 2023
Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking
Feng Yan
Weihua Luo
Yujie Zhong
Yiyang Gan
Lin Ma
VOT
38
15
0
22 May 2023
Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer
Hakjin Lee
Minki Song
Jamyoung Koo
Junghoon Seo
37
7
0
12 May 2023
A Strong and Reproducible Object Detector with Only Public Datasets
Tianhe Ren
Jianwei Yang
Siyi Liu
Ailing Zeng
Feng Li
Hao Zhang
Hongyang Li
Zhaoyang Zeng
Lei Zhang
ObjD
33
11
0
25 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
42
132
0
19 Apr 2023
DETRs Beat YOLOs on Real-time Object Detection
Yian Zhao
Wenyu Lv
Shangliang Xu
Jinman Wei
Guanzhong Wang
Qingqing Dang
Yi Liu
Cheng Cui
24
824
0
17 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
49
35
0
03 Apr 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
113
63
0
23 Mar 2023
Consistency-Aware Anchor Pyramid Network for Crowd Localization
Xinyan Liu
Guorong Li
Yuankai Qi
Zhenjun Han
Qingming Huang
Ming-Hsuan Yang
N. Sebe
29
6
0
08 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
24
2
0
06 Dec 2022
Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Dominik Filipiak
Andrzej Zapala
Piotr Tempczyk
A. Fensel
Marek Cygan
ISeg
13
10
0
07 Nov 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
27
25
0
03 Oct 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
46
120
0
26 Jul 2022