ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.11876
  4. Cited By
Open-Vocabulary DETR with Conditional Matching
v1v2 (latest)

Open-Vocabulary DETR with Conditional Matching

22 March 2022
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
    ObjDVLM
ArXiv (abs)PDFHTML

Papers citing "Open-Vocabulary DETR with Conditional Matching"

50 / 158 papers shown
Title
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Fangyi Chen
Han Zhang
Zhantao Yang
Hao Chen
Kai Hu
Marios Savvides
ObjDVLM
89
5
0
30 May 2024
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and
  Open-World Unknown Objects Supervision
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Junjie Wang
Bin Chen
Bin Kang
Yulin Li
Yichi Chen
Weizhi Xian
Huifeng Chang
VLMObjD
84
7
0
28 May 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a
  Game-Theoretic View
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGeLRM
85
2
0
27 May 2024
LLM-Optic: Unveiling the Capabilities of Large Language Models for
  Universal Visual Grounding
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Haoyu Zhao
Wenhang Ge
Ying-Cong Chen
ObjDMLLMVLM
90
5
0
27 May 2024
Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance
  for Low-Light Image Enhancement
Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement
Igor Morawski
Kai He
Shusil Dangi
Winston H. Hsu
VLM
106
4
0
19 May 2024
Open-Vocabulary Spatio-Temporal Action Detection
Open-Vocabulary Spatio-Temporal Action Detection
Tao Wu
Shuqiu Ge
Jie Qin
Gangshan Wu
Limin Wang
ObjD
75
7
0
17 May 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
116
3
0
16 May 2024
Open-Vocabulary Object Detection via Neighboring Region Attention
  Alignment
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment
Sunyuan Qiang
Xianfei Li
Yanyan Liang
Wenlong Liao
Tao He
Pai Peng
ObjD
86
0
0
14 May 2024
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
Ye Mao
Junpeng Jing
K. Mikolajczyk
VLM
52
3
0
25 Apr 2024
DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with
  Retrieval Guidelines
DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines
Xin Jiang
Hao Tang
Rui Yan
Jinhui Tang
Zechao Li
82
5
0
24 Apr 2024
ChEX: Interactive Localization and Region Description in Chest X-rays
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Muller
Georgios Kaissis
Daniel Rueckert
88
5
0
24 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image
  Segmentation
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
91
9
0
18 Apr 2024
OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection
  and Discovery
OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery
Matthew J. Inkawhich
Nathan Inkawhich
Hao Yang
Jingyang Zhang
Randolph Linderman
Yiran Chen
ObjD
103
0
0
16 Apr 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
Lewei Yao
Renjie Pi
Jianhua Han
Xiaodan Liang
Hang Xu
Wei Zhang
Zhenguo Li
Dan Xu
VLMObjD
99
26
0
14 Apr 2024
Retrieval-Augmented Open-Vocabulary Object Detection
Retrieval-Augmented Open-Vocabulary Object Detection
Jooyeon Kim
Eulrang Cho
Sehyung Kim
Hyunwoo J. Kim
VLMObjD
109
8
0
08 Apr 2024
3D-COCO: extension of MS-COCO dataset for image detection and 3D
  reconstruction modules
3D-COCO: extension of MS-COCO dataset for image detection and 3D reconstruction modules
Maxence Bideaux
Alice Phe
Mohamed Chaouch
B. Luvison
Q. C. Pham
ISeg3DV
80
0
0
08 Apr 2024
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Fanjie Kong
Yanbei Chen
Jiarui Cai
Davide Modolo
VLMObjD
67
7
0
07 Apr 2024
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Jienneg Chen
Qihang Yu
Xiaohui Shen
Alan Yuille
Liang-Chieh Chen
3DVVLM
109
29
0
02 Apr 2024
Open-Vocabulary Object Detectors: Robustness Challenges under
  Distribution Shifts
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts
Prakash Chandra Chhipa
Kanjar De
Meenakshi Subhash Chippa
Rajkumar Saini
Marcus Liwicki
ObjDVLM
82
1
0
01 Apr 2024
Open-Set Recognition in the Age of Vision-Language Models
Open-Set Recognition in the Age of Vision-Language Models
Dimity Miller
Niko Sünderhauf
Alex Kenna
Keita Mason
VLM
66
6
0
25 Mar 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Qing Jiang
Feng Li
Zhaoyang Zeng
Tianhe Ren
Shilong Liu
Lei Zhang
VLM
114
44
0
21 Mar 2024
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban
  Environments
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
Djamahl Etchegaray
Zi Huang
Tatsuya Harada
Yadan Luo
104
9
0
20 Mar 2024
vid-TLDR: Training Free Token merging for Light-weight Video Transformer
vid-TLDR: Training Free Token merging for Light-weight Video Transformer
Joonmyung Choi
Sanghyeok Lee
Jaewon Chu
Minhyuk Choi
Hyunwoo J. Kim
MoMeViT
112
17
0
20 Mar 2024
As Firm As Their Foundations: Can open-sourced foundation models be used
  to create adversarial examples for downstream tasks?
As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Anjun Hu
Jindong Gu
Francesco Pinto
Konstantinos Kamnitsas
Philip Torr
AAMLSILM
86
5
0
19 Mar 2024
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of
  MLLM
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
YiXuan Wu
Yizhou Wang
Shixiang Tang
Wenhao Wu
Tong He
Wanli Ouyang
Jian Wu
Philip Torr
ObjDVLM
104
22
0
19 Mar 2024
Generative Region-Language Pretraining for Open-Ended Object Detection
Generative Region-Language Pretraining for Open-Ended Object Detection
Chuang Lin
Yi Jiang
Zhuang Li
Zehuan Yuan
Jianfei Cai
ObjDVLM
86
20
0
15 Mar 2024
Open-Vocabulary Object Detection with Meta Prompt Representation and
  Instance Contrastive Optimization
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization
Zhao Wang
Aoxue Li
Fengwei Zhou
Zhenguo Li
Qi Dou
ObjDVLM
126
2
0
14 Mar 2024
Exploring Robust Features for Few-Shot Object Detection in Satellite
  Imagery
Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery
Xavier Bou
Gabriele Facciolo
R. G. V. Gioi
Jean-Michel Morel
T. Ehret
ObjD
86
3
0
08 Mar 2024
Enhancing Embodied Object Detection through Language-Image Pre-training
  and Implicit Object Memory
Enhancing Embodied Object Detection through Language-Image Pre-training and Implicit Object Memory
N. H. Chapman
Feras Dayoub
Will N. Browne
Chris Lehnert
ObjDVLMLM&Ro
65
1
0
06 Feb 2024
YOLO-World: Real-Time Open-Vocabulary Object Detection
YOLO-World: Real-Time Open-Vocabulary Object Detection
Tianheng Cheng
Lin Song
Yixiao Ge
Wenyu Liu
Xinggang Wang
Ying Shan
VLMObjD
128
301
0
30 Jan 2024
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD
  Generalization
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization
Yuhang Zang
Hanlin Goh
Josh Susskind
Chen Huang
VLM
115
13
0
29 Jan 2024
OMG-Seg: Is One Model Good Enough For All Segmentation?
OMG-Seg: Is One Model Good Enough For All Segmentation?
Xiangtai Li
Haobo Yuan
Wei Li
Henghui Ding
Size Wu
Wenwei Zhang
Yining Li
Kai Chen
Chen Change Loy
VLMMLLMViT
150
64
0
18 Jan 2024
APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning
APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning
Guiming Cao
Kaize Shi
Hong Fu
Huaiwen Zhang
Guandong Xu
VLM
75
2
0
12 Jan 2024
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion
Minglong Xue
Jinhong He
Wenhai Wang
Mingliang Zhou
86
14
0
08 Jan 2024
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes
  Interactively
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
Haobo Yuan
Xiangtai Li
Chong Zhou
Yining Li
Kai Chen
Chen Change Loy
VLM
118
51
0
05 Jan 2024
Leveraging Open-Vocabulary Diffusion to Camouflaged Instance
  Segmentation
Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation
Tuan-Anh Vu
Duc Thanh Nguyen
Qing Guo
Binh-Son Hua
N. Chung
Ivor W. Tsang
Sai-Kit Yeung
DiffM
83
3
0
29 Dec 2023
Weakly Supervised Open-Vocabulary Object Detection
Weakly Supervised Open-Vocabulary Object Detection
Jianghang Lin
Yunhang Shen
Bingquan Wang
Shaohui Lin
Ke Li
Liujuan Cao
WSOD
85
9
0
19 Dec 2023
CLIM: Contrastive Language-Image Mosaic for Region Representation
CLIM: Contrastive Language-Image Mosaic for Region Representation
Size Wu
Wenwei Zhang
Lumin Xu
Sheng Jin
Wentao Liu
Chen Change Loy
ObjDVLM
104
15
0
18 Dec 2023
Understanding the Multi-modal Prompts of the Pre-trained Vision-Language
  Model
Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model
Shuailei Ma
Chen-Wei Xie
Ying-yu Wei
Siyang Sun
Jiaqi Fan
Xiaoyi Bao
Yuxin Guo
Yun Zheng
VLMVPVLM
76
2
0
18 Dec 2023
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
P. Nguyen
T.D. Ngo
E. Kalogerakis
Chuang Gan
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
ISeg
154
55
0
17 Dec 2023
LAMM: Label Alignment for Multi-Modal Prompt Learning
LAMM: Label Alignment for Multi-Modal Prompt Learning
Jingsheng Gao
Jiacheng Ruan
Suncheng Xiang
Zefang Yu
Ke Ji
Mingye Xie
Ting Liu
Yuzhuo Fu
MLLMVLMVPVLM
51
18
0
13 Dec 2023
Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic
  Segmentation
Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Yuanbin Wang
Shaofei Huang
Yulu Gao
Zhen Wang
Rui Wang
Kehua Sheng
Bo Zhang
Si Liu
VLM
80
14
0
12 Dec 2023
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object
  Detection
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Hu Zhang
Jianhua Xu
Tao Tang
Haiyang Sun
Xin Yu
Zi Huang
Kaicheng Yu
ObjD3DPC
81
12
0
12 Dec 2023
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object
  Detection
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection
Sunghun Kang
Junbum Cha
Jonghwan Mun
Byungseok Roh
Chang D. Yoo
VLMObjD
82
1
0
04 Dec 2023
APoLLo: Unified Adapter and Prompt Learning for Vision Language Models
APoLLo: Unified Adapter and Prompt Learning for Vision Language Models
Sanjoy Chowdhury
Sayan Nag
Dinesh Manocha
VLM
75
19
0
04 Dec 2023
Language-conditioned Detection Transformer
Language-conditioned Detection Transformer
Jang Hyun Cho
Philipp Krahenbuhl
VLMObjD
95
1
0
29 Nov 2023
Active Open-Vocabulary Recognition: Let Intelligent Moving Mitigate CLIP
  Limitations
Active Open-Vocabulary Recognition: Let Intelligent Moving Mitigate CLIP Limitations
Lei Fan
Jianxiong Zhou
Xiaoying Xing
Ying Wu
VLM
80
4
0
28 Nov 2023
Visual In-Context Prompting
Visual In-Context Prompting
Feng Li
Qing Jiang
Hao Zhang
Tianhe Ren
Shilong Liu
...
Hongyang Li
Chun-yue Li
Jianwei Yang
Lei Zhang
Jianfeng Gao
VLMLRMMLLM
96
36
0
22 Nov 2023
Point, Segment and Count: A Generalized Framework for Object Counting
Point, Segment and Count: A Generalized Framework for Object Counting
Zhizhong Huang
Mingliang Dai
Yi Zhang
Junping Zhang
Hongming Shan
85
21
0
21 Nov 2023
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated
  Student-Teacher Learning
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning
Yan Li
Weiwei Guo
Xue Yang
Ning Liao
Dunyun He
Jiaqi Zhou
Wenxian Yu
ObjDVLM
87
10
0
20 Nov 2023
Previous
1234
Next