Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.03752
Cited By
V3Det: Vast Vocabulary Visual Detection Dataset
7 April 2023
Jiaqi Wang
Pan Zhang
Tao Chu
Yuhang Cao
Yujie Zhou
Tong Wu
Bin Wang
Conghui He
Dahua Lin
VLM
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"V3Det: Vast Vocabulary Visual Detection Dataset"
50 / 68 papers shown
Title
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Feng-Long Xie
Yongchao Xu
ObjD
94
0
0
24 Mar 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM
VLM
LRM
285
55
0
03 Jan 2025
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
...
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
MLLM
VLM
172
2
0
20 Dec 2024
Fractal Calibration for long-tailed object detection
Konstantinos Panagiotis Alexandridis
Ismail Elezi
Jiankang Deng
Anh H. Nguyen
Shan Luo
383
0
0
15 Oct 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
95
3
0
28 Jun 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
VLM
129
39
0
29 Mar 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
211
9
0
22 Feb 2024
BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
Tao Chu
Pan Zhang
Qiong Liu
Jiaqi Wang
96
8
0
01 Jun 2023
Dense Distinct Query for End-to-End Object Detection
Shilong Zhang
Wang xinjiang
Jiaqi Wang
Jiangmiao Pang
Chengqi Lyu
Wenwei Zhang
Ping Luo
Kai-xiang Chen
86
132
0
22 Mar 2023
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
168
710
0
14 Nov 2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection
Shiyu Zhao
Zhixing Zhang
S. Schulter
Long Zhao
Vijay Kumar B.G
Anastasis Stathopoulos
Manmohan Chandraker
Dimitris N. Metaxas
VLM
ObjD
80
102
0
18 Jul 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD
VLM
93
154
0
07 Jul 2022
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
Chunyuan Li
Haotian Liu
Liunian Harold Li
Pengchuan Zhang
J. Aneja
...
Ping Jin
Houdong Hu
Zicheng Liu
Yong Jae Lee
Jianfeng Gao
63
148
0
19 Apr 2022
PromptDet: Towards Open-vocabulary Detection using Uncurated Images
Chengjian Feng
Yujie Zhong
Zequn Jie
Xiangxiang Chu
Haibing Ren
Xiaolin K. Wei
Weidi Xie
Lin Ma
VPVLM
VLM
34
155
0
30 Mar 2022
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
Yu Du
Fangyun Wei
Zihe Zhang
Miaojing Shi
Yue Gao
Guoqi Li
VPVLM
VLM
66
332
0
28 Mar 2022
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Likun Cai
Zhi-Li Zhang
Yi Zhu
Li Zhang
Mu Li
Xiangyang Xue
VLM
ObjD
74
41
0
24 Mar 2022
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
Yuanhan Zhang
Qi Sun
Yichun Zhou
Zexin He
Zhen-fei Yin
Kunze Wang
Lu Sheng
Yu Qiao
Jing Shao
Ziwei Liu
ObjD
VLM
65
19
0
15 Mar 2022
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
157
1,434
0
07 Mar 2022
Detecting Twenty-thousand Classes using Image-level Supervision
Xingyi Zhou
Rohit Girdhar
Armand Joulin
Phillip Krahenbuhl
Ishan Misra
CLIP
VLM
97
614
0
07 Jan 2022
RegionCLIP: Region-based Language-Image Pretraining
Yiwu Zhong
Jianwei Yang
Pengchuan Zhang
Chunyuan Li
Noel Codella
...
Luowei Zhou
Xiyang Dai
Lu Yuan
Yin Li
Jianfeng Gao
VLM
CLIP
130
575
0
16 Dec 2021
Grounded Language-Image Pre-training
Liunian Harold Li
Pengchuan Zhang
Haotian Zhang
Jianwei Yang
Chunyuan Li
...
Lu Yuan
Lei Zhang
Lei Li
Kai-Wei Chang
Jianfeng Gao
ObjD
VLM
116
1,060
0
07 Dec 2021
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
M. Gao
Chen Xing
Juan Carlos Niebles
Junnan Li
Ran Xu
Wenhao Liu
Caiming Xiong
VLM
ObjD
75
86
0
18 Nov 2021
Dynamic Head: Unifying Object Detection Heads with Attentions
Xiyang Dai
Yinpeng Chen
Bin Xiao
Dongdong Chen
Mengchen Liu
Lu Yuan
Lei Zhang
58
581
0
15 Jun 2021
Robust Mutual Learning for Semi-supervised Semantic Segmentation
Pan Zhang
Bo Zhang
Ting Zhang
Dong Chen
Fang Wen
71
17
0
01 Jun 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Nayeon Lee
Weicheng Kuo
Huayu Chen
VLM
ObjD
267
915
0
28 Apr 2021
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
ObjD
VLM
165
881
0
26 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
432
21,392
0
25 Mar 2021
Probabilistic two-stage detection
Xingyi Zhou
V. Koltun
Philipp Krahenbuhl
ObjD
84
225
0
12 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
861
29,341
0
26 Feb 2021
Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation
Pan Zhang
Bo Zhang
Ting Zhang
Dong Chen
Yong Wang
Fang Wen
170
496
0
26 Jan 2021
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Golnaz Ghiasi
Huayu Chen
A. Srinivas
Rui Qian
Nayeon Lee
E. D. Cubuk
Quoc V. Le
Barret Zoph
ISeg
286
990
0
13 Dec 2020
CARAFE++: Unified Content-Aware ReAssembly of FEatures
Jiaqi Wang
Kai-xiang Chen
Rui Xu
Ziwei Liu
Chen Change Loy
Dahua Lin
46
56
0
07 Dec 2020
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
Pei Sun
Rufeng Zhang
Yi Jiang
Tao Kong
Chenfeng Xu
...
Masayoshi Tomizuka
Lei Li
Zehuan Yuan
Changhu Wang
Ping Luo
ObjD
93
1,094
0
25 Nov 2020
Open-Vocabulary Object Detection Using Captions
Alireza Zareian
Kevin Dela Rosa
Derek Hao Hu
Shih-Fu Chang
VLM
ObjD
120
429
0
20 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
593
40,961
0
22 Oct 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
204
5,068
0
08 Oct 2020
Seesaw Loss for Long-Tailed Instance Segmentation
Jiaqi Wang
Wenwei Zhang
Yuhang Zang
Yuhang Cao
Jiangmiao Pang
Tao Gong
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
Dahua Lin
61
240
0
23 Aug 2020
TIDE: A General Toolbox for Identifying Object Detection Errors
Daniel Bolya
Sean Foley
James Hays
Judy Hoffman
76
194
0
18 Aug 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
372
13,025
0
26 May 2020
Conditional Convolutions for Instance Segmentation
Zhi Tian
Chunhua Shen
Hao Chen
ISeg
224
607
0
12 Mar 2020
SOLO: Segmenting Objects by Locations
Xinlong Wang
Tao Kong
Chunhua Shen
Yuning Jiang
Lei Li
SSeg
ISeg
66
675
0
10 Dec 2019
Side-Aware Boundary Localization for More Precise Object Detection
Jiaqi Wang
Wenwei Zhang
Yuhang Cao
Kai-xiang Chen
Jiangmiao Pang
Tao Gong
Jianping Shi
Chen Change Loy
Dahua Lin
ObjD
59
138
0
09 Dec 2019
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection
Shifeng Zhang
Cheng Chi
Yongqiang Yao
Zhen Lei
Stan Z. Li
ObjD
151
1,545
0
05 Dec 2019
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
100
1,367
0
08 Aug 2019
Cascade R-CNN: High Quality Object Detection and Instance Segmentation
Zhaowei Cai
Nuno Vasconcelos
ObjD
79
1,355
0
24 Jun 2019
MMDetection: Open MMLab Detection Toolbox and Benchmark
Kai-xiang Chen
Jiaqi Wang
Jiangmiao Pang
Yuhang Cao
Yu Xiong
...
Jingdong Wang
Jianping Shi
Wanli Ouyang
Chen Change Loy
Dahua Lin
VOS
135
2,866
0
17 Jun 2019
CARAFE: Content-Aware ReAssembly of FEatures
Jiaqi Wang
Kai-xiang Chen
Rui Xu
Ziwei Liu
Chen Change Loy
Dahua Lin
83
571
0
06 May 2019
CenterNet: Keypoint Triplets for Object Detection
Kaiwen Duan
S. Bai
Lingxi Xie
H. Qi
Qingming Huang
Q. Tian
NoLa
109
2,692
0
17 Apr 2019
FCOS: Fully Convolutional One-Stage Object Detection
Zhi Tian
Chunhua Shen
Hao Chen
Tong He
ObjD
119
5,006
0
02 Apr 2019
Mask Scoring R-CNN
Zhaojin Huang
Lichao Huang
Yongchao Gong
Chang Huang
Xinggang Wang
ISeg
SSeg
65
913
0
01 Mar 2019
1
2
Next