Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.19580
Cited By
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
28 March 2024
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation"
26 / 26 papers shown
Title
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Yansen Wang
Ming-Hsuan Yang
VLM
101
2
0
26 Nov 2024
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
39
33
0
04 Oct 2023
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Yuheng Lu
Chenfeng Xu
Xi Wei
Xiaodong Xie
Masayoshi Tomizuka
Kurt Keutzer
Shanghang Zhang
3DPC
39
53
0
03 Apr 2023
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD
VLM
72
153
0
07 Jul 2022
GLIPv2: Unifying Localization and Vision-Language Understanding
Haotian Zhang
Pengchuan Zhang
Xiaowei Hu
Yen-Chun Chen
Liunian Harold Li
Xiyang Dai
Lijuan Wang
Lu Yuan
Lei Li
Jianfeng Gao
ObjD
VLM
36
295
0
12 Jun 2022
Unifying Voxel-based Representation with Transformer for 3D Object Detection
Yanwei Li
Yilun Chen
Xiaojuan Qi
Zeming Li
Jian Sun
Jiaya Jia
ViT
45
249
0
01 Jun 2022
BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework
Tingting Liang
Hongwei Xie
Kaicheng Yu
Zhongyu Xia
Zhiwei Lin
Yongtao Wang
T. Tang
Bing Wang
Zhi Tang
3DPC
44
402
0
27 May 2022
Multimodal Token Fusion for Vision Transformers
Yikai Wang
Xinghao Chen
Lele Cao
Wen-bing Huang
Gang Hua
Yunhe Wang
ViT
56
170
0
19 Apr 2022
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
89
1,269
0
31 Mar 2022
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation
Zongyang Ma
Guan Luo
Jin Gao
Liang Li
Yuxin Chen
Shaoru Wang
Congxuan Zhang
Weiming Hu
VLM
ObjD
94
82
0
20 Mar 2022
Detecting Twenty-thousand Classes using Image-level Supervision
Xingyi Zhou
Rohit Girdhar
Armand Joulin
Phillip Krahenbuhl
Ishan Misra
CLIP
VLM
79
602
0
07 Jan 2022
RegionCLIP: Region-based Language-Image Pretraining
Yiwu Zhong
Jianwei Yang
Pengchuan Zhang
Chunyuan Li
Noel Codella
...
Luowei Zhou
Xiyang Dai
Lu Yuan
Yin Li
Jianfeng Gao
VLM
CLIP
81
568
0
16 Dec 2021
FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
D. Rukhovich
Anna Vorontsova
Anton Konushin
3DPC
78
113
0
01 Dec 2021
An End-to-End Transformer Model for 3D Object Detection
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC
ViT
48
477
0
16 Sep 2021
Geometry Uncertainty Projection Network for Monocular 3D Object Detection
Yan Lu
Xinzhu Ma
Lei Yang
Tianzhu Zhang
Yating Liu
Qi Chu
Junjie Yan
Wanli Ouyang
MDE
32
214
0
29 Jul 2021
Objects are Different: Flexible Monocular 3D Object Detection
Yunpeng Zhang
Jiwen Lu
Jie Zhou
3DPC
46
256
0
06 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
372
3,778
0
11 Feb 2021
Open-Vocabulary Object Detection Using Captions
Alireza Zareian
Kevin Dela Rosa
Derek Hao Hu
Shih-Fu Chang
VLM
ObjD
87
423
0
20 Nov 2020
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
C. Qi
Xinlei Chen
Or Litany
Leonidas Guibas
3DPC
208
249
0
29 Jan 2020
Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection
Benjin Zhu
Zhengkai Jiang
Xiangxin Zhou
Zeming Li
Gang Yu
3DPC
181
487
0
26 Aug 2019
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
66
1,352
0
08 Aug 2019
Deep Hough Voting for 3D Object Detection in Point Clouds
C. Qi
Or Litany
Kaiming He
Leonidas Guibas
3DPC
62
1,275
0
21 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
210
5,653
0
26 Mar 2019
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
169
4,001
0
14 Feb 2017
Feature Pyramid Networks for Object Detection
Nayeon Lee
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
364
21,951
0
09 Dec 2016
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
327
61,900
0
04 Jun 2015
1