Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2310.02960
Cited By
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
Neural Information Processing Systems (NeurIPS), 2023
4 October 2023
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection"
39 / 39 papers shown
Title
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
Yunze Man
S. S. Wang
Guowen Zhang
Johan Bjorck
Zhiqi Li
Liang-Yan Gui
Jim Fan
Jan Kautz
Yu Wang
Zhiding Yu
28
0
0
25 Nov 2025
Zoo3D: Zero-Shot 3D Object Detection at Scene Level
Andrey Lemeshko
Bulat Gabdullin
Nikita Drozdov
Anton Konushin
D. Rukhovich
Maksim Kolodiazhnyi
3DPC
ObjD
VLM
202
0
0
25 Nov 2025
Towards 3D Objectness Learning in an Open World
Taichi Liu
Zhenyu Wang
Ruofeng Liu
Guang Wang
Desheng Zhang
3DPC
VLM
77
0
0
20 Oct 2025
Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction
Chi Yan
Dan Xu
3DGS
136
0
0
06 Oct 2025
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
Peng-Hao Hsu
Ke Zhang
Fu-En Wang
Tao Tu
Ming-feng Li
Yu-Lun Liu
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
3DPC
VLM
64
1
0
27 Aug 2025
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Xinhao Xiang
Kuan-Chuan Peng
Suhas Lohit
Michael Jeffrey Jones
Jiawei Zhang
3DPC
106
0
0
22 Aug 2025
BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion
Yuqing Lan
Chenyang Zhu
Zhirui Gao
JIazhao Zhang
Yihan Cao
Renjiao Yi
Yijie Wang
Kai Xu
3DPC
335
0
0
18 Jun 2025
OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model
Kunshen Zhang
LRM
141
0
0
05 Jun 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
261
2
0
26 Mar 2025
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Adrian Chow
Evelien Riddell
Yimu Wang
Sean Sedwards
Krzysztof Czarnecki
3DPC
135
0
0
09 Mar 2025
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning
Shuangzhi Li
Junlong Shen
Lei Ma
Xingyu Li
3DPC
202
0
0
08 Mar 2025
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
Computer Vision and Pattern Recognition (CVPR), 2025
Hanxun Yu
Wentong Li
Song Wang
Jintai Chen
Jianke Zhu
3DV
LRM
304
23
0
01 Mar 2025
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jin-Cheng Jhang
Tao Tu
Fu-En Wang
Ke Zhang
Min Sun
Cheng-Hao Kuo
234
4
0
16 Dec 2024
Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop
Zhaofang Qian
Abolfazl Sharifi
Tucker Carroll
Ser-Nam Lim
VGen
259
0
0
26 Nov 2024
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Longji Xu
Ming-Hsuan Yang
VLM
369
5
0
26 Nov 2024
Open Vocabulary Monocular 3D Object Detection
Jin Yao
Hao Gu
Xuweiyi Chen
Jiayun Wang
Zezhou Cheng
ObjD
VLM
366
9
0
25 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
317
1
0
23 Nov 2024
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation
Neural Information Processing Systems (NeurIPS), 2024
Youpeng Wen
Junfan Lin
Yinlin Zhu
Jiawei Han
Hang Xu
Shen Zhao
Xiaodan Liang
VGen
DiffM
223
25
0
14 Nov 2024
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
Neural Information Processing Systems (NeurIPS), 2024
Xi Yang
Xu Gu
Xingyilang Yin
Xinbo Gao
189
1
0
06 Nov 2024
One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection
Neural Information Processing Systems (NeurIPS), 2024
Zhenyu Wang
Yali Li
Hengshuang Zhao
Shengjin Wang
3DPC
225
6
0
03 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Neural Information Processing Systems (NeurIPS), 2024
Timing Yang
Yuanliang Ju
Li Yi
3DPC
224
11
0
31 Oct 2024
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection
Yang Cao
Yuanliang Jv
Dan Xu
3DGS
177
7
0
02 Oct 2024
OW-Rep: Open World Object Detection with Instance Representation Learning
Sunoh Lee
Minsik Jeon
Jihong Min
Junwon Seo
ObjD
1.1K
1
0
24 Sep 2024
UNIT: Unifying Image and Text Recognition in One Vision Encoder
Neural Information Processing Systems (NeurIPS), 2024
Yi Zhu
Yanpeng Zhou
Chunwei Wang
Yang Cao
Jianhua Han
Lu Hou
Hang Xu
ViT
VLM
205
9
0
06 Sep 2024
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation
Muhammad Rameez Ur Rahman
Piero Simonetto
Anna Polato
Francesco Pasti
Luca Tonin
Sebastiano Vascon
3DPC
129
1
0
25 Aug 2024
Multimodal Foundational Models for Unsupervised 3D General Obstacle Detection
Tamás Matuszka
Peter Hajas
Dávid Szeghy
146
0
0
22 Aug 2024
HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation
Zezeng Li
Weimin Wang
WenHai Li
Na Lei
Na Lei
Xianfeng Gu
OT
DiffM
260
0
0
19 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
180
10
0
13 Jul 2024
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng
Yan Bai
Chen Gao
Lirong Yang
Fei Xia
Beipeng Mu
Xiaofei Wang
Si Liu
ObjD
171
7
0
12 Jul 2024
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image
Pengkun Jiao
Na Zhao
Yue Yu
Yu-Gang Jiang
VLM
ObjD
165
10
0
07 Jul 2024
Towards Open-set Camera 3D Object Detection
Zhuolin He
Xinrun Li
Heng Gao
Jiachen Tang
Shoumeng Qiu
Wenfu Wang
Lvjian Lu
Xuchong Qiu
Xiangyang Xue
Jian Pu
3DPC
198
1
0
25 Jun 2024
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
Y. Wu
Jiarui Meng
Haijie Li
Chenming Wu
Yahao Shi
...
Chen Zhao
Haocheng Feng
Errui Ding
Jingdong Wang
Jian Zhang
3DGS
3DPC
168
75
0
04 Jun 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
240
13
0
02 Jun 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
225
14
0
28 Mar 2024
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
P. Nguyen
T.D. Ngo
E. Kalogerakis
Chuang Gan
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
ISeg
348
87
0
17 Dec 2023
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
European Conference on Computer Vision (ECCV), 2023
Hu Zhang
Jianhua Xu
Tao Tang
Haiyang Sun
Xin Yu
Zi Huang
Kaicheng Yu
ObjD
3DPC
177
22
0
12 Dec 2023
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Computer Vision and Pattern Recognition (CVPR), 2023
Sijin Chen
Xin Chen
C. Zhang
Mingsheng Li
Gang Yu
Hao Fei
Erik Cambria
Jiayuan Fan
Tao Chen
MLLM
253
156
0
30 Nov 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Chaoyang Zhu
Long Chen
ObjD
VLM
371
62
0
18 Jul 2023
Towards Open Vocabulary Learning: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Guohao Li
Dacheng Tao
ObjD
VLM
306
210
0
28 Jun 2023
1