Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.15654
Cited By
OpenScene: 3D Scene Understanding with Open Vocabularies
28 November 2022
Songyou Peng
Kyle Genova
ChiyuMaxJiang
Andrea Tagliasacchi
Marc Pollefeys
Thomas Funkhouser
3DPC
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenScene: 3D Scene Understanding with Open Vocabularies"
50 / 285 papers shown
Title
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
Kim Jun-Seong
GeonU Kim
Kim Yu-Ji
Yu-Chun Wang
Jaesung Choe
Tae-Hyun Oh
3DGS
69
1
0
23 Feb 2025
Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field
Wenhao Hu
Wenhao Chai
Shengyu Hao
Xiaotong Cui
Xuexiang Wen
Lei Li
Gaoang Wang
3DV
60
0
0
22 Feb 2025
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Zekun Qi
Wenyao Zhang
Yufei Ding
Runpei Dong
Xinqiang Yu
...
Xin Jin
Kaisheng Ma
Zhizheng Zhang
He Wang
Li Yi
LM&Ro
131
4
0
18 Feb 2025
LeAP: Consistent multi-domain 3D labeling using Foundation Models
Simon Gebraad
Andras Palffy
Holger Caesar
155
1
0
06 Feb 2025
Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation
Rohan Chacko
Nicolai Haeni
Eldar Khaliullin
Lin Sun
Douglas Lee
3DGS
49
1
0
31 Jan 2025
When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis
Ruixuan Zhang
Beichen Wang
Juexiao Zhang
Zilin Bian
Chen Feng
K. Ozbay
45
3
0
17 Jan 2025
OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments
Yujie Tang
Hao Wu
Yinan Deng
Zibo Zheng
Jingchuan Deng
Yufeng Yue
LM&Ro
41
0
0
08 Jan 2025
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Runnan Chen
Zhaoqing Wang
Jiepeng Wang
Yuexin Ma
Mingming Gong
Wenping Wang
Tongliang Liu
3DGS
37
2
0
03 Jan 2025
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Zhangyang Qi
Zhixiong Zhang
Ye Fang
Jiaqi Wang
Hengshuang Zhao
86
7
0
02 Jan 2025
RelationField: Relate Anything in Radiance Fields
Sebastian Koch
Johanna Wald
Mirco Colosi
Narunas Vaskevicius
Pedro Hermosilla
F. Tombari
Timo Ropinski
114
1
0
18 Dec 2024
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
Yi Feng
Yu Han
Xijing Zhang
Tanghui Li
Yanting Zhang
Rui Fan
117
3
0
15 Dec 2024
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
Yue Chen
Xingyu Chen
Anpei Chen
Gerard Pons-Moll
Yuliang Xiu
3DGS
86
3
0
12 Dec 2024
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
Wenting Xu
Viorela Ila
Luping Zhou
Craig T. Jin
72
0
0
07 Dec 2024
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences
Hongyan Zhi
Peihao Chen
Junyan Li
Shuailei Ma
Xinyu Sun
Tianhang Xiang
Yinjie Lei
Mingkui Tan
Chuang Gan
80
3
0
02 Dec 2024
Language Driven Occupancy Prediction
Zhu Yu
Bowen Pang
Lizhe Liu
Runmin Zhang
Qihao Peng
Maochun Luo
Sheng Yang
Mingxia Chen
Si-Yuan Cao
Hui-Liang Shen
95
2
0
25 Nov 2024
ROOT: VLM based System for Indoor Scene Understanding and Beyond
Yonghui Wang
Shi-Yong Chen
Zhenxing Zhou
Siyi Li
Haoran Li
Wengang Zhou
Yiming Li
VLM
67
3
0
24 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
88
1
0
23 Nov 2024
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
Yuncong Yang
Han Yang
Jiachen Zhou
Peihao Chen
Hongxin Zhang
Yilun Du
Chuang Gan
69
0
0
23 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Ziyi Wang
Yijiao Wang
Xumin Yu
Jie Zhou
Jiwen Lu
74
0
0
20 Nov 2024
Find Any Part in 3D
Ziqi Ma
Yisong Yue
Georgia Gkioxari
3DPC
115
3
0
20 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
H. Kasaei
Tingguang Li
M. Cao
LM&Ro
72
3
0
18 Nov 2024
SA3DIP: Segment Any 3D Instance with Potential 3D Priors
Xi Yang
Xu Gu
Xingyilang Yin
Xinbo Gao
47
0
0
06 Nov 2024
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare
Souren Pashangpour
Goldie Nejat
LM&MA
53
7
0
05 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Timing Yang
Yuanliang Ju
Li Yi
3DPC
34
3
0
31 Oct 2024
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
Qianxu Wang
Congyue Deng
Tyler Ga Wei Lum
Yuanpei Chen
Yaodong Yang
Jeannette Bohg
Yixin Zhu
Leonidas J. Guibas
47
4
0
30 Oct 2024
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
Zhaochong An
Guolei Sun
Yun Liu
Runjia Li
Min Wu
Ming-Ming Cheng
Ender Konukoglu
Serge Belongie
64
4
0
29 Oct 2024
Scene Graph Generation with Role-Playing Large Language Models
Guikun Chen
Jin Li
Wenguan Wang
VLM
48
5
0
20 Oct 2024
Semantically Safe Robot Manipulation: From Semantic Scene Understanding to Motion Safeguards
Lukas Brunke
Yanni Zhang
Ralf Romer
Jack Naimer
Nikola Staykov
Siqi Zhou
Angela P. Schoellig
59
4
0
19 Oct 2024
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
Runsen Xu
Zhiwei Huang
Tai Wang
Yuxiao Chen
Jiangmiao Pang
Dahua Lin
VGen
44
11
0
17 Oct 2024
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
69
4
0
17 Oct 2024
SAM-Guided Masked Token Prediction for 3D Scene Understanding
Zhimin Chen
Liang Yang
Yingwei Li
Longlong Jing
Bing Li
37
3
0
16 Oct 2024
Flex: End-to-End Text-Instructed Visual Navigation from Foundation Model Features
Makram Chahine
Alex Quach
Alaa Maalouf
Tsun-Hsuan Wang
Daniela Rus
26
0
0
16 Oct 2024
Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Zhijie Yan
Shufei Li
Zihan Wang
Lixiu Wu
Han Wang
Jun Zhu
Lijiang Chen
Jihong Liu
39
1
0
15 Oct 2024
Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AI
Chengyuan Xu
Radha Kumaran
Noah Stier
Kangyou Yu
Tobias Höllerer
40
0
0
06 Oct 2024
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding
Shenghao Li
40
1
0
06 Oct 2024
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
Ayesha Ishaq
Mohamed El Amine Boudjoghra
Jean Lahoud
Fahad Shahbaz Khan
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
117
1
0
02 Oct 2024
OpenObject-NAV: Open-Vocabulary Object-Oriented Navigation Based on Dynamic Carrier-Relationship Scene Graph
Yujie Tang
Meiling Wang
Yinan Deng
Zibo Zheng
Jiagui Zhong
Yufeng Yue
36
0
0
27 Sep 2024
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Ayca Takmaz
Alexandros Delitzas
R. Sumner
Francis Engelmann
Johanna Wald
Federico Tombari
86
11
0
27 Sep 2024
ChatCam: Empowering Camera Control through Conversational AI
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
VGen
33
2
0
25 Sep 2024
Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models
Mike Zhang
Kaixian Qu
Vaishakh Patil
Cesar Cadena
Marco Hutter
LM&Ro
3DV
41
4
0
23 Sep 2024
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Yifan Xu
Ziming Luo
Qianwei Wang
Vineet Kamat
Carol Menassa
3DV
3DPC
33
0
0
16 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-xiong Wang
78
15
0
05 Sep 2024
Multi-modal Situated Reasoning in 3D Scenes
Xiongkun Linghu
Jiangyong Huang
Xuesong Niu
Xiaojian Ma
Baoxiong Jia
Siyuan Huang
39
12
0
04 Sep 2024
Image-to-Lidar Relational Distillation for Autonomous Driving Data
Anas Mahmoud
Ali Harakeh
Steven Waslander
29
0
0
01 Sep 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang
Zhicheng Wang
Howard Zhou
Soham Ghosh
Danushen Gnanapragasam
Varun Jampani
Hao Su
Leonidas J. Guibas
DD
51
5
0
30 Aug 2024
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation
Muhammad Rameez Ur Rahman
Piero Simonetto
Anna Polato
Francesco Pasti
Luca Tonin
Sebastiano Vascon
3DPC
41
0
0
25 Aug 2024
Open-Ended 3D Point Cloud Instance Segmentation
Phuc D. A. Nguyen
Minh Luu
Anh Tran
Cuong Pham
Khoi Nguyen
3DPC
56
1
0
21 Aug 2024
Positional Prompt Tuning for Efficient 3D Representation Learning
Shaochen Zhang
Zekun Qi
Runpei Dong
Xiuxiu Bai
Xing Wei
39
4
0
21 Aug 2024
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
ISeg
VLM
69
3
0
20 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
64
1
0
20 Aug 2024
Previous
1
2
3
4
5
6
Next