Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.15654
Cited By
OpenScene: 3D Scene Understanding with Open Vocabularies
28 November 2022
Songyou Peng
Kyle Genova
ChiyuMaxJiang
Andrea Tagliasacchi
Marc Pollefeys
Thomas Funkhouser
3DPC
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenScene: 3D Scene Understanding with Open Vocabularies"
50 / 285 papers shown
Title
Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
Tri Ton
Ji Woo Hong
Soohwan Eom
Jun Yeop Shim
Junyeong Kim
Chang D. Yoo
3DPC
ISeg
47
2
0
16 Aug 2024
SceneGPT: A Language Model for 3D Scene Understanding
Shivam Chandhok
LRM
39
4
0
13 Aug 2024
Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection
Christian Fruhwirth-Reisinger
Wei Lin
Dušan Malić
Horst Bischof
Horst Possegger
3DPC
36
1
0
07 Aug 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
48
24
0
29 Jul 2024
MILAN: Milli-Annotations for Lidar Semantic Segmentation
Nermin Samet
Gilles Puy
Oriane Siméoni
Renaud Marlet
3DPC
32
0
0
22 Jul 2024
Foundation Models for Autonomous Robots in Unstructured Environments
Hossein Naderi
Alireza Shojaei
Lifu Huang
LM&Ro
47
0
0
19 Jul 2024
OpenSU3D: Open World 3D Scene Understanding using Foundation Models
Rafay Mohiuddin
Sai Manoj Prakhya
Fiona Collins
Ziyuan Liu
André Borrmann
41
2
0
19 Jul 2024
SegPoint: Segment Any Point Cloud via Large Language Model
Shuting He
Henghui Ding
Xudong Jiang
Bihan Wen
3DV
MLLM
3DPC
48
19
0
18 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
62
4
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
48
2
0
18 Jul 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
54
1
0
14 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
42
5
0
13 Jul 2024
3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
Anh Thai
Weiyao Wang
Hao Tang
Stefan Stojanov
Matt Feiszli
James M. Rehg
3DPC
47
3
0
12 Jul 2024
Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining
Tianfang Sun
Zhizhong Zhang
Xin Tan
Yanyun Qu
Yuan Xie
35
0
0
10 Jul 2024
A Unified Framework for 3D Scene Understanding
Wei Xu
Chunsheng Shi
Sifan Tu
Xin Zhou
Dingkang Liang
Xiang Bai
VOS
34
5
0
03 Jul 2024
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction
Xuan Yu
Yili Liu
Chenrui Han
Sitong Mao
Shunbo Zhou
R. Xiong
Yiyi Liao
Yue Wang
ISeg
52
2
0
01 Jul 2024
Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Takayuki Nishimura
Katsuyuki Kuyo
Motonari Kambara
Komei Sugiura
DiffM
30
0
0
01 Jul 2024
3D Feature Distillation with Object-Centric Priors
Georgios Tziafas
Yucheng Xu
Zhibin Li
H. Kasaei
36
1
0
26 Jun 2024
Point-SAM: Promptable 3D Segmentation Model for Point Clouds
Yuchen Zhou
Jiayuan Gu
Tung Yen Chiang
Fanbo Xiang
Hao Su
48
17
0
25 Jun 2024
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
Jamie Watson
Filippo Aleotti
Mohamed Sayed
Z. Qureshi
Oisin Mac Aodha
Gabriel J. Brostow
Michael Firman
Sara Vicente
3DPC
29
0
0
13 Jun 2024
OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding
Yinan Deng
Jiahui Wang
Jingyu Zhao
Jianyu Dou
Yi Yang
Yufeng Yue
AI4CE
38
6
0
12 Jun 2024
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man
Liang-Yan Gui
Yu-Xiong Wang
43
12
0
11 Jun 2024
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
S. Linok
T. Zemskova
Svetlana Ladanova
Roman Titkov
Dmitry A. Yudin
Maxim Monastyrny
Aleksei Valenkov
LM&Ro
57
3
0
11 Jun 2024
VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Jianmeng Liu
Yichen Liu
Yuyao Zhang
Zeyuan Meng
Yu-Wing Tai
Chi-Keung Tang
49
0
0
08 Jun 2024
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
Y. Wu
Jiarui Meng
Haijie Li
Chenming Wu
Yahao Shi
...
Chen Zhao
Haocheng Feng
Errui Ding
Jingdong Wang
Jian Zhang
3DGS
3DPC
35
29
0
04 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Fahad Shahbaz Khan
VLM
ISeg
83
6
0
04 Jun 2024
CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos
Trong-Thuan Nguyen
Pha Nguyen
Xin Li
Jackson Cothren
Alper Yilmaz
Khoa Luu
48
3
0
03 Jun 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
47
6
0
02 Jun 2024
Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models
Tianrun Chen
Chunan Yu
Jing Li
Jianqi Zhang
Lanyun Zhu
Deyi Ji
Yong Zhang
Ying Zang
Zejian Li
Lingyun Sun
LRM
49
9
0
29 May 2024
3D StreetUnveiler with Semantic-aware 2DGS -- a simple baseline
Jingwei Xu
Yikai Wang
Yiqun Zhao
Yanwei Fu
Shenghua Gao
3DGS
62
2
0
28 May 2024
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Kuan-Chih Huang
Xiangtai Li
Lu Qi
Shuicheng Yan
Ming-Hsuan Yang
LRM
76
10
0
27 May 2024
Open-Vocabulary SAM3D: Understand Any 3D Scene
Hanchen Tai
Qingdong He
Jiangning Zhang
Yijie Qian
Zhenyu Zhang
Xiaobin Hu
Yabiao Wang
Yong Liu
VLM
54
0
0
24 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
48
21
0
19 May 2024
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
53
23
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
33
13
0
16 May 2024
Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
Haoming Chen
Zhizhong Zhang
Yanyun Qu
Ruixin Zhang
Xin Tan
Yuan Xie
45
1
0
12 May 2024
Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics
F. Ojeda
Ian D. Miller
Zachary Ravichandran
Varun Murali
Jason Hughes
M. A. Hsieh
Camillo J Taylor
Vijay Kumar
46
2
0
12 May 2024
Probing Multimodal LLMs as World Models for Driving
Shiva Sreeram
Tsun-Hsuan Wang
Alaa Maalouf
Guy Rosman
S. Karaman
Daniela Rus
32
7
0
09 May 2024
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving
Lingdong Kong
Xiang Xu
Jiawei Ren
Wenwei Zhang
Liang Pan
Kai-xiang Chen
Wei Tsang Ooi
Ziwei Liu
45
17
0
08 May 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
You-Chen Liu
Lingdong Kong
Xiaoyang Wu
Runnan Chen
Xin Li
Liang Pan
Ziwei Liu
Yuexin Ma
3DPC
48
17
0
02 May 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
38
33
0
21 Apr 2024
Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation
Myrna C. Silva
Mahtab Dahaghin
M. Toso
Alessio Del Bue
3DGS
37
11
0
19 Apr 2024
Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds
Oliver Lemke
Z. Bauer
René Zurbrugg
Marc Pollefeys
Francis Engelmann
Hermann Blum
3DPC
24
11
0
18 Apr 2024
Zero-shot detection of buildings in mobile LiDAR using Language Vision Model
June Moh Goo
Zichao Zeng
Jan Boehm
46
2
0
15 Apr 2024
QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding
Yash Mehan
Kumaraditya Gupta
Rohit Jayanti
Anirudh Govil
Sourav Garg
Madhava Krishna
3DPC
31
2
0
09 Apr 2024
GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts
Z. '. Milacski
Koichiro Niinuma
Ryosuke Kawamura
Fernando de la Torre
László A. Jeni
29
1
0
08 Apr 2024
Physical Property Understanding from Language-Embedded Feature Fields
Albert J. Zhai
Yuan Shen
Emily Y. Chen
Gloria X. Wang
Xinlei Wang
Sheng Wang
Kaiyu Guan
Shenlong Wang
38
13
0
05 Apr 2024
PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model
Amrin Kareem
Jean Lahoud
Hisham Cholakkal
LRM
50
4
0
04 Apr 2024
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
Rui Li
Tobias Fischer
Mattia Segu
Marc Pollefeys
Luc Van Gool
Federico Tombari
24
8
0
04 Apr 2024
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
Francis Engelmann
Fabian Manhardt
Michael Niemeyer
Keisuke Tateno
Marc Pollefeys
Federico Tombari
VLM
73
32
1
04 Apr 2024
Previous
1
2
3
4
5
6
Next