ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.15654
  4. Cited By
OpenScene: 3D Scene Understanding with Open Vocabularies

OpenScene: 3D Scene Understanding with Open Vocabularies

28 November 2022
Songyou Peng
Kyle Genova
ChiyuMaxJiang
Andrea Tagliasacchi
Marc Pollefeys
Thomas Funkhouser
    3DPC
    VLM
ArXivPDFHTML

Papers citing "OpenScene: 3D Scene Understanding with Open Vocabularies"

50 / 285 papers shown
Title
Densify Your Labels: Unsupervised Clustering with Bipartite Matching for
  Weakly Supervised Point Cloud Segmentation
Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation
Shaobo Xia
Jun Yue
Kacper Kania
Leyuan Fang
Andrea Tagliasacchi
Kwang Moo Yi
Weiwei Sun
3DPC
24
3
0
11 Dec 2023
PartDistill: 3D Shape Part Segmentation by Vision-Language Model
  Distillation
PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation
Ardian Umam
Cheng-Kun Yang
Min-Hung Chen
Jen-Hui Chuang
Yen-Yu Lin
29
11
0
07 Dec 2023
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation
Xiang Xu
Lingdong Kong
Hui Shuai
Qingshan Liu
3DPC
64
23
0
07 Dec 2023
Novel class discovery meets foundation models for 3D semantic
  segmentation
Novel class discovery meets foundation models for 3D semantic segmentation
Luigi Riz
Cristiano Saltori
Yiming Wang
Elisa Ricci
Fabio Poiesi
3DPC
39
0
0
06 Dec 2023
Geometrically-driven Aggregation for Zero-shot 3D Point Cloud
  Understanding
Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
3DPC
30
6
0
04 Dec 2023
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models
Andrea Caraffa
Davide Boscaini
Amir Hamza
Fabio Poiesi
61
15
0
01 Dec 2023
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Mingqiao Ye
Martin Danelljan
Fisher Yu
Lei Ke
3DGS
DiffM
24
167
0
01 Dec 2023
Segment Any 3D Gaussians
Segment Any 3D Gaussians
Jiazhong Cen
Jiemin Fang
Chen Yang
Lingxi Xie
Xiaopeng Zhang
Wei Shen
Qi Tian
3DGS
79
70
0
01 Dec 2023
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Jin-Chuan Shi
Miao Wang
Hao-Bin Duan
Shao-Hua Guan
3DGS
46
84
0
30 Nov 2023
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D
  Features
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features
Thomas Wimmer
Peter Wonka
M. Ovsjanikov
39
9
0
29 Nov 2023
ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic
  Reconstruction
ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction
Silvan Weder
Francis Engelmann
Johannes L. Schonberger
Akihito Seki
Marc Pollefeys
Martin R. Oswald
3DPC
3DV
19
4
0
29 Nov 2023
ViT-Lens: Towards Omni-modal Representations
ViT-Lens: Towards Omni-modal Representations
Weixian Lei
Yixiao Ge
Kun Yi
Jianfeng Zhang
Difei Gao
Dylan Sun
Yuying Ge
Ying Shan
Mike Zheng Shou
21
18
0
27 Nov 2023
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Zhihao Yuan
Jinke Ren
Chun-Mei Feng
Hengshuang Zhao
Shuguang Cui
Zhen Li
39
26
0
26 Nov 2023
LABELMAKER: Automatic Semantic Label Generation from RGB-D Trajectories
LABELMAKER: Automatic Semantic Label Generation from RGB-D Trajectories
Silvan Weder
Hermann Blum
Francis Engelmann
Marc Pollefeys
VLM
24
11
0
20 Nov 2023
Applications of Large Scale Foundation Models for Autonomous Driving
Applications of Large Scale Foundation Models for Autonomous Driving
Yu Huang
Yue Chen
Zhu Li
ELM
AI4CE
LRM
ALM
LM&Ro
61
15
0
20 Nov 2023
OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive
  Learning
OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning
Haiyang Ying
Yixuan Yin
Jinzhi Zhang
Fan Wang
Tao Yu
Ruqi Huang
Lu Fang
15
30
0
20 Nov 2023
An Embodied Generalist Agent in 3D World
An Embodied Generalist Agent in 3D World
Jiangyong Huang
Silong Yong
Xiaojian Ma
Xiongkun Linghu
Puhao Li
Yan Wang
Qing Li
Song-Chun Zhu
Baoxiong Jia
Siyuan Huang
LM&Ro
31
139
0
18 Nov 2023
OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D
  Data
OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data
Shiyang Lu
Haonan Chang
E. Jing
Abdeslam Boularias
Kostas Bekris
21
55
0
06 Nov 2023
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D
  Pre-training
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
Yipeng Gao
Zeyu Wang
Wei-Shi Zheng
Cihang Xie
Yuyin Zhou
3DPC
34
8
0
03 Nov 2023
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
Xingrui Wang
Wufei Ma
Zhuowan Li
Adam Kortylewski
Alan L. Yuille
CoGe
27
12
0
27 Oct 2023
Drive Anywhere: Generalizable End-to-end Autonomous Driving with
  Multi-modal Foundation Models
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Tsun-Hsuan Wang
Alaa Maalouf
Wei Xiao
Yutong Ban
Alexander Amini
Guy Rosman
S. Karaman
Daniela Rus
27
42
0
26 Oct 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar
Three Pillars improving Vision Foundation Model Distillation for Lidar
Gilles Puy
Spyros Gidaris
Alexandre Boulch
Oriane Siméoni
Corentin Sautier
Patrick Pérez
Andrei Bursuc
Renaud Marlet
107
18
0
26 Oct 2023
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous
  Manipulation
SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation
Qianxu Wang
Haotong Zhang
Congyue Deng
Yang You
Hao Dong
Yixin Zhu
Leonidas J. Guibas
29
18
0
25 Oct 2023
Lang3DSG: Language-based contrastive pre-training for 3D Scene Graph
  prediction
Lang3DSG: Language-based contrastive pre-training for 3D Scene Graph prediction
Sebastian Koch
Pedro Hermosilla
Narunas Vaskevicius
Mirco Colosi
Timo Ropinski
37
9
0
25 Oct 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive
  Survey and Evaluation
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
37
10
0
24 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
37
39
0
22 Oct 2023
OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D
  Data
OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data
Yijie Zhou
Likun Cai
Xianhui Cheng
Zhongxue Gan
Xiangyang Xue
Wenchao Ding
3DV
VLM
19
13
0
20 Oct 2023
Learning to Adapt SAM for Segmenting Cross-domain Point Clouds
Learning to Adapt SAM for Segmenting Cross-domain Point Clouds
Xidong Peng
Runnan Chen
Feng Qiao
Lingdong Kong
You-Chen Liu
Tai Wang
Xinge Zhu
Yuexin Ma
36
12
0
13 Oct 2023
Think, Act, and Ask: Open-World Interactive Personalized Robot
  Navigation
Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation
Yinpei Dai
Run Peng
Sikai Li
Joyce Chai
LM&Ro
40
24
0
12 Oct 2023
S4C: Self-Supervised Semantic Scene Completion with Neural Fields
S4C: Self-Supervised Semantic Scene Completion with Neural Fields
Adrian Hayler
Felix Wimbauer
Dominik Muhle
Christian Rupprecht
Daniel Cremers
23
22
0
11 Oct 2023
Compositional Semantics for Open Vocabulary Spatio-semantic
  Representations
Compositional Semantics for Open Vocabulary Spatio-semantic Representations
Robin Karlsson
Francisco Lepe-Salazar
K. Takeda
VLM
53
1
0
08 Oct 2023
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for
  Open-vocabulary 3D Object Detection
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
24
33
0
04 Oct 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
  Planning
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Yuanyi Zhong
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
36
168
0
28 Sep 2023
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time
  Visual Scene Understanding
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding
Christina Kassab
Matías Mattamala
Lintong Zhang
Maurice F. Fallon
34
18
0
26 Sep 2023
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language
  Model as an Agent
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
Jianing Yang
Xuweiyi Chen
Shengyi Qian
Nikhil Madaan
Madhavan Iyengar
David Fouhey
Joyce Chai
LM&Ro
LLMAG
43
84
0
21 Sep 2023
Open-Vocabulary Affordance Detection using Knowledge Distillation and
  Text-Point Correlation
Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation
Tuan V. Vo
Minh Nhat Vu
Baoru Huang
Toan Tien Nguyen
Ngan Le
T. Vo
Anh Nguyen
3DPC
24
10
0
19 Sep 2023
Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
An Vuong
Minh Nhat Vu
Hieu Le
Baoru Huang
B. Huynh
T. Vo
Andreas Kugi
Anh Nguyen
VLM
21
28
0
18 Sep 2023
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D
  Detection
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection
Chenming Zhu
Wenwei Zhang
Tai Wang
Xihui Liu
Kai-xiang Chen
3DPC
41
18
0
18 Sep 2023
Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping
Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping
Adam Rashid
Satvik Sharma
C. Kim
J. Kerr
L. Chen
Angjoo Kanazawa
Ken Goldberg
62
87
0
14 Sep 2023
Panoptic Vision-Language Feature Fields
Panoptic Vision-Language Feature Fields
Haoran Chen
Kenneth Blomqvist
Francesco Milano
Roland Siegwart
VLM
24
13
0
11 Sep 2023
Physically Grounded Vision-Language Models for Robotic Manipulation
Physically Grounded Vision-Language Models for Robotic Manipulation
Jensen Gao
Bidipta Sarkar
F. Xia
Ted Xiao
Jiajun Wu
Brian Ichter
Anirudha Majumdar
Dorsa Sadigh
LM&Ro
30
114
0
05 Sep 2023
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang
Xiaoyang Wu
Xi Chen
Hengshuang Zhao
Lei Zhu
Joan Lasenby
ISeg
3DPC
VLM
55
46
0
01 Sep 2023
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
33
1
0
20 Aug 2023
Retro-FPN: Retrospective Feature Pyramid Network for Point Cloud
  Semantic Segmentation
Retro-FPN: Retrospective Feature Pyramid Network for Point Cloud Semantic Segmentation
Peng Xiang
Xin Wen
Yu-Shen Liu
Hui Zhang
Yi Fang
Zhizhong Han
3DPC
18
9
0
18 Aug 2023
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field
  maps with natural language
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
Francesco Taioli
Federico Cunico
Federico Girella
Riccardo Bologna
Alessandro Farinelli
Marco Cristani
26
7
0
17 Aug 2023
Lowis3D: Language-Driven Open-World Instance-Level 3D Scene
  Understanding
Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
3DV
VLM
21
28
0
01 Aug 2023
VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive
  Representation
VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
Zekun Qi
Muzhou Yu
Runpei Dong
Kaisheng Ma
3DPC
26
11
0
28 Jul 2023
Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation
Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation
Bokui (William) Shen
Ge Yang
Alan Yu
J. Wong
L. Kaelbling
Phillip Isola
VLM
29
104
0
27 Jul 2023
Industrial Segment Anything -- a Case Study in Aircraft Manufacturing,
  Intralogistics, Maintenance, Repair, and Overhaul
Industrial Segment Anything -- a Case Study in Aircraft Manufacturing, Intralogistics, Maintenance, Repair, and Overhaul
Keno Moenck
Arne Wendt
Philipp Prünte
Julian Koch
Arne Sahrhage
...
Falko Kähler
Dirk Holst
Martin Gomse
Thorsten Schuppstuhl
Daniel Schoepflin
VLM
30
6
0
24 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present,
  and Future
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
31
32
0
18 Jul 2023
Previous
123456
Next