ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.19199
  4. Cited By
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

24 March 2025
Chenyangguang Zhang
Alexandros Delitzas
Fangjinhua Wang
Ruida Zhang
Xiangyang Ji
Marc Pollefeys
Francis Engelmann
    3DV3DPC
ArXiv (abs)PDFHTML

Papers citing "Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces"

50 / 55 papers shown
Title
SLAG: Scalable Language-Augmented Gaussian Splatting
SLAG: Scalable Language-Augmented Gaussian Splatting
Laszlo Szilagyi
Francis Engelmann
Jeannette Bohg
3DGS
92
0
0
12 May 2025
Compile Scene Graphs with Reinforcement Learning
Compile Scene Graphs with Reinforcement Learning
Zuyao Chen
Jinlin Wu
Zhen Lei
Marc Pollefeys
Chang Wen Chen
OffRLLRM
122
3
0
18 Apr 2025
SuperDec: 3D Scene Decomposition with Superquadric Primitives
SuperDec: 3D Scene Decomposition with Superquadric Primitives
Elisabetta Fedele
Boyang Sun
Leonidas Guibas
Marc Pollefeys
Francis Engelmann
3DPC
91
2
0
01 Apr 2025
OpenCity3D: What do Vision-Language Models know about Urban Environments?
OpenCity3D: What do Vision-Language Models know about Urban Environments?
Valentin Bieri
Marco Zamboni
Nicolas S. Blumer
Qingxuan Chen
Francis Engelmann
92
1
0
21 Mar 2025
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
123
4
0
17 Oct 2024
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Ayca Takmaz
Alexandros Delitzas
R. Sumner
Francis Engelmann
Johanna Wald
Federico Tombari
140
13
0
27 Sep 2024
Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking
Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking
Prithviraj Banerjee
Sindi Shkodrani
Pierre Moulon
Shreyas Hampali
Fan Zhang
...
Selen Basol
Richard Newcombe
Robert Y. Wang
Jakob Julian Engel
Tomás Hodan
97
17
0
13 Jun 2024
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features
  and Rendered Novel Views
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
Francis Engelmann
Fabian Manhardt
Michael Niemeyer
Keisuke Tateno
Marc Pollefeys
Federico Tombari
VLM
130
34
1
04 Apr 2024
Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot
  Navigation
Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation
Abdelrhman Werby
Chen Huang
M. Büchner
Abhinav Valada
Wolfram Burgard
90
85
0
26 Mar 2024
ICGNet: A Unified Approach for Instance-Centric Grasping
ICGNet: A Unified Approach for Instance-Centric Grasping
René Zurbrugg
Yifan Liu
Francis Engelmann
Suryansh Kumar
Marco Hutter
Vaishakh Patil
Fisher Yu
3DV
78
10
0
18 Jan 2024
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D
  Scene Understanding
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
Xingxing Zuo
Pouya Samangouei
Yunwen Zhou
Yan Di
Mingyang Li
3DGS
87
52
0
03 Jan 2024
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled
  Feature Fields
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
Shijie Zhou
Haoran Chang
Sicheng Jiang
Zhiwen Fan
Zehao Zhu
Dejia Xu
Pradyumna Chari
Suya You
Zhangyang Wang
A. Kadambi
3DGS
86
182
0
06 Dec 2023
ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic
  Reconstruction
ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction
Silvan Weder
Francis Engelmann
Johannes L. Schonberger
Akihito Seki
Marc Pollefeys
Martin R. Oswald
3DPC3DV
70
4
0
29 Nov 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
  Planning
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Yuanyi Zhong
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
107
186
0
28 Sep 2023
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayca Takmaz
Elisabetta Fedele
R. Sumner
Marc Pollefeys
F. Tombari
Francis Engelmann
ISegVLM
79
173
0
23 Jun 2023
Multi-CLIP: Contrastive Vision-Language Pre-training for Question
  Answering tasks in 3D Scenes
Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes
Alexandros Delitzas
Maria Parelli
Nikolas Hars
G. Vlassis
Sotiris Anagnostidis
Gregor Bachmann
Thomas Hofmann
CLIP
45
20
0
04 Jun 2023
AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation
AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation
Yuanwen Yue
Sabarinath Mahadevan
Jonas Schult
Francis Engelmann
Bastian Leibe
Konrad Schindler
Theodora Kontogianni
3DPC
85
31
0
01 Jun 2023
Visual Instruction Tuning
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDaVLMMLLM
569
4,910
0
17 Apr 2023
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations
NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
78
52
0
23 Mar 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
1.5K
14,699
0
15 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set
  Object Detection
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
191
2,015
0
09 Mar 2023
ConceptFusion: Open-set Multimodal 3D Mapping
ConceptFusion: Open-set Multimodal 3D Mapping
Krishna Murthy Jatavallabhula
Ali Kuwajerwala
Qiao Gu
Mohd. Omama
Tao Chen
...
Celso Miguel de Melo
Madhava Krishna
Liam Paull
Florian Shkurti
Antonio Torralba
78
245
0
14 Feb 2023
OpenScene: 3D Scene Understanding with Open Vocabularies
OpenScene: 3D Scene Understanding with Open Vocabularies
Songyou Peng
Kyle Genova
ChiyuMaxJiang
Andrea Tagliasacchi
Marc Pollefeys
Thomas Funkhouser
3DPCVLM
100
366
0
28 Nov 2022
AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object
  Reconstruction
AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction
Zerui Chen
Yana Hasson
Cordelia Schmid
Ivan Laptev
3DH
90
55
0
26 Jul 2022
TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs
TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs
Christopher Agia
Krishna Murthy Jatavallabhula
M. Khodeir
O. Mikšík
Vibhav Vineet
Mustafa Mukadam
Liam Paull
Florian Shkurti
93
71
0
11 Jul 2022
What's in your hands? 3D Reconstruction of Generic Objects in Hands
What's in your hands? 3D Reconstruction of Generic Objects in Hands
Yufei Ye
Abhinav Gupta
Shubham Tulsiani
3DH
60
86
0
14 Apr 2022
Multi-View Transformer for 3D Visual Grounding
Multi-View Transformer for 3D Visual Grounding
Shijia Huang
Yilun Chen
Jiaya Jia
Liwei Wang
91
127
0
05 Apr 2022
SoftGroup for 3D Instance Segmentation on Point Clouds
SoftGroup for 3D Instance Segmentation on Point Clouds
Thang Vu
Kookhoi Kim
Tung M. Luu
Xuan Thanh Nguyen
Chang D. Yoo
3DPC
67
240
0
03 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&RoLRMAI4CEReLM
839
9,644
0
28 Jan 2022
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene
  Understanding Using Mobile RGB-D Data
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data
Gilad Baruch
Zhuoyuan Chen
Afshin Dehghan
Tal Dimry
Yuri Feigin
...
Thomas Gebauer
Brandon Joffe
Daniel Kurz
Arik Schwartz
Elad Shulman
3DV3DPC
89
210
0
17 Nov 2021
Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using
  Scene Graphs
Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs
Helisa Dhamo
Fabian Manhardt
Nassir Navab
F. Tombari
3DV
44
70
0
19 Aug 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
70
95
0
07 Jul 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Zhengyuan Yang
Songyang Zhang
Liwei Wang
Jiebo Luo
3DPC
81
126
0
24 May 2021
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D
  Sequences
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences
Shun-cheng Wu
Johanna Wald
Keisuke Tateno
Nassir Navab
Federico Tombari
3DPC
52
161
0
27 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
967
29,810
0
26 Feb 2021
Kimera: from SLAM to Spatial Perception with 3D Dynamic Scene Graphs
Kimera: from SLAM to Spatial Perception with 3D Dynamic Scene Graphs
Antoni Rosinol
Andrew Violette
Marcus Abate
Nathan Hughes
Yun Chang
Jingang Shi
Arjun Gupta
Luca Carlone
3DV
104
238
0
18 Jan 2021
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Johanna Wald
Helisa Dhamo
Nassir Navab
Federico Tombari
3DV3DPC
71
218
0
08 Apr 2020
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
Li Jiang
Hengshuang Zhao
Shaoshuai Shi
Shu Liu
Chi-Wing Fu
Jiaya Jia
3DPC
83
436
0
03 Apr 2020
3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation
3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation
Francis Engelmann
M. Bokeloh
Alireza Fathi
Bastian Leibe
Matthias Nießner
3DPC
78
214
0
30 Mar 2020
OccuSeg: Occupancy-aware 3D Instance Segmentation
OccuSeg: Occupancy-aware 3D Instance Segmentation
Lei Han
Tian Zheng
Lan Xu
Lu Fang
3DPC
246
260
0
14 Mar 2020
3D Dynamic Scene Graphs: Actionable Spatial Perception with Places,
  Objects, and Humans
3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans
Antoni Rosinol
Arjun Gupta
Marcus Abate
Jingang Shi
Luca Carlone
87
196
0
15 Feb 2020
SuperGlue: Learning Feature Matching with Graph Neural Networks
SuperGlue: Learning Feature Matching with Graph Neural Networks
Paul-Edouard Sarlin
Daniel DeTone
Tomasz Malisiewicz
Andrew Rabinovich
3DPCOffRL
127
1,949
0
26 Nov 2019
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera
Iro Armeni
Zhi-Yang He
JunYoung Gwak
Amir Zamir
Martin Fischer
Jitendra Malik
Silvio Savarese
3DV3DPC
103
349
0
06 Oct 2019
KPConv: Flexible and Deformable Convolution for Point Clouds
KPConv: Flexible and Deformable Convolution for Point Clouds
Hugues Thomas
C. Qi
Jean-Emmanuel Deschaud
B. Marcotegui
F. Goulette
Leonidas Guibas
3DPC
169
2,541
0
18 Apr 2019
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
Chris Choy
JunYoung Gwak
Silvio Savarese
3DPC
167
1,792
0
18 Apr 2019
3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans
3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans
Ji Hou
Angela Dai
Matthias Nießner
ISeg
82
468
0
17 Dec 2018
Grounded Human-Object Interaction Hotspots from Video
Grounded Human-Object Interaction Hotspots from Video
Tushar Nagarajan
Christoph Feichtenhofer
Kristen Grauman
81
161
0
11 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,175
0
11 Oct 2018
PointCNN: Convolution On $\mathcal{X}$-Transformed Points
PointCNN: Convolution On X\mathcal{X}X-Transformed Points
Yangyan Li
Rui Bu
Mingchao Sun
Wei Wu
Xinhan Di
Baoquan Chen
3DPC
235
2,450
0
23 Jan 2018
AffordanceNet: An End-to-End Deep Learning Approach for Object
  Affordance Detection
AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
Thanh-Toan Do
A. Nguyen
Ian Reid
63
297
0
21 Sep 2017
12
Next