ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.07761
  4. Cited By
Language-Grounded Indoor 3D Semantic Segmentation in the Wild

Language-Grounded Indoor 3D Semantic Segmentation in the Wild

16 April 2022
Dávid Rozenberszki
Or Litany
Angela Dai
    3DV
    VLM
ArXivPDFHTML

Papers citing "Language-Grounded Indoor 3D Semantic Segmentation in the Wild"

50 / 50 papers shown
Title
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Feng Xue
Wenzhuang Xu
Guofeng Zhong
Anlong Minga
N. Sebe
65
0
0
01 May 2025
A Review of 3D Object Detection with Vision-Language Models
A Review of 3D Object Detection with Vision-Language Models
Ranjan Sapkota
Konstantinos I Roumeliotis
Rahul Harsha Cheppally
Marco Flores Calero
Manoj Karkee
VLM
82
2
0
25 Apr 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffM
VGen
65
0
0
22 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
168
0
0
20 Mar 2025
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
Yijie Tang
Jiazhao Zhang
Yuqing Lan
Yulan Guo
Dezun Dong
Chenyang Zhu
K. Xu
168
0
0
03 Mar 2025
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Shehan Munasinghe
Hanan Gani
Wenqi Zhu
Jiale Cao
Eric P. Xing
F. Khan
Salman Khan
MLLM
VGen
VLM
44
6
0
07 Nov 2024
Deep Learning on 3D Semantic Segmentation: A Detailed Review
Deep Learning on 3D Semantic Segmentation: A Detailed Review
Thodoris Betsas
Andreas Georgopoulos
Anastasios Doulamis
Pierre Grussenmeyer
3DV
3DPC
33
1
0
04 Nov 2024
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
Zhaochong An
Guolei Sun
Yun Liu
Runjia Li
Min Wu
Ming-Ming Cheng
Ender Konukoglu
Serge J. Belongie
64
4
0
29 Oct 2024
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Guangda Ji
Silvan Weder
Francis Engelmann
Marc Pollefeys
Hermann Blum
3DV
69
4
0
17 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
57
3
0
14 Oct 2024
Interactive4D: Interactive 4D LiDAR Segmentation
Interactive4D: Interactive 4D LiDAR Segmentation
Ilya Fradlin
Idil Esen Zulfikar
Kadir Yilmaz
Theodora Kontogianni
Bastian Leibe
49
1
0
10 Oct 2024
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Ayca Takmaz
Alexandros Delitzas
R. Sumner
Francis Engelmann
Johanna Wald
Federico Tombari
80
11
0
27 Sep 2024
Open-Ended 3D Point Cloud Instance Segmentation
Open-Ended 3D Point Cloud Instance Segmentation
Phuc D. A. Nguyen
Minh Luu
Anh Tran
Cuong Pham
Khoi Nguyen
3DPC
56
1
0
21 Aug 2024
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Xiuwei Xu
Huangxing Chen
Linqing Zhao
Ziwei Wang
Jie Zhou
Jiwen Lu
39
15
0
21 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
64
1
0
20 Aug 2024
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
ISeg
VLM
64
3
0
20 Aug 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
54
1
0
14 Jul 2024
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Jianing Yang
Xuweiyi Chen
Nikhil Madaan
Madhavan Iyengar
Shengyi Qian
David Fouhey
Joyce Chai
3DV
75
11
0
07 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
F. Khan
VLM
ISeg
83
6
0
04 Jun 2024
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
Kuan-Chih Huang
Xiangtai Li
Lu Qi
Shuicheng Yan
Ming-Hsuan Yang
LRM
76
10
0
27 May 2024
Segment Any 3D Object with Language
Segment Any 3D Object with Language
Seungjun Lee
Yuyang Zhao
Gim Hee Lee
44
1
0
02 Apr 2024
ODIN: A Single Model for 2D and 3D Segmentation
ODIN: A Single Model for 2D and 3D Segmentation
Ayush Jain
Pushkal Katara
N. Gkanatsios
Adam W. Harley
Gabriel H. Sarch
Kriti Aggarwal
Vishrav Chaudhary
Katerina Fragkiadaki
3DPC
53
7
0
04 Jan 2024
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding,
  Reasoning, and Planning
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Sijin Chen
Xin Chen
C. Zhang
Mingsheng Li
Gang Yu
Hao Fei
Hongyuan Zhu
Jiayuan Fan
Tao Chen
MLLM
26
79
0
30 Nov 2023
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D
  Features
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features
Thomas Wimmer
Peter Wonka
M. Ovsjanikov
36
9
0
29 Nov 2023
GenZI: Zero-Shot 3D Human-Scene Interaction Generation
GenZI: Zero-Shot 3D Human-Scene Interaction Generation
Lei Li
Angela Dai
VGen
30
21
0
29 Nov 2023
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
Shehan Munasinghe
Rusiru Thushara
Muhammad Maaz
H. Rasheed
Salman Khan
Mubarak Shah
Fahad Khan
VLM
MLLM
27
34
0
22 Nov 2023
LABELMAKER: Automatic Semantic Label Generation from RGB-D Trajectories
LABELMAKER: Automatic Semantic Label Generation from RGB-D Trajectories
Silvan Weder
Hermann Blum
Francis Engelmann
Marc Pollefeys
VLM
21
11
0
20 Nov 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
Haoyi Zhu
Honghui Yang
Xiaoyang Wu
Di Huang
Sha Zhang
...
Hengshuang Zhao
Chunhua Shen
Yu Qiao
Tong He
Wanli Ouyang
SSL
71
43
0
12 Oct 2023
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang
Xiaoyang Wu
Xi Chen
Hengshuang Zhao
Lei Zhu
Joan Lasenby
ISeg
3DPC
VLM
55
46
0
01 Sep 2023
ImGeoNet: Image-induced Geometry-aware Voxel Representation for
  Multi-view 3D Object Detection
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
Tao Tu
Shun-Po Chuang
Yu-Lun Liu
Cheng Sun
Kecheng Zhang
D. Roy
Cheng-Hao Kuo
Min Sun
3DPC
41
5
0
17 Aug 2023
Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D Scans
Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D Scans
Taiki Miyanishi
Daich Azuma
Shuhei Kurita
M. Kawanabe
36
2
0
23 May 2023
OpenShape: Scaling Up 3D Shape Representation Towards Open-World
  Understanding
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding
Minghua Liu
Ruoxi Shi
Kaiming Kuang
Yinhao Zhu
Xuanlin Li
Shizhong Han
H. Cai
Fatih Porikli
Hao Su
3DPC
39
116
0
18 May 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World
  3D Scene Understanding
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
20
62
0
03 Apr 2023
UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes
UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes
Dávid Rozenberszki
Or Litany
Angela Dai
3DPC
ISeg
34
23
0
25 Mar 2023
Open-Vocabulary Affordance Detection in 3D Point Clouds
Open-Vocabulary Affordance Detection in 3D Point Clouds
Toan Ngyen
Minh Nhat Vu
Annalies Vuong
Dzung Nguyen
T. Vo
Ngan Le
A. Nguyen
3DPC
24
32
0
04 Mar 2023
Top-Down Beats Bottom-Up in 3D Instance Segmentation
Top-Down Beats Bottom-Up in 3D Instance Segmentation
Maksim Kolodiazhnyi
Anna Vorontsova
Anton Konushin
D. Rukhovich
3DPC
36
22
0
06 Feb 2023
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP
Runnan Chen
Youquan Liu
Lingdong Kong
Xinge Zhu
Yuexin Ma
Yikang Li
Yuenan Hou
Yu Qiao
Wenping Wang
CLIP
3DPC
31
139
0
12 Jan 2023
Understanding Imbalanced Semantic Segmentation Through Neural Collapse
Understanding Imbalanced Semantic Segmentation Through Neural Collapse
Zhisheng Zhong
Jiequan Cui
Yibo Yang
Xiaoyang Wu
Xiaojuan Qi
Xinming Zhang
Jiaya Jia
135
45
0
03 Jan 2023
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image
  Transformers Help 3D Representation Learning?
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT
3DPC
29
84
0
16 Dec 2022
LidarCLIP or: How I Learned to Talk to Point Clouds
LidarCLIP or: How I Learned to Talk to Point Clouds
Georg Hess
Adam Tonderski
Christoffer Petersson
Kalle AAstrom
Lennart Svensson
DiffM
27
22
0
13 Dec 2022
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved
  Visio-Linguistic Models in 3D Scenes
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
22
28
0
12 Dec 2022
Text2Mesh: Text-Driven Neural Stylization for Meshes
Text2Mesh: Text-Driven Neural Stylization for Meshes
O. Michel
Roi Bar-On
Richard Liu
Sagie Benaim
Rana Hanocka
CLIP
AI4CE
199
351
0
06 Dec 2021
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D
  Scene Understanding
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding
Yujin Chen
Matthias Nießner
Angela Dai
3DPC
105
58
0
06 Dec 2021
PointCLIP: Point Cloud Understanding by CLIP
PointCLIP: Point Cloud Understanding by CLIP
Renrui Zhang
Ziyu Guo
Wei Zhang
Kunchang Li
Xupeng Miao
Bin Cui
Yu Qiao
Peng Gao
Hongsheng Li
VLM
3DPC
175
435
0
04 Dec 2021
Mix3D: Out-of-Context Data Augmentation for 3D Scenes
Mix3D: Out-of-Context Data Augmentation for 3D Scenes
Alexey Nekrasov
Jonas Schult
Or Litany
Bastian Leibe
Francis Engelmann
3DPC
164
154
0
05 Oct 2021
Self-Supervised Pretraining of 3D Features on any Point-Cloud
Self-Supervised Pretraining of 3D Features on any Point-Cloud
Zaiwei Zhang
Rohit Girdhar
Armand Joulin
Ishan Misra
3DPC
126
268
0
07 Jan 2021
PointContrast: Unsupervised Pre-training for 3D Point Cloud
  Understanding
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding
Saining Xie
Jiatao Gu
Demi Guo
C. Qi
Leonidas J. Guibas
Or Litany
3DPC
141
622
0
21 Jul 2020
OccuSeg: Occupancy-aware 3D Instance Segmentation
OccuSeg: Occupancy-aware 3D Instance Segmentation
Lei Han
Tian Zheng
Lan Xu
Lu Fang
3DPC
160
256
0
14 Mar 2020
SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional
  Filters
SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters
Yifan Xu
Tianqi Fan
Mingye Xu
Long Zeng
Yu Qiao
3DV
3DPC
152
769
0
30 Mar 2018
PointNet: Deep Learning on Point Sets for 3D Classification and
  Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
222
14,103
0
02 Dec 2016
1