Language-driven Semantic Segmentation

10 January 2022
Boyi Li
Kilian Q. Weinberger
Serge J. Belongie
V. Koltun
René Ranftl
    VLM

Papers citing "Language-driven Semantic Segmentation"

50 / 478 papers shown
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation
Xiaoqi Wang
Wenbin He
Xiwei Xuan
Clint Sebastian
Jorge Henrique Piazentin Ono
...
Sima Behpour
T. Doan
Liang Gou
Han-Wei Shen
Liu Ren
VLM
34
5
0
07 Jun 2024
Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
Pedro Miraldo
Suhas Lohit
Moitreya Chatterjee
3DGS
33
4
0
06 Jun 2024
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
Y. Wu
Jiarui Meng
Haijie Li
Chenming Wu
Yahao Shi
...
Chen Zhao
Haocheng Feng
Errui Ding
Jingdong Wang
Jian Zhang
3DGS
3DPC
31
29
0
04 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
F. Khan
VLM
ISeg
83
6
0
04 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
48
2
0
03 Jun 2024
DGD: Dynamic 3D Gaussians Distillation
Isaac Labe
Noam Issachar
Itai Lang
Sagie Benaim
46
4
0
29 May 2024
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Yansong Qu
Shaohui Dai
Xinyang Li
Jianghang Lin
Liujuan Cao
Shengchuan Zhang
Rongrong Ji
35
19
0
27 May 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGe
LRM
48
2
0
27 May 2024
Open-Vocabulary SAM3D: Understand Any 3D Scene
Hanchen Tai
Qingdong He
Jiangning Zhang
Yijie Qian
Zhenyu Zhang
Xiaobin Hu
Yabiao Wang
Yong Liu
VLM
54
0
0
24 May 2024
Probing Multimodal LLMs as World Models for Driving
Shiva Sreeram
Tsun-Hsuan Wang
Alaa Maalouf
Guy Rosman
S. Karaman
Daniela Rus
30
7
0
09 May 2024
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
Lingdong Kong
You-Chen Liu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
VLM
34
14
0
08 May 2024
$M^2D$NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields
N. Wang
Lefei Zhang
Angel X Chang
53
0
0
08 May 2024
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
O. Shorinwa
Johnathan Tucker
Aliyah Smith
Aiden Swann
Timothy Chen
Roya Firoozi
Monroe Kennedy
Mac Schwager
31
22
0
07 May 2024
NeRF in Robotics: A Survey
Guangming Wang
Lei Pan
Songyou Peng
Shaohui Liu
Chenfeng Xu
Yanzi Miao
Wei Zhan
Masayoshi Tomizuka
Marc Pollefeys
Hesheng Wang
29
12
0
02 May 2024
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
Minghao Chen
Iro Laina
Andrea Vedaldi
3DGS
45
23
0
29 Apr 2024
CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields
Deheng Zhang
Clara Fernandez-Labrador
Christopher Schroers
34
9
0
23 Apr 2024
CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding
Guibiao Liao
Jiankun Li
Zhenyu Bao
Xiaoqing Ye
Jingdong Wang
Qing Li
Kanglin Liu
3DGS
43
14
0
22 Apr 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
38
33
0
21 Apr 2024
Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation
Myrna C. Silva
Mahtab Dahaghin
M. Toso
Alessio Del Bue
3DGS
34
11
0
19 Apr 2024
What does CLIP know about peeling a banana?
Claudia Cuttano
Gabriele Rosi
Gabriele Trivigno
Giuseppe Averta
29
2
0
18 Apr 2024
Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
Kim Hoang Tran
Phuc Vuong Do
Ngoc Quoc Ly
Ngan Le
36
4
0
15 Apr 2024
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Zhongrui Gui
Shuyang Sun
Runjia Li
Jianhao Yuan
Zhaochong An
Karsten Roth
Ameya Prabhu
Philip H. S. Torr
VLM
CLL
32
6
0
15 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
43
14
0
12 Apr 2024
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs
Kanchana Ranasinghe
Satya Narayan Shukla
Omid Poursaeed
Michael S. Ryoo
Tsung-Yu Lin
LRM
49
23
0
11 Apr 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Jingxuan Xu
Wuyang Chen
Yao-Min Zhao
Yunchao Wei
VLM
36
2
0
11 Apr 2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
Luca Barsellotti
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
VLM
DiffM
47
13
0
09 Apr 2024
GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts
Z. Á. Milacski
Koichiro Niinuma
Ryosuke Kawamura
Fernando de la Torre
László A. Jeni
29
1
0
08 Apr 2024
Retrieval-Augmented Open-Vocabulary Object Detection
Jooyeon Kim
Eulrang Cho
Sehyung Kim
Hyunwoo J. Kim
VLM
ObjD
45
8
0
08 Apr 2024
Physical Property Understanding from Language-Embedded Feature Fields
Albert J. Zhai
Yuan Shen
Emily Y. Chen
Gloria X. Wang
Xinlei Wang
Sheng Wang
Kaiyu Guan
Shenlong Wang
38
13
0
05 Apr 2024
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Ji-Jia Wu
Andy Chia-Hao Chang
Chieh-Yu Chuang
Chun-Pei Chen
Yu-Lun Liu
Min-Hung Chen
Hou-Ning Hu
Yung-Yu Chuang
Yen-Yu Lin
VLM
46
9
0
05 Apr 2024
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
Rui Li
Tobias Fischer
Mattia Segu
Marc Pollefeys
Luc Van Gool
Federico Tombari
21
8
0
04 Apr 2024
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
Francis Engelmann
Fabian Manhardt
Michael Niemeyer
Keisuke Tateno
Marc Pollefeys
Federico Tombari
VLM
73
32
1
04 Apr 2024
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale
Jinbin Huang
Cheng Chen
Aditi Mishra
Bum Chul Kwon
Zhicheng Liu
Chris Bryan
47
4
0
03 Apr 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qixing Huang
42
3
0
03 Apr 2024
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields
Yunsong Wang
Hanlin Chen
Gim Hee Lee
34
5
0
01 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
44
2
0
31 Mar 2024
IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation
Jiacui Huang
Hongtao Zhang
Mingbo Zhao
Zhou Wu
LM&Ro
39
5
0
28 Mar 2024
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Xiaoyang Lyu
Chirui Chang
Peng Dai
Yang-tian Sun
Xiaojuan Qi
3DGS
46
3
0
28 Mar 2024
Online Embedding Multi-Scale CLIP Features into 3D Maps
Shun Taguchi
Hideki Deguchi
27
0
0
27 Mar 2024
Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation
Abdelrhman Werby
Chen Huang
M. Büchner
Abhinav Valada
Wolfram Burgard
36
64
0
26 Mar 2024
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting
Jun Guo
Xiaojian Ma
Yue Fan
Huaping Liu
Qing Li
3DGS
36
26
0
22 Mar 2024
Long-CLIP: Unlocking the Long-Text Capability of CLIP
Beichen Zhang
Pan Zhang
Xiao-wen Dong
Yuhang Zang
Jiaqi Wang
CLIP
VLM
39
110
0
22 Mar 2024
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Pablo Marcos-Manchón
Roberto Alcover-Couso
Juan C. Sanmiguel
Jose M. Martínez
VLM
49
18
0
21 Mar 2024
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLM
50
7
0
21 Mar 2024
Better Call SAL: Towards Learning to Segment Anything in Lidar
Aljoša Ošep
Tim Meinhardt
Francesco Ferroni
Neehar Peri
Deva Ramanan
Laura Leal-Taixé
VLM
35
15
0
19 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLM
CLIP
39
2
0
19 Mar 2024
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
Haochen Jiang
Yueming Xu
Yihan Zeng
Hang Xu
Wei Zhang
Jianfeng Feng
Li Zhang
40
1
0
18 Mar 2024
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
VLM
30
2
0
17 Mar 2024
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields
Yonggan Fu
Huaizhi Qu
Zhifan Ye
Chaojian Li
Kevin Zhao
Yingyan Lin
AI4CE
30
0
0
17 Mar 2024
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields
Yash Bhalgat
Iro Laina
João F. Henriques
Andrew Zisserman
Andrea Vedaldi
46
14
0
16 Mar 2024