ResearchTrend.AI
arXiv: 2201.03546
Language-driven Semantic Segmentation

10 January 2022
Boyi Li
Kilian Q. Weinberger
Serge J. Belongie
V. Koltun
René Ranftl
    VLM

Papers citing "Language-driven Semantic Segmentation"

50 / 478 papers shown
Embodiment-Agnostic Action Planning via Object-Part Scene Flow
Weiliang Tang
Jia-Hui Pan
Wei Zhan
Jianshu Zhou
Huaxiu Yao
Yun-Hui Liu
M. Tomizuka
Mingyu Ding
Chi-Wing Fu
50
0
0
16 Sep 2024
E2Map: Experience-and-Emotion Map for Self-Reflective Robot Navigation with Language Models
Chan Kim
Keonwoo Kim
Mintaek Oh
Hanbi Baek
Jiyang Lee
...
John Tucker
Roya Firoozi
Seung-Woo Seo
Mac Schwager
Seong-Woo Kim
LM&Ro
32
0
0
16 Sep 2024
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Qilong Zhangli
Di Liu
Abhishek Aich
Dimitris Metaxas
S. Schulter
36
0
0
15 Sep 2024
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Wenhao Xu
Changwei Wang
Xuxiang Feng
Rongtao Xu
Longzhao Huang
Zherui Zhang
Li Guo
Shibiao Xu
VLM
34
2
0
13 Sep 2024
Context-Aware Replanning with Pre-explored Semantic Map for Object Navigation
Hung-Ting Su
Ching-Yuan Chen
Po-Chen Ko
Jia-Fong Yeh
Min Sun
Winston H. Hsu
16
0
0
07 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
43
1
0
05 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-Xiong Wang
75
15
0
05 Sep 2024
Image-to-Lidar Relational Distillation for Autonomous Driving Data
Anas Mahmoud
Ali Harakeh
Steven Waslander
26
0
0
01 Sep 2024
CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP
Zhenchen Tang
Zichuan Wang
Bo Peng
Jing Dong
EGVM
30
2
0
27 Aug 2024
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation
Yuanbing Zhu
Bingke Zhu
Zhen Chen
Huan Xu
Ming Tang
Jinqiao Wang
VLM
34
0
0
27 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
42
4
0
23 Aug 2024
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
ISeg
VLM
64
3
0
20 Aug 2024
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras
Muhammad Rameez Ur Rahman
Jhony H. Giraldo
Indro Spinelli
Stéphane Lathuilière
Fabio Galasso
VLM
28
0
0
18 Aug 2024
VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps
Senthil Hariharan Arul
Dhruva Kumar
Vivek Sugirtharaj
Richard Kim
Xuewei Qi
R. Madhivanan
Arnie Sen
Dinesh Manocha
23
1
0
15 Aug 2024
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
Jingyun Wang
Guoliang Kang
VLM
SSL
47
7
0
13 Aug 2024
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
Dahyun Kang
Minsu Cho
ObjD
VLM
40
9
0
09 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
46
3
0
31 Jul 2024
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
44
6
0
31 Jul 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
48
24
0
29 Jul 2024
Diffusion Feedback Helps CLIP See Better
Wenxuan Wang
Quan-Sen Sun
Fan Zhang
Yepeng Tang
Jing Liu
Xinlong Wang
VLM
46
14
0
29 Jul 2024
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering
Simon Boeder
Fabian Gigengack
Benjamin Risse
50
7
0
24 Jul 2024
No Re-Train, More Gain: Upgrading Backbones with Diffusion model for Pixel-Wise and Weakly-Supervised Few-Shot Segmentation
Shuai Chen
Fanman Meng
Chenhao Wu
Haoran Wei
Runtong Zhang
Qingbo Wu
Linfeng Xu
Hongliang Li
33
0
0
23 Jul 2024
OpenSU3D: Open World 3D Scene Understanding using Foundation Models
Rafay Mohiuddin
Sai Manoj Prakhya
Fiona Collins
Ziyuan Liu
André Borrmann
38
2
0
19 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
59
4
0
18 Jul 2024
Open-World Visual Reasoning by a Neuro-Symbolic Program of Zero-Shot Symbols
Gertjan J. Burghouts
Fieke Hillerstrom
Erwin Walraven
M. V. Bekkum
Frank Ruis
J. Sijs
Jelle van Mil
Judith Dijk
NAI
24
1
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
48
2
0
18 Jul 2024
Beyond Mask: Rethinking Guidance Types in Few-shot Segmentation
Shijie Chang
Youwei Pang
Xiaoqi Zhao
Lihe Zhang
Huchuan Lu
39
1
0
16 Jul 2024
Quantized Prompt for Efficient Generalization of Vision-Language Models
Tianxiang Hao
Xiaohan Ding
Juexiao Feng
Yuhong Yang
Hui Chen
Guiguang Ding
VLM
MQ
32
5
0
15 Jul 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
51
1
0
14 Jul 2024
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
Ruihuang Li
Zhengqiang Zhang
Chenhang He
Zhiyuan Ma
Vishal M. Patel
Lei Zhang
3DV
VLM
42
5
0
13 Jul 2024
Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Jun Zhu
Zihao Du
Haotian Xu
Fengbo Lan
Zilong Zheng
Bo Ma
Shengjie Wang
Tao Zhang
36
4
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
46
3
0
12 Jul 2024
OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Meng Wei
Tai Wang
Yilun Chen
Hanqing Wang
Jiangmiao Pang
Xihui Liu
VLM
49
3
0
12 Jul 2024
Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Jinlong Li
Zequn Jie
Elisa Ricci
Lin Ma
N. Sebe
VLM
39
0
0
11 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
39
15
0
11 Jul 2024
Segment Any 4D Gaussians
Shengxiang Ji
Guanjun Wu
Jiemin Fang
Jiazhong Cen
Taoran Yi
Wenyu Liu
Qi Tian
Xinggang Wang
3DGS
35
7
0
05 Jul 2024
Open Panoramic Segmentation
Junwei Zheng
Ruiping Liu
Yufan Chen
Kunyu Peng
Chengzhi Wu
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
VLM
36
7
0
02 Jul 2024
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction
Xuan Yu
Yili Liu
Chenrui Han
Sitong Mao
Shunbo Zhou
R. Xiong
Yiyi Liao
Yue Wang
ISeg
46
2
0
01 Jul 2024
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Zihan Gao
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Yuwei Guo
Shuyuan Yang
34
0
0
01 Jul 2024
3D Feature Distillation with Object-Centric Priors
Georgios Tziafas
Yucheng Xu
Zhibin Li
H. Kasaei
34
1
0
26 Jun 2024
Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps
Dicong Qiu
Wenzong Ma
Zhenfu Pan
Hui Xiong
Junwei Liang
LM&Ro
39
7
0
26 Jun 2024
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
Ruihuang Li
Liyi Chen
Zhengqiang Zhang
Varun Jampani
Vishal M. Patel
Lei Zhang
DiffM
42
0
0
25 Jun 2024
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
Thomas Stegmüller
Tim Lebailly
Nikola Dukic
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
VLM
39
1
0
23 Jun 2024
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
Rushikesh Zawar
Shaurya Dewan
Andrew F. Luo
Margaret M. Henderson
Michael J. Tarr
Leila Wehbe
VGen
CoGe
44
1
0
19 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
42
0
0
17 Jun 2024
Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox
Xingming Long
Jie Zhang
Shiguang Shan
Xilin Chen
OODD
37
1
0
14 Jun 2024
ICE-G: Image Conditional Editing of 3D Gaussian Splats
Vishnu Jaganathan
Hannah Hanyun Huang
Muhammad Zubair Irshad
Varun Jampani
Amit Raj
Z. Kira
3DGS
34
8
0
12 Jun 2024
OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding
Yinan Deng
Jiahui Wang
Jingyu Zhao
Jianyu Dou
Yi Yang
Yufeng Yue
AI4CE
30
6
0
12 Jun 2024
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man
Liang-Yan Gui
Yu-Xiong Wang
43
12
0
11 Jun 2024
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
S. Linok
T. Zemskova
Svetlana Ladanova
Roman Titkov
Dmitry A. Yudin
Maxim Monastyrny
Aleksei Valenkov
LM&Ro
54
3
0
11 Jun 2024