ResearchTrend.AI

Language-driven Semantic Segmentation


10 January 2022
Boyi Li
Kilian Q. Weinberger
Serge J. Belongie
V. Koltun
René Ranftl
    VLM

Papers citing "Language-driven Semantic Segmentation"

50 / 478 papers shown
Auto-Vocabulary Semantic Segmentation
Osman Ülger
Maksymilian Kulicki
Yuki M. Asano
Martin R. Oswald
VLM
45
2
0
07 Dec 2023
Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding
Guofeng Mei
Luigi Riz
Yiming Wang
Fabio Poiesi
3DPC
30
6
0
04 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
55
15
0
04 Dec 2023
Segment and Caption Anything
Xiaoke Huang
Jianfeng Wang
Yansong Tang
Zheng Zhang
Han Hu
Jiwen Lu
Lijuan Wang
Zicheng Liu
MLLM
VLM
28
18
0
01 Dec 2023
DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image
Daoyi Gao
Dávid Rozenberszki
Stefan Leutenegger
Angela Dai
DiffM
27
11
0
30 Nov 2023
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding
Jin-Chuan Shi
Miao Wang
Hao-Bin Duan
Shao-Hua Guan
3DGS
46
84
0
30 Nov 2023
A Simple Recipe for Language-guided Domain Generalized Segmentation
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
23
14
0
29 Nov 2023
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Lukas Hoyer
D. Tan
Muhammad Ferjad Naeem
Luc Van Gool
F. Tombari
VLM
MLLM
36
16
0
27 Nov 2023
ViT-Lens: Towards Omni-modal Representations
Weixian Lei
Yixiao Ge
Kun Yi
Jianfeng Zhang
Difei Gao
Dylan Sun
Yuying Ge
Ying Shan
Mike Zheng Shou
21
18
0
27 Nov 2023
End-to-End Breast Cancer Radiotherapy Planning via LMMs with Consistency Embedding
Kwanyoung Kim
Y. Oh
S. Park
H. Byun
Joongyo Lee
Jin Sung Kim
Yong Bae Kim
Jong Chul Ye
23
0
0
27 Nov 2023
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Bin Xie
Jiale Cao
Jin Xie
Fahad Shahbaz Khan
Yanwei Pang
VLM
28
43
0
27 Nov 2023
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
Zhihao Yuan
Jinke Ren
Chun-Mei Feng
Hengshuang Zhao
Shuguang Cui
Zhen Li
39
26
0
26 Nov 2023
Text and Click inputs for unambiguous open vocabulary instance segmentation
Nikolai Warner
Meera Hahn
Jonathan Huang
Irfan Essa
Vighnesh Birodkar
VLM
27
0
0
24 Nov 2023
DAE-Net: Deforming Auto-Encoder for fine-grained shape co-segmentation
Zhiqin Chen
Qimin Chen
Hang Zhou
Hao Zhang
3DPC
3DV
37
2
0
22 Nov 2023
OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning
Haiyang Ying
Yixuan Yin
Jinzhi Zhang
Fan Wang
Tao Yu
Ruqi Huang
Lu Fang
15
29
0
20 Nov 2023
Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models
Yimeng Li
Navid Rajabi
Sulabh Shrestha
Md. Alimoor Reza
Jana Kosecka
18
3
0
17 Nov 2023
GOAT: GO to Any Thing
Matthew Chang
Théophile Gervet
Mukul Khanna
Sriram Yenamandra
Dhruv Shah
...
Saurabh Gupta
Dhruv Batra
Roozbeh Mottaghi
Jitendra Malik
Devendra Singh Chaplot
28
65
0
10 Nov 2023
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion
Hao Zhou
Tiancheng Shen
Xu Yang
Hai Huang
Xiangtai Li
Lu Qi
Ming-Hsuan Yang
89
12
0
06 Nov 2023
OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data
Shiyang Lu
Haonan Chang
E. Jing
Abdeslam Boularias
Kostas Bekris
18
54
0
06 Nov 2023
FocusTune: Tuning Visual Localization through Focus-Guided Sampling
Son Tung Nguyen
Alejandro Fontan
Michael Milford
Tobias Fischer
31
11
0
06 Nov 2023
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
Fei Zhang
Tianfei Zhou
Boyang Li
Hao He
Chaofan Ma
Tianjiao Zhang
Jiangchao Yao
Ya-Qin Zhang
Yanfeng Wang
VLM
45
17
0
29 Oct 2023
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
Tsun-Hsuan Wang
Alaa Maalouf
Wei Xiao
Yutong Ban
Alexander Amini
Guy Rosman
S. Karaman
Daniela Rus
27
42
0
26 Oct 2023
Lang3DSG: Language-based contrastive pre-training for 3D Scene Graph prediction
Sebastian Koch
Pedro Hermosilla
Narunas Vaskevicius
Mirco Colosi
Timo Ropinski
37
9
0
25 Oct 2023
Open-NeRF: Towards Open Vocabulary NeRF Decomposition
Hao Zhang
Fang Li
Narendra Ahuja
24
11
0
25 Oct 2023
4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via Semantic Distillation
Dadong Jiang
Zhihui Ke
Xiaobo Zhou
Xidong Shi
VGen
DiffM
30
4
0
25 Oct 2023
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting
Lei Li
26
23
0
24 Oct 2023
SILC: Improving Vision Language Pretraining with Self-Distillation
Muhammad Ferjad Naeem
Yongqin Xian
Xiaohua Zhai
Lukas Hoyer
Luc Van Gool
F. Tombari
VLM
26
33
0
20 Oct 2023
NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving
Kaustab Pal
Aditya Sharma
Mohd. Omama
Parth N. Shah
K. M. Krishna
21
0
0
19 Oct 2023
Vision and Language Navigation in the Real World via Online Visual Language Mapping
Chengguang Xu
Hieu T. Nguyen
Christopher Amato
Lawson L. S. Wong
32
9
0
16 Oct 2023
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
Rujie Wu
Xiaojian Ma
Zhenliang Zhang
Wei Wang
Qing Li
Song-Chun Zhu
Yizhou Wang
LRM
VLM
27
7
0
16 Oct 2023
Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation
Yinpei Dai
Run Peng
Sikai Li
Joyce Chai
LM&Ro
40
24
0
12 Oct 2023
CLIP for Lightweight Semantic Segmentation
Ke Jin
Wankou Yang
VLM
21
1
0
11 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Yikang Shen
Zhenfang Chen
Mingyu Ding
Chuang Gan
51
15
0
10 Oct 2023
Open-Vocabulary Animal Keypoint Detection with Semantic-feature Matching
Hao Zhang
Lumin Xu
Shenqi Lai
Wenqi Shao
Nanning Zheng
Ping Luo
Yu Qiao
Kaipeng Zhang
ObjD
VLM
27
8
0
08 Oct 2023
Compositional Semantics for Open Vocabulary Spatio-semantic Representations
Robin Karlsson
Francisco Lepe-Salazar
K. Takeda
VLM
53
1
0
08 Oct 2023
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Kashu Yamazaki
Taisei Hanyu
Khoa T. Vo
Thang M. Pham
Minh-Triet Tran
Gianfranco Doretto
Anh Nguyen
Ngan Le
24
25
0
05 Oct 2023
Point-Based Radiance Fields for Controllable Human Motion Synthesis
Haitao Yu
Deheng Zhang
Peiyuan Xie
Tianyi Zhang
3DH
26
4
0
05 Oct 2023
ALT-Pilot: Autonomous navigation with Language augmented Topometric maps
Mohammad Omama
Pranav Inani
Pranjal Paul
Sarat Chandra Yellapragada
Krishna Murthy Jatavallabhula
Sandeep P. Chinchali
Madhava Krishna
25
13
0
03 Oct 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Xu Zheng
Hiroshi Murase
VLM
17
9
0
03 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
31
16
0
30 Sep 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Yuanyi Zhong
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
36
166
0
28 Sep 2023
FLIP: Cross-domain Face Anti-spoofing with Language Guidance
K. Srivatsan
Muzammal Naseer
Karthik Nandakumar
CVBM
47
44
0
28 Sep 2023
Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Haonan Chang
Kowndinya Boyalakuntla
Shiyang Lu
Siwei Cai
E. Jing
...
Shijie Geng
Adeeb Abbas
Lifeng Zhou
Kostas Bekris
Abdeslam Boularias
14
26
0
27 Sep 2023
Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Mahyar Najibi
Jingwei Ji
Yin Zhou
C. Qi
Xinchen Yan
Scott Ettinger
Drago Anguelov
19
27
0
25 Sep 2023
MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP
Prajwal Ganugula
Y. Kumar
N. Reddy
Prabhath Chellingi
A. Thakur
Neeraj Kasera
C. S. Anand
CLIP
DiffM
11
3
0
24 Sep 2023
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
Yun Xing
Jian Kang
Aoran Xiao
Jiahao Nie
Ling Shao
Shijian Lu
VLM
38
12
0
24 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
69
35
0
22 Sep 2023
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
Jianing Yang
Xuweiyi Chen
Shengyi Qian
Nikhil Madaan
Madhavan Iyengar
David Fouhey
Joyce Chai
LM&Ro
LLMAG
43
84
0
21 Sep 2023
Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation
Tuan V. Vo
Minh Nhat Vu
Baoru Huang
Toan Tien Nguyen
Ngan Le
T. Vo
Anh Nguyen
3DPC
21
10
0
19 Sep 2023
PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes
Xiao Fu
Shangzhan Zhang
Tianrun Chen
Yichong Lu
Xiaowei Zhou
Andreas Geiger
Yiyi Liao
3DPC
19
8
0
19 Sep 2023