Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.03546
Cited By
Language-driven Semantic Segmentation
10 January 2022
Boyi Li
Kilian Q. Weinberger
Serge Belongie
V. Koltun
René Ranftl
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Language-driven Semantic Segmentation"
50 / 478 papers shown
Title
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill
Wenzhe Cai
Siyuan Huang
Guangran Cheng
Yuxing Long
Peng Gao
Changyin Sun
Hao Dong
LM&Ro
25
41
0
19 Sep 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
Chi Zhang
Wei Yin
Gang Yu
Zhibin Wang
Tao Chen
Bin-Bin Fu
Qiufeng Wang
Chunhua Shen
MDE
119
5
0
18 Sep 2023
CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation
Chen Jiang
Yuchen Yang
Martin Jägersand
31
1
0
17 Sep 2023
Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping
Adam Rashid
Satvik Sharma
C. Kim
J. Kerr
L. Chen
Angjoo Kanazawa
Ken Goldberg
62
85
0
14 Sep 2023
Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation
Yixing Lu
Zhaoxin Fan
Min Xu
27
0
0
12 Sep 2023
Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Yunhao Ge
Lyne Tchapmi
Brian Nlong Zhao
Neel Joshi
Laurent Itti
Vibhav Vineet
DiffM
35
14
0
12 Sep 2023
Panoptic Vision-Language Feature Fields
Haoran Chen
Kenneth Blomqvist
Francesco Milano
Roland Siegwart
VLM
21
13
0
11 Sep 2023
From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao
Qi Yang
Feng Zhou
Changshui Zhang
33
17
0
08 Sep 2023
SLiMe: Segment Like Me
Aliasghar Khani
Saeid Asgari Taghanaki
Aditya Sanghi
Ali Mahdavi-Amiri
Ghassan Hamarneh
VLM
34
30
0
06 Sep 2023
Recognition of Heat-Induced Food State Changes by Time-Series Use of Vision-Language Model for Cooking Robot
Naoaki Kanazawa
Kento Kawaharazuka
Yoshiki Obinata
K. Okada
Masayuki Inaba
LM&Ro
11
5
0
04 Sep 2023
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment
Cheng Shi
Sibei Yang
VLM
ObjD
38
38
0
03 Sep 2023
Contrastive Feature Masking Open-Vocabulary Vision Transformer
Dahun Kim
A. Angelova
Weicheng Kuo
ObjD
VLM
23
27
0
02 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Chaofan Ma
Yu-Hao Yang
Chen Ju
Fei Zhang
Ya-Qin Zhang
Yanfeng Wang
VLM
48
17
0
31 Aug 2023
Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
Dongwon Kim
Nam-Won Kim
Cuiling Lan
Suha Kwak
VLM
42
19
0
29 Aug 2023
PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning
M. Han
Heliang Zheng
Chaoyue Wang
Yong Luo
Han Hu
Jing Zhang
Yonggang Wen
VLM
32
3
0
24 Aug 2023
Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields
H. Song
Seokhun Choi
Hoseok Do
Chul Lee
Taehyeong Kim
DiffM
33
24
0
23 Aug 2023
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
33
1
0
20 Aug 2023
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models
Dohwan Ko
Ji Soo Lee
M. Choi
Jaewon Chu
Jihwan Park
Hyunwoo J. Kim
22
5
0
18 Aug 2023
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
Francesco Taioli
Federico Cunico
Federico Girella
Riccardo Bologna
Alessandro Farinelli
Marco Cristani
23
7
0
17 Aug 2023
Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond
Chen Shuai
Meng Fanman
Runtong Zhang
Heqian Qiu
Hongliang Li
Wu Qingbo
Xu Linfeng
VLM
30
12
0
15 Aug 2023
SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning
Muzhi Zhu
Hengtao Li
Hao Chen
Chengxiang Fan
Wei Mao
Chenchen Jing
Yifan Liu
Chunhua Shen
VLM
34
17
0
12 Aug 2023
Follow Anything: Open-set detection, tracking, and following in real-time
Alaa Maalouf
Ninad Jadhav
Krishna Murthy Jatavallabhula
Makram Chahine
Daniel M.Vogt
Robert J. Wood
Antonio Torralba
Daniela Rus
24
24
0
10 Aug 2023
Scene-Generalizable Interactive Segmentation of Radiance Fields
Songlin Tang
Wenjie Pei
Xin Tao
Tanghui Jia
Guangming Lu
Yu-Wing Tai
20
11
0
09 Aug 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLM
CLIP
42
136
0
04 Aug 2023
Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
3DV
VLM
21
28
0
01 Aug 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
Junjie Fei
Teng Wang
Jinrui Zhang
Zhenyu He
Chengjie Wang
Feng Zheng
VLM
28
34
0
31 Jul 2023
CARTIER: Cartographic lAnguage Reasoning Targeted at Instruction Execution for Robots
D. Rivkin
Nikhil Kakodkar
F. Hogan
Bobak H. Baghi
Gregory Dudek
LM&Ro
21
3
0
21 Jul 2023
See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data
Yuhang Lu
Qingnan Jiang
Runnan Chen
Yuenan Hou
Xinge Zhu
Yuexin Ma
3DPC
31
19
0
20 Jul 2023
Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Anindya Mondal
Sauradip Nag
J. Prada
Xiatian Zhu
Anjan Dutta
23
9
0
20 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
31
32
0
18 Jul 2023
Unified Open-Vocabulary Dense Visual Prediction
Hengcan Shi
Munawar Hayat
Jianfei Cai
ObjD
VLM
43
19
0
17 Jul 2023
Self-regulating Prompts: Foundational Model Adaptation without Forgetting
Muhammad Uzair Khattak
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Ming Yang
F. Khan
VLM
23
166
0
13 Jul 2023
TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation
P. Grimal
Hervé Le Borgne
Olivier Ferret
Julien Tourille
EGVM
42
10
0
11 Jul 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Feng Li
Hao Zhang
Pei Sun
Xueyan Zou
Siyi Liu
Jianwei Yang
Chun-yue Li
Lei Zhang
Jianfeng Gao
VLM
37
173
0
10 Jul 2023
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability
Xuanlin Li
Yunhao Fang
Minghua Liu
Z. Ling
Z. Tu
Haoran Su
VLM
31
23
0
06 Jul 2023
Hierarchical Open-vocabulary Universal Image Segmentation
Xudong Wang
Shufang Li
Konstantinos Kallidromitis
Yu Kato
Kazuki Kozuka
Trevor Darrell
VLM
OCL
43
36
0
03 Jul 2023
Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation
Balamurali Murugesan
Rukhshanda Hussain
Rajarshi Bhattacharya
Ismail Ben Ayed
Jose Dolz
VLM
VPVLM
26
4
0
30 Jun 2023
Seeing in Words: Learning to Classify through Language Bottlenecks
Khalid Saifullah
Yuxin Wen
Jonas Geiping
Micah Goldblum
Tom Goldstein
VLM
15
2
0
29 Jun 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Guohao Li
Dacheng Tao
ObjD
VLM
34
136
0
28 Jun 2023
What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation
Benedikt Blumenstiel
Johannes Jakubik
Hilde Kuhne
Michael Vossing
VLM
32
15
0
27 Jun 2023
Explainable Multimodal Emotion Recognition
Zheng Lian
Haiyang Sun
Guoying Zhao
Hao Gu
Zhuofan Wen
...
Shan Liang
Ya Li
Jiangyan Yi
B. Liu
Jianhua Tao
MLLM
15
6
0
27 Jun 2023
DesCo: Learning Object Recognition with Rich Language Descriptions
Liunian Harold Li
Zi-Yi Dou
Nanyun Peng
Kai-Wei Chang
ObjD
VLM
28
20
0
24 Jun 2023
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayca Takmaz
Elisabetta Fedele
R. Sumner
Marc Pollefeys
F. Tombari
Francis Engelmann
ISeg
VLM
31
163
0
23 Jun 2023
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields
Ori Gordon
Omri Avrahami
Dani Lischinski
DiffM
29
22
0
22 Jun 2023
Exploring the Application of Large-scale Pre-trained Models on Adverse Weather Removal
Zhentao Tan
Yue-bo Wu
Qiankun Liu
Qi Chu
Le Lu
Jieping Ye
Nenghai Yu
42
11
0
15 Jun 2023
EPIC Fields: Marrying 3D Geometry and Video Understanding
Vadim Tschernezki
Ahmad Darkhalil
Zhifan Zhu
David Fouhey
Iro Laina
Diane Larlus
Dima Damen
Andrea Vedaldi
EgoV
40
30
0
14 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
33
7
0
14 Jun 2023
GeneCIS: A Benchmark for General Conditional Image Similarity
S. Vaze
Nicolas Carion
Ishan Misra
VLM
DiffM
31
26
0
13 Jun 2023
RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models
Xing-Chun Zhou
Ying He
F. Richard Yu
Jianqiang Li
You Li
DiffM
12
18
0
09 Jun 2023
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks
Yanan Sun
Zi-Qi Zhong
Qi Fan
Chi-Keung Tang
Yu-Wing Tai
VLM
33
4
0
07 Jun 2023
Previous
1
2
3
...
10
6
7
8
9
Next