Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01071
Cited By
Extract Free Dense Labels from CLIP
2 December 2021
Chong Zhou
Chen Change Loy
Bo Dai
VLM
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Extract Free Dense Labels from CLIP"
50 / 343 papers shown
Title
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Junfei Xiao
Ziqi Zhou
Wenxuan Li
Shiyi Lan
Jieru Mei
Zhiding Yu
Alan L. Yuille
Yuyin Zhou
Cihang Xie
VLM
19
1
0
21 Dec 2023
Weakly Supervised Semantic Segmentation for Driving Scenes
Dongseob Kim
Seungho Lee
Junsuk Choe
Hyunjung Shim
7
3
0
21 Dec 2023
TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
Yuqi Lin
Minghao Chen
Kaipeng Zhang
Hengjia Li
Mingming Li
Zheng Yang
Dongqin Lv
Binbin Lin
Haifeng Liu
Deng Cai
CLIP
VLM
46
11
0
20 Dec 2023
Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation
Wenhao Xu
Rongtao Xu
Changwei Wang
Shibiao Xu
Li Guo
Man Zhang
Xiaopeng Zhang
VLM
33
10
0
20 Dec 2023
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation
Monika Wysoczañska
Oriane Siméoni
Michael Ramamonjisoa
Andrei Bursuc
Tomasz Trzciñski
Patrick Pérez
VLM
CLIP
34
29
0
19 Dec 2023
Zero-shot Building Attribute Extraction from Large-Scale Vision and Language Models
Fei Pan
Sangryul Jeon
Brian Wang
Frank Mckenna
Stella X. Yu
44
2
0
19 Dec 2023
CLIM: Contrastive Language-Image Mosaic for Region Representation
Size Wu
Wenwei Zhang
Lumin Xu
Sheng Jin
Wentao Liu
Chen Change Loy
ObjD
VLM
52
15
0
18 Dec 2023
Tokenize Anything via Prompting
Ting Pan
Lulu Tang
Xinlong Wang
Shiguang Shan
VLM
28
22
0
14 Dec 2023
Foundation Models in Robotics: Applications, Challenges, and the Future
Roya Firoozi
Johnathan Tucker
Stephen Tian
Anirudha Majumdar
Jiankai Sun
...
Brian Ichter
Danny Driess
Jiajun Wu
Cewu Lu
Mac Schwager
LM&Ro
AI4CE
LRM
VLM
37
140
0
13 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip H. S. Torr
Xiuye Gu
Siyang Li
VLM
CLIP
33
32
0
12 Dec 2023
Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Yuanbin Wang
Shaofei Huang
Yulu Gao
Zhen Wang
Rui Wang
Kehua Sheng
Bo-Wen Zhang
Si Liu
VLM
30
13
0
12 Dec 2023
Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations
Xiao Zhang
David Yunis
Michael Maire
25
2
0
11 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
VLM
CLIP
48
83
0
06 Dec 2023
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
B. Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
37
145
0
04 Dec 2023
Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection
Fan Lu
Kai Zhu
Kecheng Zheng
Wei Zhai
Xuemiao Xu
OODD
155
4
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan L. Yuille
VLM
29
55
0
04 Dec 2023
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training
Che Liu
Ouyang Cheng
Sibo Cheng
Anand Shah
Wenjia Bai
Rossella Arcucci
VLM
MedIm
23
8
0
03 Dec 2023
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers
Walid Bousselham
Felix Petersen
Vittorio Ferrari
Hilde Kuehne
ObjD
VLM
42
39
0
01 Dec 2023
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Mu Cai
Haotian Liu
Dennis Park
Siva Karthik Mustikovela
Gregory P. Meyer
Yuning Chai
Yong Jae Lee
VLM
LRM
MLLM
46
85
0
01 Dec 2023
Open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
23
10
0
01 Dec 2023
A Simple Recipe for Language-guided Domain Generalized Segmentation
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
23
14
0
29 Nov 2023
One-Shot Open Affordance Learning with Foundation Models
Gen Li
Deqing Sun
Laura Sevilla-Lara
Varun Jampani
VLM
70
22
0
29 Nov 2023
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
Jiayun Luo
Siddhesh Khandelwal
Leonid Sigal
Boyang Albert Li
MLLM
VLM
35
7
0
28 Nov 2023
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Lukas Hoyer
D. Tan
Muhammad Ferjad Naeem
Luc Van Gool
F. Tombari
VLM
MLLM
36
16
0
27 Nov 2023
Spatially Covariant Image Registration with Text Prompts
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
35
9
0
27 Nov 2023
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Bin Xie
Jiale Cao
Jin Xie
Fahad Shahbaz Khan
Yanwei Pang
VLM
28
43
0
27 Nov 2023
Language-guided Few-shot Semantic Segmentation
Jing Wang
Yuang Liu
Qiang-feng Zhou
Fan Wang
VLM
22
3
0
23 Nov 2023
Open-Vocabulary Camouflaged Object Segmentation
Youwei Pang
Xiaoqi Zhao
Jiaming Zuo
Lihe Zhang
Huchuan Lu
VLM
ObjD
31
6
0
19 Nov 2023
CLIP Guided Image-perceptive Prompt Learning for Image Enhancement
Weiwen Chen
Qiuhong Ke
Zinuo Li
CLIP
VLM
16
2
0
07 Nov 2023
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion
Hao Zhou
Tiancheng Shen
Xu Yang
Hai Huang
Xiangtai Li
Lu Qi
Ming-Hsuan Yang
89
12
0
06 Nov 2023
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
Fei Zhang
Tianfei Zhou
Boyang Li
Hao He
Chaofan Ma
Tianjiao Zhang
Jiangchao Yao
Ya-Qin Zhang
Yanfeng Wang
VLM
42
17
0
29 Oct 2023
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
Yuchen Suo
Linchao Zhu
Yi Yang
31
13
0
27 Oct 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar
Gilles Puy
Spyros Gidaris
Alexandre Boulch
Oriane Siméoni
Corentin Sautier
Patrick Pérez
Andrei Bursuc
Renaud Marlet
104
18
0
26 Oct 2023
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
38
2
0
23 Oct 2023
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application
Bo Yuan
Danpei Zhao
3DV
CLL
32
10
0
22 Oct 2023
SILC: Improving Vision Language Pretraining with Self-Distillation
Muhammad Ferjad Naeem
Yongqin Xian
Xiaohua Zhai
Lukas Hoyer
Luc Van Gool
F. Tombari
VLM
26
33
0
20 Oct 2023
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds
Sipeng Zheng
Jiazheng Liu
Yicheng Feng
Zongqing Lu
42
29
0
20 Oct 2023
Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models
Zhaozheng Chen
Qianru Sun
VLM
27
7
0
19 Oct 2023
Image Clustering with External Guidance
Yunfan Li
Peng Hu
Dezhong Peng
Jiancheng Lv
Jianping Fan
Xi Peng
23
10
0
18 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
13
7
0
17 Oct 2023
Label-efficient Segmentation via Affinity Propagation
Wentong Li
Yuqian Yuan
Song Wang
Wenyu Liu
Dongqi Tang
Jian Liu
Jianke Zhu
Lei Zhang
32
5
0
16 Oct 2023
SAIR: Learning Semantic-aware Implicit Representation
Canyu Zhang
Xiaoguang Li
Qing-Wu Guo
Song Wang
36
3
0
13 Oct 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
31
2
0
12 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Yikang Shen
Zhenfang Chen
Mingyu Ding
Chuang Gan
51
15
0
10 Oct 2023
OV-PARTS: Towards Open-Vocabulary Part Segmentation
Meng Wei
Xiaoyu Yue
Wenwei Zhang
Shu Kong
Xihui Liu
Jiangmiao Pang
VLM
26
24
0
08 Oct 2023
Compositional Semantics for Open Vocabulary Spatio-semantic Representations
Robin Karlsson
Francisco Lepe-Salazar
K. Takeda
VLM
53
1
0
08 Oct 2023
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Kashu Yamazaki
Taisei Hanyu
Khoa T. Vo
Thang M. Pham
Minh-Triet Tran
Gianfranco Doretto
Anh Nguyen
Ngan Le
24
25
0
05 Oct 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Xu Zheng
Hiroshi Murase
VLM
17
9
0
03 Oct 2023
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
Size Wu
Wenwei Zhang
Lumin Xu
Sheng Jin
Xiangtai Li
Wentao Liu
Chen Change Loy
CLIP
VLM
24
69
0
02 Oct 2023
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Shilin Xu
Xiangtai Li
Size Wu
Wenwei Zhang
Yunhai Tong
Chen Change Loy
ObjD
VLM
31
0
0
02 Oct 2023
Previous
1
2
3
4
5
6
7
Next