ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01071
  4. Cited By
Extract Free Dense Labels from CLIP

Extract Free Dense Labels from CLIP

2 December 2021
Chong Zhou
Chen Change Loy
Bo Dai
    VLM
    CLIP
ArXivPDFHTML

Papers citing "Extract Free Dense Labels from CLIP"

50 / 343 papers shown
Title
A Semantic Space is Worth 256 Language Descriptions: Make Stronger
  Segmentation Models with Descriptive Properties
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Junfei Xiao
Ziqi Zhou
Wenxuan Li
Shiyi Lan
Jieru Mei
Zhiding Yu
Alan L. Yuille
Yuyin Zhou
Cihang Xie
VLM
19
1
0
21 Dec 2023
Weakly Supervised Semantic Segmentation for Driving Scenes
Weakly Supervised Semantic Segmentation for Driving Scenes
Dongseob Kim
Seungho Lee
Junsuk Choe
Hyunjung Shim
7
3
0
21 Dec 2023
TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary
  Multi-Label Classification of CLIP Without Training
TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
Yuqi Lin
Minghao Chen
Kaipeng Zhang
Hengjia Li
Mingming Li
Zheng Yang
Dongqin Lv
Binbin Lin
Haifeng Liu
Deng Cai
CLIP
VLM
46
11
0
20 Dec 2023
Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic
  Segmentation
Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation
Wenhao Xu
Rongtao Xu
Changwei Wang
Shibiao Xu
Li Guo
Man Zhang
Xiaopeng Zhang
VLM
33
10
0
20 Dec 2023
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary
  semantic segmentation
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation
Monika Wysoczañska
Oriane Siméoni
Michael Ramamonjisoa
Andrei Bursuc
Tomasz Trzciñski
Patrick Pérez
VLM
CLIP
34
29
0
19 Dec 2023
Zero-shot Building Attribute Extraction from Large-Scale Vision and
  Language Models
Zero-shot Building Attribute Extraction from Large-Scale Vision and Language Models
Fei Pan
Sangryul Jeon
Brian Wang
Frank Mckenna
Stella X. Yu
44
2
0
19 Dec 2023
CLIM: Contrastive Language-Image Mosaic for Region Representation
CLIM: Contrastive Language-Image Mosaic for Region Representation
Size Wu
Wenwei Zhang
Lumin Xu
Sheng Jin
Wentao Liu
Chen Change Loy
ObjD
VLM
52
15
0
18 Dec 2023
Tokenize Anything via Prompting
Tokenize Anything via Prompting
Ting Pan
Lulu Tang
Xinlong Wang
Shiguang Shan
VLM
28
22
0
14 Dec 2023
Foundation Models in Robotics: Applications, Challenges, and the Future
Foundation Models in Robotics: Applications, Challenges, and the Future
Roya Firoozi
Johnathan Tucker
Stephen Tian
Anirudha Majumdar
Jiankai Sun
...
Brian Ichter
Danny Driess
Jiajun Wu
Cewu Lu
Mac Schwager
LM&Ro
AI4CE
LRM
VLM
37
140
0
13 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip H. S. Torr
Xiuye Gu
Siyang Li
VLM
CLIP
33
32
0
12 Dec 2023
Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic
  Segmentation
Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Yuanbin Wang
Shaofei Huang
Yulu Gao
Zhen Wang
Rui Wang
Kehua Sheng
Bo-Wen Zhang
Si Liu
VLM
30
13
0
12 Dec 2023
Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering
  of Layer-Distributed Neural Representations
Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations
Xiao Zhang
David Yunis
Michael Maire
25
2
0
11 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
VLM
CLIP
48
83
0
06 Dec 2023
Repurposing Diffusion-Based Image Generators for Monocular Depth
  Estimation
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
B. Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
37
145
0
04 Dec 2023
Likelihood-Aware Semantic Alignment for Full-Spectrum
  Out-of-Distribution Detection
Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection
Fan Lu
Kai Zhu
Kecheng Zheng
Wei Zhai
Xuemiao Xu
OODD
155
4
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan L. Yuille
VLM
29
55
0
04 Dec 2023
G2D: From Global to Dense Radiography Representation Learning via
  Vision-Language Pre-training
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training
Che Liu
Ouyang Cheng
Sibo Cheng
Anand Shah
Wenjia Bai
Rossella Arcucci
VLM
MedIm
23
8
0
03 Dec 2023
Grounding Everything: Emerging Localization Properties in
  Vision-Language Transformers
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers
Walid Bousselham
Felix Petersen
Vittorio Ferrari
Hilde Kuehne
ObjD
VLM
42
39
0
01 Dec 2023
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual
  Prompts
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Mu Cai
Haotian Liu
Dennis Park
Siva Karthik Mustikovela
Gregory P. Meyer
Yuning Chai
Yong Jae Lee
VLM
LRM
MLLM
46
85
0
01 Dec 2023
Open-vocabulary object 6D pose estimation
Open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
23
10
0
01 Dec 2023
A Simple Recipe for Language-guided Domain Generalized Segmentation
A Simple Recipe for Language-guided Domain Generalized Segmentation
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
23
14
0
29 Nov 2023
One-Shot Open Affordance Learning with Foundation Models
One-Shot Open Affordance Learning with Foundation Models
Gen Li
Deqing Sun
Laura Sevilla-Lara
Varun Jampani
VLM
70
22
0
29 Nov 2023
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf
  Vision-Language Models
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
Jiayun Luo
Siddhesh Khandelwal
Leonid Sigal
Boyang Albert Li
MLLM
VLM
35
7
0
28 Nov 2023
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language
  Guidance
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Lukas Hoyer
D. Tan
Muhammad Ferjad Naeem
Luc Van Gool
F. Tombari
VLM
MLLM
36
16
0
27 Nov 2023
Spatially Covariant Image Registration with Text Prompts
Spatially Covariant Image Registration with Text Prompts
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
35
9
0
27 Nov 2023
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Bin Xie
Jiale Cao
Jin Xie
Fahad Shahbaz Khan
Yanwei Pang
VLM
28
43
0
27 Nov 2023
Language-guided Few-shot Semantic Segmentation
Language-guided Few-shot Semantic Segmentation
Jing Wang
Yuang Liu
Qiang-feng Zhou
Fan Wang
VLM
22
3
0
23 Nov 2023
Open-Vocabulary Camouflaged Object Segmentation
Open-Vocabulary Camouflaged Object Segmentation
Youwei Pang
Xiaoqi Zhao
Jiaming Zuo
Lihe Zhang
Huchuan Lu
VLM
ObjD
31
6
0
19 Nov 2023
CLIP Guided Image-perceptive Prompt Learning for Image Enhancement
CLIP Guided Image-perceptive Prompt Learning for Image Enhancement
Weiwen Chen
Qiuhong Ke
Zinuo Li
CLIP
VLM
16
2
0
07 Nov 2023
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion
Hao Zhou
Tiancheng Shen
Xu Yang
Hai Huang
Xiangtai Li
Lu Qi
Ming-Hsuan Yang
89
12
0
06 Nov 2023
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic
  Segmentation
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
Fei Zhang
Tianfei Zhou
Boyang Li
Hao He
Chaofan Ma
Tianjiao Zhang
Jiangchao Yao
Ya-Qin Zhang
Yanfeng Wang
VLM
42
17
0
29 Oct 2023
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
Yuchen Suo
Linchao Zhu
Yi Yang
31
13
0
27 Oct 2023
Three Pillars improving Vision Foundation Model Distillation for Lidar
Three Pillars improving Vision Foundation Model Distillation for Lidar
Gilles Puy
Spyros Gidaris
Alexandre Boulch
Oriane Siméoni
Corentin Sautier
Patrick Pérez
Andrei Bursuc
Renaud Marlet
104
18
0
26 Oct 2023
Videoprompter: an ensemble of foundational models for zero-shot video
  understanding
Videoprompter: an ensemble of foundational models for zero-shot video understanding
Adeel Yousaf
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
Mubarak Shah
VLM
38
2
0
23 Oct 2023
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method
  and Application
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application
Bo Yuan
Danpei Zhao
3DV
CLL
32
10
0
22 Oct 2023
SILC: Improving Vision Language Pretraining with Self-Distillation
SILC: Improving Vision Language Pretraining with Self-Distillation
Muhammad Ferjad Naeem
Yongqin Xian
Xiaohua Zhai
Lukas Hoyer
Luc Van Gool
F. Tombari
VLM
26
33
0
20 Oct 2023
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in
  Open Worlds
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds
Sipeng Zheng
Jiazheng Liu
Yicheng Feng
Zongqing Lu
42
29
0
20 Oct 2023
Weakly-Supervised Semantic Segmentation with Image-Level Labels: from
  Traditional Models to Foundation Models
Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models
Zhaozheng Chen
Qianru Sun
VLM
27
7
0
19 Oct 2023
Image Clustering with External Guidance
Image Clustering with External Guidance
Yunfan Li
Peng Hu
Dezhong Peng
Jiancheng Lv
Jianping Fan
Xi Peng
23
10
0
18 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt
  Foundation Models
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
13
7
0
17 Oct 2023
Label-efficient Segmentation via Affinity Propagation
Label-efficient Segmentation via Affinity Propagation
Wentong Li
Yuqian Yuan
Song Wang
Wenyu Liu
Dongqi Tang
Jian Liu
Jianke Zhu
Lei Zhang
32
5
0
16 Oct 2023
SAIR: Learning Semantic-aware Implicit Representation
SAIR: Learning Semantic-aware Implicit Representation
Canyu Zhang
Xiaoguang Li
Qing-Wu Guo
Song Wang
36
3
0
13 Oct 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided
  Image Editing
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
31
2
0
12 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Yikang Shen
Zhenfang Chen
Mingyu Ding
Chuang Gan
51
15
0
10 Oct 2023
OV-PARTS: Towards Open-Vocabulary Part Segmentation
OV-PARTS: Towards Open-Vocabulary Part Segmentation
Meng Wei
Xiaoyu Yue
Wenwei Zhang
Shu Kong
Xihui Liu
Jiangmiao Pang
VLM
26
24
0
08 Oct 2023
Compositional Semantics for Open Vocabulary Spatio-semantic
  Representations
Compositional Semantics for Open Vocabulary Spatio-semantic Representations
Robin Karlsson
Francisco Lepe-Salazar
K. Takeda
VLM
53
1
0
08 Oct 2023
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene
  Representation
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Kashu Yamazaki
Taisei Hanyu
Khoa T. Vo
Thang M. Pham
Minh-Triet Tran
Gianfranco Doretto
Anh Nguyen
Ngan Le
24
25
0
05 Oct 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive
  Zero-shot Semantic Segmentation
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Xu Zheng
Hiroshi Murase
VLM
17
9
0
03 Oct 2023
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense
  Prediction
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
Size Wu
Wenwei Zhang
Lumin Xu
Sheng Jin
Xiangtai Li
Wentao Liu
Chen Change Loy
CLIP
VLM
24
69
0
02 Oct 2023
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object
  Detection
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Shilin Xu
Xiangtai Li
Size Wu
Wenwei Zhang
Yunhai Tong
Chen Change Loy
ObjD
VLM
31
0
0
02 Oct 2023
Previous
1234567
Next