Extract Free Dense Labels from CLIP


2 December 2021
Chong Zhou
Chen Change Loy
Bo Dai
    VLM
    CLIP

Papers citing "Extract Free Dense Labels from CLIP"

50 / 343 papers shown
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation
Siyu Jiao
Yunchao Wei
Yaowei Wang
Yao-Min Zhao
Humphrey Shi
VLM
33
47
0
30 Sep 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Qiao Gu
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
36
166
0
28 Sep 2023
CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free
Monika Wysoczańska
Michael Ramamonjisoa
Tomasz Trzciński
Oriane Siméoni
3DV
VLM
32
20
0
25 Sep 2023
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
Yun Xing
Jian Kang
Aoran Xiao
Jiahao Nie
Ling Shao
Shijian Lu
VLM
38
12
0
24 Sep 2023
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Zeyi Huang
Andy Zhou
Zijian Lin
Mu Cai
Haohan Wang
Yong Jae Lee
VLM
OOD
32
28
0
21 Sep 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
Kan Wu
Houwen Peng
Zhenghong Zhou
Bin Xiao
Mengchen Liu
...
Xi Chen
Xinggang Wang
Hongyang Chao
Han Hu
VLM
OODD
29
53
0
21 Sep 2023
DePT: Decoupled Prompt Tuning
Ji Zhang
Shihan Wu
Lianli Gao
Hengtao Shen
Jingkuan Song
VLM
32
27
0
14 Sep 2023
Distribution-Aware Prompt Tuning for Vision-Language Models
Eulrang Cho
Jooyeon Kim
Hyunwoo J. Kim
VPVLM
VLM
32
20
0
06 Sep 2023
Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang
Xiawei Li
Jing Zhang
Qingyuan Xu
Qin Zhou
Qian Yu
Lu Sheng
Dong Xu
VLM
DiffM
26
45
0
06 Sep 2023
Exploring Limits of Diffusion-Synthetic Training with Weakly Supervised Semantic Segmentation
Ryota Yoshihashi
Yuya Otsuka
Kenji Doi
Tomohiro Tanaka
Hirokatsu Kataoka
31
1
0
04 Sep 2023
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment
Cheng Shi
Sibei Yang
VLM
ObjD
38
38
0
03 Sep 2023
Contrastive Feature Masking Open-Vocabulary Vision Transformer
Dahun Kim
A. Angelova
Weicheng Kuo
ObjD
VLM
23
27
0
02 Sep 2023
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Zhening Huang
Xiaoyang Wu
Xi Chen
Hengshuang Zhao
Lei Zhu
Joan Lasenby
ISeg
3DPC
VLM
55
46
0
01 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Chaofan Ma
Yu-Hao Yang
Chen Ju
Fei Zhang
Ya-Qin Zhang
Yanfeng Wang
VLM
48
17
0
31 Aug 2023
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
Hanqiu Deng
Zhaoxiang Zhang
Jinan Bao
Xingyu Li
VLM
30
4
0
30 Aug 2023
Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
Dongwon Kim
Nam-Won Kim
Cuiling Lan
Suha Kwak
VLM
42
19
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
L. Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
32
26
0
28 Aug 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Anton van den Hengel
F. Khan
VLM
60
5
0
24 Aug 2023
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
Junjiao Tian
Lavisha Aggarwal
Andrea Colaco
Z. Kira
Mar González-Franco
DiffM
33
75
0
23 Aug 2023
LCCo: Lending CLIP to Co-Segmentation
Xin Duan
Yan Yang
Liyuan Pan
Xiabi Liu
VLM
39
1
0
22 Aug 2023
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models
Dohwan Ko
Ji Soo Lee
M. Choi
Jaewon Chu
Jihwan Park
Hyunwoo J. Kim
22
5
0
18 Aug 2023
SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
Zheng Sun
Yanghong Zhou
Honghong He
P. Y. Mok
DiffM
32
26
0
15 Aug 2023
Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond
Chen Shuai
Meng Fanman
Runtong Zhang
Heqian Qiu
Hongliang Li
Wu Qingbo
Xu Linfeng
VLM
30
12
0
15 Aug 2023
SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning
Muzhi Zhu
Hengtao Li
Hao Chen
Chengxiang Fan
Wei Mao
Chenchen Jing
Yifan Liu
Chunhua Shen
VLM
34
17
0
12 Aug 2023
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation
Kaixin Cai
Pengzhen Ren
Yi Zhu
Hang Xu
Jian-zhuo Liu
Changlin Li
Guangrun Wang
Xiaodan Liang
VLM
29
14
0
09 Aug 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLM
CLIP
36
136
0
04 Aug 2023
Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
3DV
VLM
21
28
0
01 Aug 2023
CLIP Brings Better Features to Visual Aesthetics Learners
Liwu Xu
Jinjin Xu
Yuzhe Yang
Yi-Jie Huang
Yanchun Xie
Yaqian Li
VLM
35
3
0
28 Jul 2023
Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation
Bokui (William) Shen
Ge Yang
Alan Yu
J. Wong
L. Kaelbling
Phillip Isola
VLM
29
104
0
27 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
38
118
0
25 Jul 2023
See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data
Yuhang Lu
Qingnan Jiang
Runnan Chen
Yuenan Hou
Xinge Zhu
Yuexin Ma
3DPC
29
19
0
20 Jul 2023
Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Anindya Mondal
Sauradip Nag
J. Prada
Xiatian Zhu
Anjan Dutta
23
9
0
20 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
31
32
0
18 Jul 2023
Unified Open-Vocabulary Dense Visual Prediction
Hengcan Shi
Munawar Hayat
Jianfei Cai
ObjD
VLM
43
19
0
17 Jul 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks
Denis Coquenet
Clément Rambour
Emanuele Dalsasso
Nicolas Thome
MLLM
CLIP
VLM
37
1
0
13 Jul 2023
A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task
Shiqi Yang
Atsushi Hashimoto
Yoshitaka Ushiku
DiffM
VLM
43
1
0
06 Jul 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Guohao Li
Dacheng Tao
ObjD
VLM
34
136
0
28 Jun 2023
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayca Takmaz
Elisabetta Fedele
R. Sumner
Marc Pollefeys
F. Tombari
Francis Engelmann
ISeg
VLM
31
163
0
23 Jun 2023
A Universal Semantic-Geometric Representation for Robotic Manipulation
Tong Zhang
Yingdong Hu
Hanchen Cui
Hang Zhao
Yang Gao
70
17
0
18 Jun 2023
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
Le Zhang
Rabiul Awal
Aishwarya Agrawal
CoGe
VLM
31
9
0
15 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
27
10
0
14 Jun 2023
EventCLIP: Adapting CLIP for Event-based Object Recognition
Ziyi Wu
Xudong Liu
Igor Gilitschenski
VLM
26
15
0
10 Jun 2023
Towards Label-free Scene Understanding by Vision Foundation Models
Runnan Chen
You-Chen Liu
Lingdong Kong
Nenglun Chen
Xinge Zhu
Yuexin Ma
Tongliang Liu
Wenping Wang
VLM
31
42
0
06 Jun 2023
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Xiuye Gu
Huayu Chen
Jonathan Huang
Abdullah M. Rashwan
Boxin Wang
...
Golnaz Ghiasi
Weicheng Kuo
Huizhong Chen
Liang-Chieh Chen
David A. Ross
ISeg
28
26
0
02 Jun 2023
LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning
Atsuyuki Miyai
Qing Yu
Go Irie
Kiyoharu Aizawa
OODD
29
64
0
02 Jun 2023
Unsupervised Multi-view Pedestrian Detection
Mengyin Liu
Chao Zhu
Shiqi Ren
Xu-Cheng Yin
34
6
0
21 May 2023
Segment Any Anomaly without Training via Hybrid Prompt Regularization
Yunkang Cao
Xiaohao Xu
Chen Sun
Y. Cheng
Zongwei Du
Liang Gao
Nong Sang
VLM
37
70
0
18 May 2023
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers
Dahun Kim
A. Angelova
Weicheng Kuo
ObjD
ViT
VLM
27
73
0
11 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
21
71
0
09 May 2023
CLIP-S$^4$: Language-Guided Self-Supervised Semantic Segmentation
Wenbin He
Suphanut Jamonnak
Liangke Gou
Liu Ren
VLM
40
31
0
01 May 2023