ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01071
  4. Cited By
Extract Free Dense Labels from CLIP

Extract Free Dense Labels from CLIP

2 December 2021
Chong Zhou
Chen Change Loy
Bo Dai
    VLM
    CLIP
ArXivPDFHTML

Papers citing "Extract Free Dense Labels from CLIP"

50 / 343 papers shown
Title
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs
Jeongkee Lim
Yusung Kim
41
2
0
05 Aug 2024
Visual Grounding for Object-Level Generalization in Reinforcement
  Learning
Visual Grounding for Object-Level Generalization in Reinforcement Learning
Haobin Jiang
Zongqing Lu
LM&Ro
34
2
0
04 Aug 2024
Bayesian Active Learning for Semantic Segmentation
Bayesian Active Learning for Semantic Segmentation
Sima Didari
Wenjun Hu
Jae Oh Woo
Heng Hao
Hankyu Moon
Seungjai Min
48
0
0
03 Aug 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and
  Flexible Scene Text Retrieval
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
Gangyan Zeng
Yuan Zhang
Jin Wei
Dongbao Yang
Peng Zhang
Yiwen Gao
Xugong Qin
Yu Zhou
VLM
CLIP
30
0
0
01 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
46
3
0
31 Jul 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for
  Robust Open-Vocabulary Object Detection
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection
Kuo Wang
Lechao Cheng
Weikai Chen
Pingping Zhang
Liang Lin
Fan Zhou
Guanbin Li
VLM
ObjD
36
1
0
31 Jul 2024
MaskInversion: Localized Embeddings via Optimization of Explainability
  Maps
MaskInversion: Localized Embeddings via Optimization of Explainability Maps
Walid Bousselham
Sofian Chaybouti
Christian Rupprecht
Vittorio Ferrari
Hilde Kuehne
77
0
0
29 Jul 2024
Advancing Prompt Learning through an External Layer
Advancing Prompt Learning through an External Layer
Fangming Cui
Xun Yang
Chao Wu
Liang Xiao
Xinmei Tian
VLM
38
1
0
29 Jul 2024
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume
  Rendering
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering
Simon Boeder
Fabian Gigengack
Benjamin Risse
50
7
0
24 Jul 2024
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Ruoyu Feng
Tao Yu
Xin Jin
Xiaoyuan Yu
Lei Xiao
Zhibo Chen
VLM
34
1
0
21 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion
  Models
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
59
4
0
18 Jul 2024
Learning from the Web: Language Drives Weakly-Supervised Incremental
  Learning for Semantic Segmentation
Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation
Chang Liu
Giulia Rizzoli
Pietro Zanuttigh
Fu Li
Yi Niu
CLL
53
1
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided
  Self-Distillation
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
48
2
0
18 Jul 2024
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language
  Inference
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Mengcheng Lan
Chaofeng Chen
Yiping Ke
Xinjiang Wang
Xue Jiang
Wayne Zhang
VLM
42
24
0
17 Jul 2024
Beyond Mask: Rethinking Guidance Types in Few-shot Segmentation
Beyond Mask: Rethinking Guidance Types in Few-shot Segmentation
Shijie Chang
Youwei Pang
Xiaoqi Zhao
Lihe Zhang
Huchuan Lu
39
1
0
16 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized
  Segmentation
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
46
3
0
12 Jul 2024
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic
  Segmentation
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
36
15
0
11 Jul 2024
Unified Embedding Alignment for Open-Vocabulary Video Instance
  Segmentation
Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation
Hao Fang
Peng Wu
Yawei Li
Xinxin Zhang
Xiankai Lu
VLM
32
6
0
10 Jul 2024
CLIPVQA:Video Quality Assessment via CLIP
CLIPVQA:Video Quality Assessment via CLIP
Fengchuang Xing
Mingjie Li
Yuan-Gen Wang
Guopu Zhu
Xiaochun Cao
CLIP
ViT
40
4
0
06 Jul 2024
SOWA: Adapting Hierarchical Frozen Window Self-Attention to
  Visual-Language Models for Better Anomaly Detection
SOWA: Adapting Hierarchical Frozen Window Self-Attention to Visual-Language Models for Better Anomaly Detection
Zongxiang Hu
Zhaosheng Zhang
VLM
27
1
0
04 Jul 2024
Improving Zero-shot Generalization of Learned Prompts via Unsupervised
  Knowledge Distillation
Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
Marco Mistretta
Alberto Baldrati
Marco Bertini
Andrew D. Bagdanov
VPVLM
VLM
35
6
0
03 Jul 2024
Label-free Neural Semantic Image Synthesis
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
33
0
0
01 Jul 2024
GalLoP: Learning Global and Local Prompts for Vision-Language Models
GalLoP: Learning Global and Local Prompts for Vision-Language Models
Marc Lafon
Elias Ramzi
Clément Rambour
Nicolas Audebert
Nicolas Thome
VLM
41
8
0
01 Jul 2024
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Zihan Gao
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Yuwei Guo
Shuyuan Yang
34
0
0
01 Jul 2024
CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with
  Multi-View Images Generation
CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation
Zuo Zuo
Jiahao Dong
Yao Wu
Yanyun Qu
Zongze Wu
39
3
0
27 Jun 2024
High-resolution open-vocabulary object 6D pose estimation
High-resolution open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Francesco Giuliari
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
32
1
0
24 Jun 2024
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
Thomas Stegmüller
Tim Lebailly
Nikola Dukic
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
VLM
39
1
0
23 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
42
0
0
17 Jun 2024
Vision Language Modeling of Content, Distortion and Appearance for Image
  Quality Assessment
Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment
Fei Zhou
Zhicong Huang
Tianhao Gu
Guoping Qiu
CoGe
VLM
69
1
0
14 Jun 2024
UVIS: Unsupervised Video Instance Segmentation
UVIS: Unsupervised Video Instance Segmentation
Shuaiyi Huang
Saksham Suri
Kamal Gupta
Sai Saketh Rambhatla
Ser-Nam Lim
Abhinav Shrivastava
VLM
39
3
0
11 Jun 2024
M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via
  Multimodal Denoising
M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Chengjie Wang
Haokun Zhu
Jinlong Peng
Yue Wang
Ran Yi
Yunsheng Wu
Lizhuang Ma
J. J. Zhang
59
4
0
04 Jun 2024
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise
  Pseudo Labeling
Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling
Jinxing Zhou
Dan Guo
Yiran Zhong
Meng Wang
VLM
61
18
0
03 Jun 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for
  Zero-Shot Semantic Segmentation
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Yunheng Li
Zhongyu Li
Quansheng Zeng
Qibin Hou
Ming-Ming Cheng
VLM
45
8
0
02 Jun 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a
  Game-Theoretic View
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGe
LRM
48
2
0
27 May 2024
Open-Vocabulary SAM3D: Understand Any 3D Scene
Open-Vocabulary SAM3D: Understand Any 3D Scene
Hanchen Tai
Qingdong He
Jiangning Zhang
Yijie Qian
Zhenyu Zhang
Xiaobin Hu
Yabiao Wang
Yong Liu
VLM
54
0
0
24 May 2024
TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing
TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing
Teng Xu
Jiamin Chen
Peng Chen
Youjia Zhang
Junqing Yu
Wei Yang
3DGS
DiffM
19
5
0
23 May 2024
Tuning-free Universally-Supervised Semantic Segmentation
Tuning-free Universally-Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
50
1
0
23 May 2024
Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance
  for Low-Light Image Enhancement
Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement
Igor Morawski
Kai He
Shusil Dangi
Winston H. Hsu
VLM
54
2
0
19 May 2024
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual
  Question Answering
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering
Yuanyuan Jiang
Jianqin Yin
45
1
0
13 May 2024
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for
  Multi-Label Medical Image Classification
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Yaoqin Ye
Junjie Zhang
Hongwei Shi
MedIm
VLM
49
0
0
10 May 2024
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies
Lingdong Kong
You-Chen Liu
Lai Xing Ng
Benoit R. Cottereau
Wei Tsang Ooi
VLM
34
14
0
08 May 2024
A Survey on Occupancy Perception for Autonomous Driving: The Information
  Fusion Perspective
A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
Huaiyuan Xu
Junliang Chen
Shiyu Meng
Yi Wang
Lap-Pui Chau
3DPC
41
16
0
08 May 2024
Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
Zhaoxiang Zhang
Hanqiu Deng
Jinan Bao
Xingyu Li
VLM
36
1
0
08 May 2024
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via
  Editable Gaussian Splatting
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
O. Shorinwa
Johnathan Tucker
Aliyah Smith
Aiden Swann
Timothy Chen
Roya Firoozi
Monroe Kennedy
Mac Schwager
29
22
0
07 May 2024
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised
  Video Object Segmentation
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation
Gensheng Pei
Yazhou Yao
Jianbo Jiao
Wenguan Wang
Liqiang Nie
Jinhui Tang
VOS
32
1
0
21 Apr 2024
What does CLIP know about peeling a banana?
What does CLIP know about peeling a banana?
Claudia Cuttano
Gabriele Rosi
Gabriele Trivigno
Giuseppe Averta
29
2
0
18 Apr 2024
The devil is in the object boundary: towards annotation-free instance
  segmentation using Foundation Models
The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models
Cheng Shi
Sibei Yang
VLM
40
3
0
18 Apr 2024
Single-temporal Supervised Remote Change Detection for Domain
  Generalization
Single-temporal Supervised Remote Change Detection for Domain Generalization
Qiangang Du
Jinlong Peng
Xu Chen
Qingdong He
Liren He
Qiang Nie
Wenbing Zhu
Mingmin Chi
Yabiao Wang
Chengjie Wang
35
1
0
17 Apr 2024
Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection
Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection
Jiaqi Zhu
Shaofeng Cai
Fang Deng
Junran Wu
Junran Wu
55
15
0
15 Apr 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic
  Segmentation
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
41
22
0
12 Apr 2024
Previous
1234567
Next