ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation

7 December 2022
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
    VLM

Papers citing "ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation"

50 / 119 papers shown
Utilizing Grounded SAM for self-supervised frugal camouflaged human detection
Matthias Pijarowski
Alexander Wolpert
Martin Heckmann
Michael Teutsch
45
1
0
09 Jun 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Yunheng Li
Zhongyu Li
Quansheng Zeng
Qibin Hou
Ming-Ming Cheng
VLM
48
8
0
02 Jun 2024
Learning Robust Correlation with Foundation Model for Weakly-Supervised Few-Shot Segmentation
Xinyang Huang
Chuanglu Zhu
Kebin Liu
Ruiying Ren
Shengjie Liu
43
2
0
30 May 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
38
33
0
21 Apr 2024
Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model
Jihao Dong
Renjie Pan
Hua Yang
VLM
61
0
0
19 Apr 2024
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
Yaohui Li
Qifeng Zhou
Haoxing Chen
Jianbing Zhang
Xinyu Dai
Hao Zhou
VLM
53
0
0
15 Apr 2024
Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models
David Kurzendörfer
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
VLM
CLIP
33
2
0
09 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Pandeng Li
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
26
3
0
08 Apr 2024
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
Xiaofan Li
Zhizhong Zhang
Xin Tan
Chengwei Chen
Yanyun Qu
Yuan Xie
Lizhuang Ma
VLM
58
36
0
08 Apr 2024
Segment Any 3D Object with Language
Seungjun Lee
Yuyang Zhao
Gim Hee Lee
44
1
0
02 Apr 2024
Transfer CLIP for Generalizable Image Denoising
Junting Cheng
Dong Liang
Shan Tan
VLM
40
12
0
22 Mar 2024
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLM
50
7
0
21 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLM
CLIP
39
2
0
19 Mar 2024
Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples
Ziqi Zhou
Minghui Li
Wei Liu
Shengshan Hu
Yechao Zhang
Wei Wan
Lulu Xue
Leo Yu Zhang
Dezhong Yao
Hai Jin
SILM
AAML
50
9
0
16 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
43
6
0
14 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
49
2
0
13 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjD
MLLM
VLM
37
16
0
12 Mar 2024
QUASAR: QUality and Aesthetics Scoring with Advanced Representations
Sergey Kastryulin
Denis Prokopenko
Artem Babenko
Dmitry V. Dylov
33
0
0
11 Mar 2024
Boosting Image Restoration via Priors from Pre-trained Models
Xiaogang Xu
Shu Kong
Tao Hu
Zhe Liu
Hujun Bao
VLM
DiffM
41
2
0
11 Mar 2024
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
Lin Li
Haoyan Guan
Jianing Qiu
Michael W. Spratling
AAML
VLM
VPVLM
31
21
0
04 Mar 2024
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Hiroshi Murase
VLM
45
1
0
21 Feb 2024
CLIP Can Understand Depth
Dunam Kim
Seokju Lee
VLM
MDE
48
2
0
05 Feb 2024
CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification
Xiaoyan Yu
Neng Dong
Liehuang Zhu
Hao Peng
Dapeng Tao
33
7
0
11 Jan 2024
Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos
Rongqin Liang
Yuanman Li
Jiantao Zhou
Xia Li
38
6
0
07 Jan 2024
3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Zihao Xiao
Longlong Jing
Shangxuan Wu
Alex Zihao Zhu
Jingwei Ji
...
Thomas Funkhouser
Weicheng Kuo
A. Angelova
Yin Zhou
Shiwei Sheng
VLM
33
5
0
04 Jan 2024
Open Vocabulary Semantic Scene Sketch Understanding
Ahmed Bourouis
Judith E. Fan
Yulia Gryaditskaya
VLM
3DV
23
1
0
18 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang
Jieru Mei
Alan L. Yuille
VLM
29
55
0
04 Dec 2023
Raising the Bar of AI-generated Image Detection with CLIP
D. Cozzolino
Giovanni Poggi
Riccardo Corvi
Matthias Nießner
L. Verdoliva
VLM
29
74
0
30 Nov 2023
One-Shot Open Affordance Learning with Foundation Models
Gen Li
Deqing Sun
Laura Sevilla-Lara
Varun Jampani
VLM
73
22
0
29 Nov 2023
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Lukas Hoyer
D. Tan
Muhammad Ferjad Naeem
Luc Van Gool
F. Tombari
VLM
MLLM
36
16
0
27 Nov 2023
HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
Peng Xia
Xingtong Yu
Ming Hu
Lie Ju
Zhiyong Wang
Peibo Duan
Zongyuan Ge
VLM
57
9
0
23 Nov 2023
Open-Vocabulary Video Anomaly Detection
Peng Wu
Xuerong Zhou
Guansong Pang
Yujia Sun
Jing Liu
Peng Wang
Yanning Zhang
VLM
32
22
0
13 Nov 2023
Towards Calibrated Robust Fine-Tuning of Vision-Language Models
Changdae Oh
Hyesu Lim
Mijoo Kim
Dongyoon Han
Junhyeok Park
Euiseog Jeong
Alexander G. Hauptmann
Zhi-Qi Cheng
Kyungwoo Song
VLM
29
13
0
03 Nov 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
34
8
0
24 Oct 2023
A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application
Bo Yuan
Danpei Zhao
3DV
CLL
35
10
0
22 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
15
7
0
17 Oct 2023
CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Daisuke Deguchi
Chenkai Zhang
Xu Zheng
Hiroshi Murase
VLM
17
9
0
03 Oct 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Anton van den Hengel
F. Khan
VLM
60
5
0
24 Aug 2023
PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning
M. Han
Heliang Zheng
Chaoyue Wang
Yong Luo
Han Hu
Jing Zhang
Yonggang Wen
VLM
32
3
0
24 Aug 2023
LCCo: Lending CLIP to Co-Segmentation
Xin Duan
Yan Yang
Liyuan Pan
Xiabi Liu
VLM
42
1
0
22 Aug 2023
VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection
Peng Wu
Xu Zhou
Guansong Pang
Lingru Zhou
Qingsen Yan
Peng Wang
Yanning Zhang
CLIP
VLM
21
67
0
22 Aug 2023
Exploring Transfer Learning in Medical Image Segmentation using Vision-Language Models
K. Poudel
Manish Dhakal
Prasiddha Bhandari
Rabin Adhikari
Safal Thapaliya
Bishesh Khanal
VLM
30
17
0
15 Aug 2023
SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
Zheng Sun
Yanghong Zhou
Honghong He
P. Y. Mok
DiffM
32
26
0
15 Aug 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLM
CLIP
39
136
0
04 Aug 2023
LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network
Hao Yang
Liyuan Pan
Yan Yang
Richard Hartley
Miaomiao Liu
VLM
42
9
0
19 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
31
32
0
18 Jul 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks
Denis Coquenet
Clément Rambour
Emanuele Dalsasso
Nicolas Thome
MLLM
CLIP
VLM
37
1
0
13 Jul 2023
A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task
Shiqi Yang
Atsushi Hashimoto
Yoshitaka Ushiku
DiffM
VLM
43
1
0
06 Jul 2023
Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation
Balamurali Murugesan
Rukhshanda Hussain
Rajarshi Bhattacharya
Ismail Ben Ayed
Jose Dolz
VLM
VPVLM
26
4
0
30 Jun 2023
SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers
Bowen Zhang
Liyang Liu
Minh Hieu Phan
Zhi Tian
Chunhua Shen
Yifan Liu
ViT
26
28
0
09 Jun 2023