ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01071
  4. Cited By
Extract Free Dense Labels from CLIP

Extract Free Dense Labels from CLIP

2 December 2021
Chong Zhou
Chen Change Loy
Bo Dai
    VLM
    CLIP
ArXivPDFHTML

Papers citing "Extract Free Dense Labels from CLIP"

50 / 343 papers shown
Title
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
S. Park
Subeen Lee
Hyun Seok Seong
Jaejoon Yoo
Jae-Pil Heo
39
1
0
03 Jan 2025
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation
Guanglei Yang
Rui Tian
Yongqiang Zhang
Zhun Zhong
Yongqiang Li
Wangmeng Zuo
35
0
0
31 Dec 2024
Improving Generated and Retrieved Knowledge Combination Through
  Zero-shot Generation
Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation
Xinkai Du
Quanjie Han
Chao Lv
Yi Liu
Yalin Sun
Hao Shu
Hongbo Shan
Maosong Sun
RALM
37
0
0
25 Dec 2024
MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context
MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context
Shuai Lyu
Fangjian Liao
Zeqi Ma
Rongchen Zhang
Dongmei Mo
W. Wong
76
1
0
22 Dec 2024
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level
  Vision-Language Alignment
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Cijo Jose
Théo Moutakanni
Dahyun Kang
Federico Baldassarre
Timothée Darcet
...
Maxime Oquab
Oriane Siméoni
Huy V. Vo
Patrick Labatut
Piotr Bojanowski
CLIP
VLM
100
6
0
20 Dec 2024
Leveraging Content and Context Cues for Low-Light Image Enhancement
Leveraging Content and Context Cues for Low-Light Image Enhancement
Igor Morawski
Kai He
Shusil Dangi
Winston H. Hsu
93
0
0
10 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
Composed Image Retrieval for Training-Free Domain Conversion
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
73
0
0
04 Dec 2024
SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from
  Sparse Multi-View RGB Images
SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images
Junqiu Yu
Xinlin Ren
Yongchong Gu
Haitao Lin
Tianyu Wang
Bo Li
Hang Xu
Yu-Gang Jiang
Xiangyang Xue
Yanwei Fu
3DGS
81
0
0
03 Dec 2024
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Sanghwan Kim
Rui Xiao
Mariana-Iuliana Georgescu
Stephan Alaniz
Zeynep Akata
VLM
76
2
0
02 Dec 2024
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language
  for Open-Vocabulary Segmentation
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
Luca Barsellotti
Lorenzo Bianchi
Nicola Messina
F. Carrara
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Rita Cucchiara
VLM
72
2
0
28 Nov 2024
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Chanyoung Kim
Dayun Ju
Woojung Han
Ming-Hsuan Yang
Seong Jae Hwang
VLM
VOS
79
0
0
26 Nov 2024
Open Vocabulary Monocular 3D Object Detection
Open Vocabulary Monocular 3D Object Detection
Jin Yao
Hao Gu
Xuweiyi Chen
Jiayun Wang
Zezhou Cheng
ObjD
VLM
73
3
0
25 Nov 2024
A Study on Unsupervised Domain Adaptation for Semantic Segmentation in
  the Era of Vision-Language Models
A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models
Manuel Schwonberg
Claus Werner
Hanno Gottschalk
Carsten Meyer
VLM
90
0
0
25 Nov 2024
ResCLIP: Residual Attention for Training-free Dense Vision-language
  Inference
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Yuhang Yang
Jinhong Deng
Wen Li
Lixin Duan
VLM
81
0
0
24 Nov 2024
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
Sule Bai
Yong-Jin Liu
Yifei Han
Haoji Zhang
Yansong Tang
VLM
79
3
0
24 Nov 2024
freePruner: A Training-free Approach for Large Multimodal Model
  Acceleration
freePruner: A Training-free Approach for Large Multimodal Model Acceleration
Bingxin Xu
Yuzhang Shang
Yunhao Ge
Qian Lou
Yan Yan
97
3
0
23 Nov 2024
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
M. Arda Aydın
Efe Mert Çırpar
Elvin Abdinli
Gözde B. Ünal
Y. Sahin
VLM
71
0
0
18 Nov 2024
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf
  Foundation Models for Open-Vocabulary Semantic Segmentation
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation
Dengke Zhang
Fagui Liu
Quan Tang
VLM
47
1
0
15 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free
  Open Vocabulary Segmentation
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Yuheng Shi
Minjing Dong
Chang Xu
VLM
43
1
0
14 Nov 2024
Learning Few-Shot Object Placement with Intra-Category Transfer
Learning Few-Shot Object Placement with Intra-Category Transfer
Adrian Rofer
Russell Buchanan
Max Argus
S. Vijayakumar
Abhinav Valada
43
0
0
05 Nov 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and
  Localization
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
95
9
0
31 Oct 2024
Multilingual Vision-Language Pre-training for the Remote Sensing Domain
Multilingual Vision-Language Pre-training for the Remote Sensing Domain
João Daniel Silva
João Magalhães
D. Tuia
Bruno Martins
CLIP
VLM
42
1
0
30 Oct 2024
Domain Adaptation with a Single Vision-Language Embedding
Domain Adaptation with a Single Vision-Language Embedding
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
28
0
0
28 Oct 2024
AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot
  Nuclei Detection via Visual-Language Pre-trained Models
AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models
Yongjian Wu
Yang Zhou
Jiya Saiyin
Bingzheng Wei
M. Lai
Jianzhong Shou
Yan Xu
VLM
MedIm
27
1
0
22 Oct 2024
Scene Graph Generation with Role-Playing Large Language Models
Scene Graph Generation with Role-Playing Large Language Models
Guikun Chen
Jin Li
Wenguan Wang
VLM
48
5
0
20 Oct 2024
Overcoming Domain Limitations in Open-vocabulary Segmentation
Overcoming Domain Limitations in Open-vocabulary Segmentation
Dongjun Hwang
Seong Joon Oh
Junsuk Choe
SSeg
OOD
58
0
0
15 Oct 2024
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
Jiayi Lin
Jiabo Huang
Jian Hu
S. Gong
DiffM
VLM
36
0
0
15 Oct 2024
ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
Jiawei Li
Fanrui Zhang
Jiaying Zhu
Esther Sun
Qiang Zhang
Zheng-jun Zha
MLLM
55
8
0
14 Oct 2024
Emerging Pixel Grounding in Large Multimodal Models Without Grounding
  Supervision
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
Shengcao Cao
Liang-Yan Gui
Yu-Xiong Wang
44
3
0
10 Oct 2024
Brain Mapping with Dense Features: Grounding Cortical Semantic
  Selectivity in Natural Images With Vision Transformers
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers
Andrew F. Luo
Jacob Yeung
Rushikesh Zawar
Shaurya Dewan
Margaret M. Henderson
Leila Wehbe
Michael J. Tarr
34
3
0
07 Oct 2024
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for
  3D Scene Understanding
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding
Shenghao Li
34
1
0
06 Oct 2024
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for
  Remote Sensing Images
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
Kaiyu Li
Ruixun Liu
Xiangyong Cao
Deyu Meng
Zhi Wang
Deyu Meng
Zhi Wang
36
3
0
02 Oct 2024
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Heeseong Shin
Chaehyun Kim
Sunghwan Hong
Seokju Cho
Anurag Arnab
Paul Hongsuck Seo
Seungryong Kim
VLM
34
1
0
30 Sep 2024
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation
Taha Koleilat
Hojat Asgariandehkordi
H. Rivaz
Yiming Xiao
MedIm
VLM
43
6
0
28 Sep 2024
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Ayca Takmaz
Alexandros Delitzas
R. Sumner
Francis Engelmann
Johanna Wald
Federico Tombari
78
11
0
27 Sep 2024
Exploring Fine-grained Retail Product Discrimination with Zero-shot
  Object Classification Using Vision-Language Models
Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models
Anil Osman Tur
Alessandro Conti
Cigdem Beyan
Davide Boscaini
Roberto Larcher
S. Messelodi
Fabio Poiesi
Elisa Ricci
VLM
39
0
0
23 Sep 2024
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Qilong Zhangli
Di Liu
Abhishek Aich
Dimitris Metaxas
S. Schulter
36
0
0
15 Sep 2024
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Wenhao Xu
Changwei Wang
Xuxiang Feng
Rongtao Xu
Longzhao Huang
Zherui Zhang
Li Guo
Shibiao Xu
VLM
34
2
0
13 Sep 2024
High-Performance Few-Shot Segmentation with Foundation Models: An
  Empirical Study
High-Performance Few-Shot Segmentation with Foundation Models: An Empirical Study
Shijie Chang
Lihe Zhang
Huchuan Lu
VLM
36
0
0
10 Sep 2024
Revisiting Prompt Pretraining of Vision-Language Models
Revisiting Prompt Pretraining of Vision-Language Models
Zhenyuan Chen
Lingfeng Yang
Shuo Chen
Zhaowei Chen
Jiajun Liang
Xiang Li
MLLM
VPVLM
VLM
43
1
0
10 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free
  Segmentation
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
43
1
0
05 Sep 2024
GraspSplats: Efficient Manipulation with 3D Feature Splatting
GraspSplats: Efficient Manipulation with 3D Feature Splatting
Mazeyu Ji
Ri-Zhao Qiu
Xueyan Zou
Xiaolong Wang
3DGS
39
18
0
03 Sep 2024
Image-to-Lidar Relational Distillation for Autonomous Driving Data
Image-to-Lidar Relational Distillation for Autonomous Driving Data
Anas Mahmoud
Ali Harakeh
Steven Waslander
26
0
0
01 Sep 2024
FADE: Few-shot/zero-shot Anomaly Detection Engine using Large
  Vision-Language Model
FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model
Yuanwei Li
Elizaveta Ivanova
Martins Bruveris
VLM
21
1
0
31 Aug 2024
Leveraging Hallucinations to Reduce Manual Prompt Dependency in
  Promptable Segmentation
Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation
Jian Hu
Jiayi Lin
Junchi Yan
Shaogang Gong
VLM
44
7
0
27 Aug 2024
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in
  Open-Vocabulary Semantic Segmentation
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation
Yuanbing Zhu
Bingke Zhu
Zhen Chen
Huan Xu
Ming Tang
Jinqiao Wang
VLM
34
0
0
27 Aug 2024
AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning
  with Only Normal Samples
AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning with Only Normal Samples
Yujin Lee
Seoyoon Jang
Hyunsoo Yoon
32
0
0
24 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
42
4
0
23 Aug 2024
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
Jingyun Wang
Guoliang Kang
VLM
SSL
47
7
0
13 Aug 2024
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic
  Segmentation
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
Dahyun Kang
Minsu Cho
ObjD
VLM
40
9
0
09 Aug 2024
Previous
1234567
Next