Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01071
Cited By
Extract Free Dense Labels from CLIP
2 December 2021
Chong Zhou
Chen Change Loy
Bo Dai
VLM
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Extract Free Dense Labels from CLIP"
50 / 343 papers shown
Title
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs
Kanchana Ranasinghe
Satya Narayan Shukla
Omid Poursaeed
Michael S. Ryoo
Tsung-Yu Lin
LRM
49
23
0
11 Apr 2024
O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation
Muer Tie
Julong Wei
Zhengjun Wang
Ke Wu
Shansuai Yuan
Kaizhao Zhang
Jie Jia
Jieru Zhao
Zhongxue Gan
Wenchao Ding
40
7
0
10 Apr 2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
Luca Barsellotti
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
VLM
DiffM
44
13
0
09 Apr 2024
Retrieval-Augmented Open-Vocabulary Object Detection
Jooyeon Kim
Eulrang Cho
Sehyung Kim
Hyunwoo J. Kim
VLM
ObjD
45
8
0
08 Apr 2024
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
Xiaofan Li
Zhizhong Zhang
Xin Tan
Chengwei Chen
Yanyun Qu
Yuan Xie
Lizhuang Ma
VLM
58
36
0
08 Apr 2024
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Ji-Jia Wu
Andy Chia-Hao Chang
Chieh-Yu Chuang
Chun-Pei Chen
Yu-Lun Liu
Min-Hung Chen
Hou-Ning Hu
Yung-Yu Chuang
Yen-Yu Lin
VLM
43
9
0
05 Apr 2024
Segment Any 3D Object with Language
Seungjun Lee
Yuyang Zhao
Gim Hee Lee
44
1
0
02 Apr 2024
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Jienneg Chen
Qihang Yu
Xiaohui Shen
Alan L. Yuille
Liang-Chieh Chen
3DV
VLM
36
24
0
02 Apr 2024
OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation
Xiongwei Wu
Sicheng Yu
Ee-Peng Lim
Chong-Wah Ngo
VLM
37
2
0
01 Apr 2024
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields
Yunsong Wang
Hanlin Chen
Gim Hee Lee
34
5
0
01 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
44
2
0
31 Mar 2024
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
Sang-Kee Jo
Soohyun Ryu
Sungyub Kim
Eunho Yang
Kyungsu Kim
35
1
0
30 Mar 2024
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sang-Kee Jo
Fei Pan
In-Jae Yu
Kyungsu Kim
33
2
0
30 Mar 2024
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
Yuan Wang
Rui Sun
Naisong Luo
Yuwen Pan
Tianzhu Zhang
VLM
43
9
0
30 Mar 2024
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models
Barbara Toniella Corradini
Mustafa Shukor
Paul Couairon
Guillaume Couairon
Franco Scarselli
Matthieu Cord
DiffM
VLM
45
4
0
29 Mar 2024
Transfer CLIP for Generalizable Image Denoising
Junting Cheng
Dong Liang
Shan Tan
VLM
40
12
0
22 Mar 2024
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLM
50
7
0
21 Mar 2024
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
Djamahl Etchegaray
Zi Huang
Tatsuya Harada
Yadan Luo
31
9
0
20 Mar 2024
Better Call SAL: Towards Learning to Segment Anything in Lidar
Aljovsa Ovsep
Tim Meinhardt
Francesco Ferroni
Neehar Peri
Deva Ramanan
Laura Leal-Taixé
VLM
35
15
0
19 Mar 2024
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
VLM
30
2
0
17 Mar 2024
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
Yasufumi Kawano
Yoshimitsu Aoki
DiffM
32
4
0
17 Mar 2024
A Fixed-Point Approach to Unified Prompt-Based Counting
Wei Lin
Antoni B. Chan
28
4
0
15 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
43
6
0
14 Mar 2024
Annotation Free Semantic Segmentation with Vision Foundation Models
Soroush Seifi
Daniel Olmeda Reino
Fabien Despinoy
Rahaf Aljundi
VLM
34
1
0
14 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
49
2
0
13 Mar 2024
Learning Generalizable Feature Fields for Mobile Manipulation
Ri-Zhao Qiu
Yafei Hu
Ge Yang
Yuchen Song
Yang Fu
...
Jiteng Mu
Ruihan Yang
Nikolay Atanasov
Sebastian Scherer
Xiaolong Wang
40
27
0
12 Mar 2024
FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications
Yuki Tatsukawa
I-Chao Shen
Anran Qi
Yuki Koyama
Takeo Igarashi
Ariel Shamir
CLIP
VLM
25
5
0
11 Mar 2024
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision
Yajie Liu
Pu Ge
Qingjie Liu
Di Huang
75
2
0
06 Mar 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
37
6
0
29 Feb 2024
HaLo-NeRF: Learning Geometry-Guided Semantics for Exploring Unconstrained Photo Collections
Chen Dudai
Morris Alper
Hana Bezalel
Rana Hanocka
Itai Lang
Hadar Averbuch-Elor
23
2
0
14 Feb 2024
Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision
Zhaoqing Wang
Xiaobo Xia
Ziye Chen
Xiao He
Yandong Guo
Biwei Huang
Tongliang Liu
VLM
26
11
0
14 Feb 2024
OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding
Guibiao Liao
Kaichen Zhou
Zhenyu Bao
Kanglin Liu
Qing Li
VLM
20
20
0
07 Feb 2024
CLIP Can Understand Depth
Dunam Kim
Seokju Lee
VLM
MDE
48
2
0
05 Feb 2024
Region-Based Representations Revisited
Michal Shlapentokh-Rothman
Ansel Blume
Yao Xiao
Yuqun Wu
TV Sethuraman
Heyi Tao
Jae Yong Lee
Wilfredo Torres
Yu-xiong Wang
Derek Hoiem
36
5
0
04 Feb 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
30
3
0
29 Jan 2024
Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels
Seungho Lee
Seoungyoon Kang
Hyunjung Shim
ViT
VLM
36
0
0
23 Jan 2024
Exploring Simple Open-Vocabulary Semantic Segmentation
Zihang Lai
VLM
21
0
0
22 Jan 2024
Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
Ci-Siang Lin
Chien-Yi Wang
Yu-Chiang Frank Wang
Min-Hung Chen
VLM
23
0
0
22 Jan 2024
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models
Koichi Namekata
Amirmojtaba Sabour
Sanja Fidler
Seung Wook Kim
49
18
0
22 Jan 2024
Improving fine-grained understanding in image-text pre-training
Ioana Bica
Anastasija Ilić
Matthias Bauer
Goker Erdogan
Matko Bovsnjak
...
A. Gritsenko
Matthias Minderer
Charles Blundell
Razvan Pascanu
Jovana Mitrović
VLM
25
22
0
18 Jan 2024
Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation
Ze-Long Cheng
Kehan Li
Hao Li
Peng Jin
Chang Liu
Xiawu Zheng
Rongrong Ji
Jie Chen
VOS
36
2
0
18 Jan 2024
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Antonín Vobecký
Oriane Siméoni
David Hurych
Spyros Gidaris
Andrei Bursuc
Patrick Pérez
Josef Sivic
40
33
0
17 Jan 2024
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion
Minglong Xue
Jinhong He
Wenhai Wang
Mingliang Zhou
20
11
0
08 Jan 2024
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
Haobo Yuan
Xiangtai Li
Chong Zhou
Yining Li
Kai Chen
Chen Change Loy
VLM
29
51
0
05 Jan 2024
3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Zihao Xiao
Longlong Jing
Shangxuan Wu
Alex Zihao Zhu
Jingwei Ji
...
Thomas Funkhouser
Weicheng Kuo
A. Angelova
Yin Zhou
Shiwei Sheng
VLM
33
5
0
04 Jan 2024
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu
Shan Ning
Xuming He
VLM
38
3
0
04 Jan 2024
Few-shot Adaptation of Multi-modal Foundation Models: A Survey
Fan Liu
Tianshu Zhang
Wenwen Dai
Wenwen Cai
Wenwen Cai Xiaocong Zhou
Delong Chen
VLM
OffRL
31
23
0
03 Jan 2024
VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting
Seunggu Kang
WonJun Moon
Euiyeon Kim
Jae-Pil Heo
18
21
0
27 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
43
58
0
26 Dec 2023
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
Qinying Liu
Wei Wu
Kecheng Zheng
Zhan Tong
Jiawei Liu
Yu Liu
Wei Chen
Zilei Wang
Yujun Shen
VLM
26
6
0
21 Dec 2023
Previous
1
2
3
4
5
6
7
Next