arXiv:2312.05349
PixLore: A Dataset-driven Approach to Rich Image Captioning
8 December 2023
Diego Bonilla
VLM
Papers citing
"PixLore: A Dataset-driven Approach to Rich Image Captioning"
3 / 3 papers shown
Title
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Teng Wang
Jinrui Zhang
Junjie Fei
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
MLLM
109
82
0
04 May 2023
Tag2Text: Guiding Vision-Language Model via Image Tagging
Xinyu Huang
Youcai Zhang
Jinyu Ma
Weiwei Tian
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Lei Zhang
CLIP
MLLM
VLM
3DV
69
74
0
10 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
320
4,279
0
30 Jan 2023