Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.06641
Cited By
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert Users
11 May 2023
Wataru Kawabe
Yusuke Sugano
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert Users"
3 / 3 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
392
4,137
0
28 Jan 2022
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
345
2,271
0
02 Sep 2021
CrowdHuman: A Benchmark for Detecting Human in a Crowd
Shuai Shao
Zijian Zhao
Boxun Li
Tete Xiao
Gang Yu
Xiangyu Zhang
Jian Sun
222
675
0
30 Apr 2018
1