arXiv: 2412.16158
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
20 December 2024
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
Jun Liu
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
MLLM
VLM
Papers citing "HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding"
1 / 1 papers shown
Title
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training
Yiran Chen
Hao Peng
Tong Zhang
Heng Ji
VLM
28
0
0
13 May 2025