ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.16158
  4. Cited By
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

20 December 2024
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
Jun Liu
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
    MLLM
    VLM
ArXivPDFHTML

Papers citing "HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding"

1 / 1 papers shown
Title
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training
Yiran Chen
Hao Peng
Tong Zhang
Heng Ji
VLM
28
0
0
13 May 2025
1