Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.06169
Cited By
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See
8 October 2024
Phu Pham
Phu Pham
Kun Wan
Yu-Jhe Li
Zeliang Zhang
Daniel Miranda
Ajinkya Kale
Ajinkya Kale
Chenliang Xu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See"
1 / 1 papers shown
Title
Learning to Inference Adaptively for Multimodal Large Language Models
Zhuoyan Xu
Khoi Duc Nguyen
Preeti Mukherjee
Saurabh Bagchi
Somali Chaterji
Yingyu Liang
Yin Li
LRM
46
1
0
13 Mar 2025
1