2506.10416
Can Sound Replace Vision in LLaVA With Token Substitution?
12 June 2025
Ali Vosoughi
Jing Bi
Pinxin Liu
Yunlong Tang
Chenliang Xu
CLIP
VLM
Papers citing "Can Sound Replace Vision in LLaVA With Token Substitution?"
No papers