Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion
19 February 2024
Ziyue Wang
Chi Chen
Yiqi Zhu
Fuwen Luo
Peng Li
Ming Yan
Ji Zhang
Fei Huang
Maosong Sun
Yang Liu
Papers citing "Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion"
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Yiqi Zhu
Z. Wang
C. Zhang
Peng Li
Yang Liu
CoGe
VLM
68
0
0
18 Mar 2025
InsightVision: A Comprehensive, Multi-Level Chinese-based Benchmark for Evaluating Implicit Visual Semantics in Large Vision Language Models
Xiaofei Yin
Y. Hong
Ya Guo
Yi Tu
Weiqiang Wang
Gongshen Liu
Huijia Zhu
VLM
63
0
0
19 Feb 2025
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
Ziyue Wang
Chi Chen
Fuwen Luo
Yurui Dong
Yuanchi Zhang
Yuzhuang Xu
Xiaolong Wang
Peng Li
Yang Liu
LRM
40
3
0
07 Oct 2024
DeepONet for Solving Nonlinear Partial Differential Equations with Physics-Informed Training
Yahong Yang
25
0
0
06 Oct 2024
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye
Haiyang Xu
Guohai Xu
Jiabo Ye
Ming Yan
...
Junfeng Tian
Qi Qian
Ji Zhang
Fei Huang
Jingren Zhou
VLM
MLLM
208
900
0
27 Apr 2023