Title |
---|
![]() Revisit Large-Scale Image-Caption Data in Pre-training Multimodal
Foundation Models Zhengfeng Lai Vasileios Saveris Chia-Ju Chen Hong-You Chen Haotian Zhang ...Wenze Hu Zhe Gan Peter Grasch Meng Cao Yinfei Yang |
![]() Look, Compare, Decide: Alleviating Hallucination in Large
Vision-Language Models via Multi-View Multi-Path Reasoning Xiaoye Qu Jiashuo Sun Wei Wei Yu Cheng |
![]() A Survey on Evaluation of Multimodal Large Language Models Jiaxing Huang Jingyi Zhang |