Title |
---|
A Survey on Evaluation of Multimodal Large Language Models Jiaxing Huang Jingyi Zhang |
Modality Invariant Multimodal Learning to Handle Missing Modalities: A Single-Branch Approach Muhammad Saad Saeed Shah Nawaz Muhammad Zaigham Zaheer Muhammad Haris Khan Karthik Nandakumar Muhammad Haroon Yousaf Hassan Sajjad Tom De Schepper Markus Schedl |
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents Junjie Wang Yin Zhang Yatai Ji Yuxiang Zhang Chunyang Jiang ...Bei Chen Qunshu Lin Minghao Liu Ge Zhang Wenhu Chen |
OpenVLA: An Open-Source Vision-Language-Action Model Moo Jin Kim Karl Pertsch Siddharth Karamcheti Ted Xiao Ashwin Balakrishna ...Russ Tedrake Dorsa Sadigh Sergey Levine Percy Liang Chelsea Finn |