Title |
---|
![]() AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model Avamarie Brueggeman Andrea Madotto Zhaojiang Lin Tushar Nagarajan Matt Smith ...Peyman Heidari Yue Liu Kavya Srinet Babak Damavandi Anuj Kumar |
![]() Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level
Vision Haoning Wu Zicheng Zhang Erli Zhang Chaofeng Chen Liang Liao ...Chunyi Li Wenxiu Sun Qiong Yan Guangtao Zhai Weisi Lin |
![]() A Survey on Image-text Multimodal Models Ruifeng Guo Jingxuan Wei Linzhuang Sun Khai-Nguyen Nguyen Guiyong Chang Dawei Liu Sibo Zhang Zhengbing Yao Mingjun Xu Liping Bu |
![]() DreamLLM: Synergistic Multimodal Comprehension and Creation Runpei Dong Chunrui Han Yuang Peng Zekun Qi Zheng Ge ...Hao-Ran Wei Xiangwen Kong Xiangyu Zhang Kaisheng Ma Li Yi |
![]() GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised
Learning Mainak Singha Ankit Jha Biplab Banerjee |