Title |
---|
![]() AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model Avamarie Brueggeman Andrea Madotto Zhaojiang Lin Tushar Nagarajan Matt Smith ...Peyman Heidari Yue Liu Kavya Srinet Babak Damavandi Anuj Kumar |
![]() ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring
Instruction Tuning Liang Zhao En Yu Zheng Ge Jinrong Yang Hao-Ran Wei ...Jian‐Yuan Sun Yuang Peng Runpei Dong Chunrui Han Xiangyu Zhang |
![]() DetGPT: Detect What You Need via Reasoning Renjie Pi Jiahui Gao Shizhe Diao Boyao Wang Hanze Dong ...Lewei Yao Jianhua Han Hang Xu Lingpeng Kong Tong Zhang Tong Zhang |