Title |
---|
Foundational Models Defining a New Era in Vision: A Survey and Outlook. Muhammad Awais, Muzammal Naseer, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal, M. Shah, Ming-Hsuan Yang, Fahad Shahbaz Khan |
ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning. Liang Zhao, En Yu, Zheng Ge, Jinrong Yang, Hao-Ran Wei, ... Jian-Yuan Sun, Yuang Peng, Runpei Dong, Chunrui Han, Xiangyu Zhang |