Title |
---|
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation. Jinheng Xie, Weijia Mao, Zechen Bai, David Junhao Zhang, Weihao Wang, Kevin Qinghong Lin, Yuchao Gu, Zhijie Chen, Zhenheng Yang, Mike Zheng Shou |
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. Le Xue, Manli Shu, Anas Awadalla, Jun Wang, An Yan, ... Zeyuan Chen, Silvio Savarese, Juan Carlos Niebles, Caiming Xiong, Ran Xu |
VITA: Towards Open-Source Interactive Omni Multimodal LLM. Chaoyou Fu, Haojia Lin, Zuwei Long, Yunhang Shen, Meng Zhao, ... Ran He, Rongrong Ji, Yunsheng Wu, Caifeng Shan, Xing Sun |