
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Papers citing "ChatRex: Taming Multimodal LLM for Joint Perception and Understanding"
50 / 96 papers shown
Title |
---|
![]() Aria: An Open Multimodal Native Mixture-of-Experts Model Dongxu Li Yudong Liu Haoning Wu Yue Wang Zhiqi Shen ...Lihuan Zhang Hanshu Yan Guoyin Wang Bei Chen Junnan Li |
![]() MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning Haotian Zhang Mingfei Gao Zhe Gan Philipp Dufter Nina Wenzel ...Haoxuan You Zirui Wang Afshin Dehghan Peter Grasch Yinfei Yang |
![]() Molmo and PixMo: Open Weights and Open Data for State-of-the-Art
Multimodal Models Matt Deitke Christopher Clark Sangho Lee Rohun Tripathi Yue Yang ...Noah A. Smith Hannaneh Hajishirzi Ross Girshick Ali Farhadi Aniruddha Kembhavi |
![]() GLaMM: Pixel Grounding Large Multimodal Model H. Rasheed Muhammad Maaz Sahal Shaji Mullappilly Abdelrahman M. Shaker Salman Khan Hisham Cholakkal Rao M. Anwer Erix Xing Ming-Hsuan Yang Fahad S. Khan |