
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
Shilin Xu
Yanwei Li
Rui Yang
Tao Zhang
Yueyi Sun
Wei Chow
Linfeng Li
Hang Song
Qi Xu
Yunhai Tong
Xiangtai Li
Hao Fei
Papers citing "Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models"
Title | |||
---|---|---|---|
No papers |