Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.17760
Cited By
Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
26 November 2024
Shijian Deng
Wentian Zhao
Yu-Jhe Li
Kun Wan
Daniel Miranda
Ajinkya Kale
Yapeng Tian
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach"
2 / 2 papers shown
Title
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning
Xiaokun Wang
Chris
Jiangbo Pei
Wei Shen
Yi Peng
...
Ai Jian
Tianyidan Xie
Xuchen Song
Yang Liu
Yahui Zhou
OffRL
LRM
28
0
0
12 May 2025
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Yuhang Zang
Xiaoyi Dong
Pan Zhang
Yuhang Cao
Ziyu Liu
...
Haodong Duan
Feiyu Xiong
Kai Chen
Dahua Lin
Jiaqi Wang
VLM
74
19
0
21 Jan 2025
1