
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
Jingbo Zhu
Papers citing "RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data"
42 / 42 papers shown
Title |
---|
Llama 2: Open Foundation and Fine-Tuned Chat Models. Hugo Touvron, Louis Martin, Kevin R. Stone, Peter Albert, Amjad Almahairi, ... Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom |