Title |
---|
![]() Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge Jiayi Ye Yanbo Wang Yue Huang Dongping Chen Qihui Zhang ...Werner Geyer Chao Huang Pin-Yu Chen Nitesh V. Chawla Xiangliang Zhang |
![]() RRM: Robust Reward Model Training Mitigates Reward Hacking Tianqi Liu Wei Xiong Jie Jessie Ren Lichang Chen Junru Wu ...Yuan Liu Bilal Piot Abe Ittycheriah Aviral Kumar Mohammad Saleh |
![]() A Survey for Large Language Models in Biomedicine Chong Wang Mengyao Li Junjun He Zhongruo Wang Erfan Darzi ...Yi Yu Pietro Liò Tianyun Wang Yu Guang Wang Yiqing Shen |