Title and authors |
---|
ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models Yuqing Huang Rongyang Zhang Xiaoxiao He Xuyang Zhi Hao Wang ...Guoping Hu Guiquan Liu Qi Liu Defu Lian Enhong Chen |
RRM: Robust Reward Model Training Mitigates Reward Hacking Tianqi Liu Wei Xiong Jie Jessie Ren Lichang Chen Junru Wu ...Yuan Liu Bilal Piot Abe Ittycheriah Aviral Kumar Mohammad Saleh |
Exploring Large Language Models for Product Attribute Value Identification Kassem Sabeh Mouna Kacimi Johann Gamper Robert Litschko Barbara Plank |
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data Jiaming Zhou Abbas Ghaddar Ge Zhang Liheng Ma Yaochen Hu Soumyasundar Pal Mark Coates Bin Wang Yingxue Zhang Jianye Hao |
The Central Role of the Loss Function in Reinforcement Learning Kaiwen Wang Nathan Kallus Wen Sun |
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement An Yang Beichen Zhang Binyuan Hui Bofei Gao Bowen Yu ...Mingfeng Xue Runji Lin Tianyu Liu Xingzhang Ren Zhenru Zhang |