Title |
---|
![]() Toward Optimal LLM Alignments Using Two-Player Games Rui Zheng Hongyi Guo Zhihan Liu Xiaoying Zhang Yuanshun Yao ...Tao Gui Qi Zhang Xuanjing Huang Hang Li Yang Liu |
![]() AgentGym: Evolving Large Language Model-based Agents across Diverse
Environments Zhiheng Xi Yiwen Ding Wenxiang Chen Boyang Hong Honglin Guo ...Qi Zhang Xipeng Qiu Xuanjing Huang Zuxuan Wu Yu-Gang Jiang |
![]() Online Merging Optimizers for Boosting Rewards and Mitigating Tax in
Alignment Keming Lu Bowen Yu Fei Huang Yang Fan Runji Lin Chang Zhou |
![]() RaFe: Ranking Feedback Improves Query Rewriting for RAG Shengyu Mao Yong-jia Jiang Boli Chen Xiao Li Peng Wang Xinyu Wang Pengjun Xie Fei Huang Huajun Chen Ningyu Zhang |