Title |
---|
![]() QPO: Query-dependent Prompt Optimization via Multi-Loop Offline
Reinforcement Learning Yilun Kong Hangyu Mao Qi Zhao Bin Zhang Jingqing Ruan Li Shen Yongzhe Chang Xueqian Wang Rui Zhao Dacheng Tao |
![]() BadRobot: Jailbreaking Embodied LLMs in the Physical World Hangtao Zhang Chenyu Zhu Xianlong Wang Ziqi Zhou Yichen Wang ...Shengshan Hu Leo Yu Zhang Aishan Liu Peijin Guo Leo Yu Zhang |
![]() Qwen2 Technical Report An Yang Baosong Yang Binyuan Hui Jian Xu Bowen Yu ...Yuqiong Liu Zeyu Cui Zhenru Zhang Zhifang Guo Zhi-Wei Fan |