
Cross-lingual Transfer of Reward Models in Multilingual Alignment
Papers citing "Cross-lingual Transfer of Reward Models in Multilingual Alignment"
48 / 48 papers shown
Title |
---|
![]() Qwen2 Technical Report An Yang Baosong Yang Binyuan Hui Jian Xu Bowen Yu ...Yuqiong Liu Zeyu Cui Zhenru Zhang Zhifang Guo Zhi-Wei Fan |
![]() RewardBench: Evaluating Reward Models for Language Modeling Nathan Lambert Valentina Pyatkin Jacob Morrison Lester James V. Miranda Bill Yuchen Lin ...Sachin Kumar Tom Zick Yejin Choi Noah A. Smith Hanna Hajishirzi |
![]() OLMo: Accelerating the Science of Language Models Dirk Groeneveld Iz Beltagy Pete Walsh Akshita Bhagia Rodney Michael Kinney ...Jesse Dodge Kyle Lo Luca Soldaini Noah A. Smith Hanna Hajishirzi |
![]() Dolma: an Open Corpus of Three Trillion Tokens for Language Model
Pretraining Research Luca Soldaini Rodney Michael Kinney Akshita Bhagia Dustin Schwenk David Atkinson ...Hanna Hajishirzi Iz Beltagy Dirk Groeneveld Jesse Dodge Kyle Lo |
![]() Secrets of RLHF in Large Language Models Part II: Reward Modeling Bing Wang Rui Zheng Luyao Chen Yan Liu Shihan Dou ...Qi Zhang Xipeng Qiu Xuanjing Huang Zuxuan Wu Yuanyuan Jiang |