
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
Papers citing "RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style"
29 / 29 papers shown
Title |
---|
![]() RewardBench: Evaluating Reward Models for Language Modeling Nathan Lambert Valentina Pyatkin Jacob Morrison Lester James V. Miranda Bill Yuchen Lin ...Sachin Kumar Tom Zick Yejin Choi Noah A. Smith Hanna Hajishirzi |