
Think-J: Learning to Think for Generative LLM-as-a-Judge
Papers citing "Think-J: Learning to Think for Generative LLM-as-a-Judge"
17 / 17 papers shown
Title |
---|
![]() Secrets of RLHF in Large Language Models Part II: Reward Modeling Bing Wang Rui Zheng Luyao Chen Yan Liu Shihan Dou ...Qi Zhang Xipeng Qiu Xuanjing Huang Zuxuan Wu Yuanyuan Jiang |