2411.16579
Cited By
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
25 November 2024
Zhiheng Xi
Dingwen Yang
Jixuan Huang
Jixin Tang
Guanyu Li
Yiwen Ding
Wei He
Boyang Hong
Shihan Dou
Wenyu Zhan
Xinyu Wang
Rui Zheng
Tao Ji
Xiaowei Shi
Yitao Zhai
Rongxiang Weng
Jiadong Wang
Xunliang Cai
Tao Gui
Zuxuan Wu
Qi Zhang
Xipeng Qiu
Xuanjing Huang
Yu-Gang Jiang
LRM
LLMAG
Papers citing "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision"
3 / 3 papers shown
Title
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Bang Zhang
Ruotian Ma
Qingxuan Jiang
Peisong Wang
Jiaqi Chen
...
Fanghua Ye
Jian Li
Yifan Yang
Zhaopeng Tu
Xiaolong Li
LLMAG
ELM
ALM
228
0
1
01 May 2025
Bag of Tricks for Inference-time Computation of LLM Reasoning
Fan Liu
Wenshuo Chao
Naiqiang Tan
Hao Liu
OffRL
LRM
142
5
0
11 Feb 2025
RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Enyu Zhou
Guodong Zheng
Binghai Wang
Zhiheng Xi
Shihan Dou
...
Yurong Mou
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
ALM
129
19
0
13 Oct 2024