arXiv:2502.16268
ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning
22 February 2025
Shulin Huang, Linyi Yang, Yangqiu Song, Shuang Chen, Leyang Cui, Bo Liu, Qingcheng Zeng, Ying Wen, Kun Shao, Weinan Zhang, Jun Wang, Yue Zhang
LRM
Papers citing
"ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning"
2 / 2 papers shown
Title
Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities
Weixiang Zhao, Xingyu Sui, Jiahe Guo, Yulin Hu, Yang Deng, Yanyan Zhao, Bing Qin, Wanxiang Che, Tat-Seng Chua, Ting Liu
ELM
LRM
132
9
0
23 Mar 2025
Foundation models may exhibit staged progression in novel CBRN threat disclosure
Kevin M Esvelt
78
1
0
19 Mar 2025