ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

22 February 2025
Shulin Huang
Linyi Yang
Yangqiu Song
Shuang Chen
Leyang Cui
Bo Liu
Qingcheng Zeng
Ying Wen
Kun Shao
Weinan Zhang
Jun Wang
Yue Zhang
    LRM

Papers citing "ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning"

2 / 2 papers shown
Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities
Weixiang Zhao
Xingyu Sui
Jiahe Guo
Yulin Hu
Yang Deng
Yanyan Zhao
Bing Qin
Wanxiang Che
Tat-Seng Chua
Ting Liu
ELM, LRM
132
9
0
23 Mar 2025
Foundation models may exhibit staged progression in novel CBRN threat disclosure
Kevin M Esvelt
78
1
0
19 Mar 2025