Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.13723
Cited By
Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values
20 February 2025
Hongbo Zhang
Han Cui
Guangsheng Bao
Linyi Yang
Jun Wang
Yue Zhang
OffRL
LRM
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values"
2 / 2 papers shown
Title
Thinking Machines: A Survey of LLM based Reasoning Strategies
Dibyanayan Bandyopadhyay
Soham Bhattacharjee
Asif Ekbal
LRM
ELM
71
10
0
13 Mar 2025
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
125
56
0
02 Apr 2024
1