ResearchTrend.AI
Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values

20 February 2025
Hongbo Zhang
Han Cui
Guangsheng Bao
Linyi Yang
Jun Wang
Yue Zhang
    OffRL
    LRM
    AI4CE
arXiv: 2502.13723

Papers citing "Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values"

2 / 2 papers shown
Thinking Machines: A Survey of LLM based Reasoning Strategies
Dibyanayan Bandyopadhyay
Soham Bhattacharjee
Asif Ekbal
LRM
ELM
71
10
0
13 Mar 2025
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
125
56
0
02 Apr 2024