Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values

20 February 2025

Papers citing "Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values"

2 / 2 papers shown

Title
Thinking Machines: A Survey of LLM based Reasoning Strategies Dibyanayan Bandyopadhyay Soham Bhattacharjee Asif Ekbal LRM ELM 71 10 0 13 Mar 2025
A Survey on Large Language Model-Based Game Agents Sihao Hu Tiansheng Huang Gaowen Liu Ramana Rao Kompella Gaowen Liu Selim Furkan Tekin Yichang Xu Zachary Yahn Ling Liu LLMAG LM&Ro AI4CE LM&MA 125 56 0 02 Apr 2024