ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.00750
  4. Cited By
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

24 February 2025
Yiwen Ding
Zhiheng Xi
Wei He
Zhuoyuan Li
Yitao Zhai
Xiaowei Shi
Xunliang Cai
Tao Gui
Qi Zhang
Xuanjing Huang
    LRM
ArXivPDFHTML

Papers citing "Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling"

3 / 3 papers shown
Title
Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst
Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst
Hongru Wang
Deng Cai
Wanjun Zhong
Shijue Huang
Jeff Z. Pan
Zeming Liu
Kam-Fai Wong
ReLM
LRM
19
3
0
20 May 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
42
1
0
19 Apr 2025
GiFT: Gibbs Fine-Tuning for Code Generation
GiFT: Gibbs Fine-Tuning for Code Generation
Haochen Li
Wanjin Feng
Xin Zhou
Zhiqi Shen
SyDa
84
1
0
17 Feb 2025
1