Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision


14 January 2025
Yaowen Ye
Cassidy Laidlaw
Jacob Steinhardt
    ALM

Papers citing "Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision"

2 / 2 papers shown
On the Emergence of Weak-to-Strong Generalization: A Bias-Variance Perspective
Gengze Xu
Wei Yao
Ziqiao Wang
Yong Liu
46
0
0
30 May 2025
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
Bo Liu
Yunxiang Li
Yangqiu Song
Hanjing Wang
Linyi Yang
...
Jun Wang
Weinan Zhang
Shuyue Hu
Ying Wen
LLMAG, KELM, LRM, AI4CE
134
11
0
12 Mar 2025