ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.10406
  4. Cited By
PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

12 June 2025
Y. Jiang
Yuwen Xiong
Yufeng Yuan
Chao Xin
Wenyuan Xu
Yu Yue
Qianchuan Zhao
Lin Yan
    LRM
ArXiv (abs)PDFHTML

Papers citing "PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier"

Title
No papers