ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.00915
  4. Cited By
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
v1v2 (latest)

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

1 October 2025
Xin-Qiang Cai
Wei Wang
Feng Liu
Tongliang Liu
Gang Niu
Masashi Sugiyama
    OffRLAAML
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)Github (971★)

Papers citing "Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers"

1 / 1 papers shown
Title
Do Math Reasoning LLMs Help Predict the Impact of Public Transit Events?
Do Math Reasoning LLMs Help Predict the Impact of Public Transit Events?
Bowen Fang
Ruijian Zha
Xuan Di
AI4TS
12
0
0
02 Nov 2025
1