ResearchTrend.AI

Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

27 May 2024
Ju-Seung Byun
Andrew Perrault

Papers citing "Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales"

1 / 1 papers shown
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
Ju-Seung Byun
Jiyun Chun
Jihyung Kil
Andrew Perrault
ReLM
LRM
25 Jun 2024