ResearchTrend.AI
RLSF: Fine-tuning LLMs via Symbolic Feedback

26 May 2024
Piyush Jha
Prithwish Jana
Pranavkrishna Suresh
Arnav Arora
Vijay Ganesh
    LRM

Papers citing "RLSF: Fine-tuning LLMs via Symbolic Feedback"

3 / 3 papers shown
DualSchool: How Reliable are LLMs for Optimization Education?
Michael Klamkin
Arnaud Deza
Sikai Cheng
Haoruo Zhao
Pascal Van Hentenryck
51
0
0
27 May 2025
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Ameya Prabhu
Matthias Bethge
ReLM, ALM, LRM
232
26
0
09 Apr 2025
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
198
13
0
28 Aug 2023