2405.16661
Cited By
v1
v2
v3 (latest)
RLSF: Fine-tuning LLMs via Symbolic Feedback
26 May 2024
Piyush Jha
Prithwish Jana
Pranavkrishna Suresh
Arnav Arora
Vijay Ganesh
LRM
Papers citing "RLSF: Fine-tuning LLMs via Symbolic Feedback"
3 / 3 papers shown
Title
DualSchool: How Reliable are LLMs for Optimization Education?
Michael Klamkin
Arnaud Deza
Sikai Cheng
Haoruo Zhao
Pascal Van Hentenryck
51
0
0
27 May 2025
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Ameya Prabhu
Matthias Bethge
ReLM
ALM
LRM
232
26
0
09 Apr 2025
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
198
13
0
28 Aug 2023