Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.06303
Cited By
Reward Is Enough: LLMs Are In-Context Reinforcement Learners
21 May 2025
Kefan Song
Amir Moeini
Peng Wang
Lei Gong
Rohan Chandra
Yanjun Qi
Shangtong Zhang
ReLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reward Is Enough: LLMs Are In-Context Reinforcement Learners"
2 / 2 papers shown
Title
LLM-First Search: Self-Guided Exploration of the Solution Space
Nathan Herr
Tim Rocktaschel
Roberta Raileanu
LRM
148
0
0
05 Jun 2025
Can large language models explore in-context?
Akshay Krishnamurthy
Keegan Harris
Dylan J. Foster
Cyril Zhang
Aleksandrs Slivkins
LM&Ro
LLMAG
LRM
278
29
0
22 Mar 2024
1