2405.15383
Cited By
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
24 May 2024
Nicola Dainese
Matteo Merler
Minttu Alakuijala
Pekka Marttinen
LLMAG
Papers citing
"Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search"
7 / 7 papers shown
Title
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Abdullah Vanlioglu
46
0
0
28 Mar 2025
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces
Tales H. Carvalho
Kenneth Tjhia
Levi H. S. Lelis
34
6
0
16 Oct 2024
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
129
240
0
05 Jul 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
322
4,077
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
361
8,495
0
28 Jan 2022
Grounding Predicates through Actions
Toki Migimatsu
Jeannette Bohg
150
32
0
29 Sep 2021
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
208
624
0
20 May 2021