ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.15383
  4. Cited By
Generating Code World Models with Large Language Models Guided by Monte
  Carlo Tree Search

Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search

24 May 2024
Nicola Dainese
Matteo Merler
Minttu Alakuijala
Pekka Marttinen
    LLMAG
ArXivPDFHTML

Papers citing "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search"

7 / 7 papers shown
Title
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
Abdullah Vanlioglu
46
0
0
28 Mar 2025
Reclaiming the Source of Programmatic Policies: Programmatic versus
  Latent Spaces
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces
Tales H. Carvalho
Kenneth Tjhia
Levi H. S. Lelis
34
6
0
16 Oct 2024
CodeRL: Mastering Code Generation through Pretrained Models and Deep
  Reinforcement Learning
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
129
240
0
05 Jul 2022
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
322
4,077
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
361
8,495
0
28 Jan 2022
Grounding Predicates through Actions
Grounding Predicates through Actions
Toki Migimatsu
Jeannette Bohg
150
32
0
29 Sep 2021
Measuring Coding Challenge Competence With APPS
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
208
624
0
20 May 2021
1