Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.16181
Cited By
How Can LLM Guide RL? A Value-Based Approach
25 February 2024
Shenao Zhang
Sirui Zheng
Shuqi Ke
Zhihan Liu
Wanxin Jin
Jianbo Yuan
Yingxiang Yang
Hongxia Yang
Zhaoran Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"How Can LLM Guide RL? A Value-Based Approach"
11 / 11 papers shown
Title
DSADF: Thinking Fast and Slow for Decision Making
Alex Zhihao Dou
Dongfei Cui
Jun Yan
Wei Wang
Benteng Chen
Haoming Wang
Zeke Xie
Shufei Zhang
OffRL
41
0
0
13 May 2025
Monte Carlo Planning with Large Language Model for Text-Based Game Agents
Zijing Shi
Meng Fang
Ling Chen
LLMAG
LM&Ro
33
0
0
23 Apr 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
69
1
0
17 Feb 2025
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos
Harsh Mahesheka
Zhixian Xie
Zhilin Wang
Wanxin Jin
29
0
0
11 Oct 2024
Efficient Reinforcement Learning with Large Language Model Priors
Xue Yan
Yan Song
Xidong Feng
Mengyue Yang
Haifeng Zhang
Haitham Bou Ammar
Jun Wang
OffRL
33
3
0
10 Oct 2024
On the Modeling Capabilities of Large Language Models for Sequential Decision Making
Martin Klissarov
Devon Hjelm
Alexander Toshev
Bogdan Mazoure
LM&Ro
ELM
OffRL
LRM
34
2
0
08 Oct 2024
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
267
2,494
0
06 Oct 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
389
8,495
0
28 Jan 2022
Skill Induction and Planning with Latent Language
Pratyusha Sharma
Antonio Torralba
Jacob Andreas
LM&Ro
202
108
0
04 Oct 2021
Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators
Clement Gehring
Masataro Asai
Rohan Chitnis
Tom Silver
L. Kaelbling
Shirin Sohrabi
Michael Katz
OffRL
40
37
0
30 Sep 2021
1