Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.00001
Cited By
Reward Design with Language Models
27 February 2023
Minae Kwon
Sang Michael Xie
Kalesha Bullard
Dorsa Sadigh
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reward Design with Language Models"
48 / 48 papers shown
Title
GoalLadder: Incremental Goal Discovery with Vision-Language Models
Alexey Zakharov
Shimon Whiteson
20
0
0
19 Jun 2025
LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement
Dimitris Panagopoulos
Adolfo Perrusquía
Weisi Guo
LLMAG
9
0
0
09 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
31
0
0
06 Jun 2025
AURA: Agentic Upskilling via Reinforced Abstractions
Alvin Zhu
Yusuke Tanaka
Dennis W. Hong
55
0
0
03 Jun 2025
TuneComp: Joint Fine-tuning and Compression for Large Foundation Models
Xiangyu Chen
Jing Liu
Ye Wang
Matthew Brand
Wang
T. Koike-Akino
57
0
0
27 May 2025
TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
Yuhui Chen
Haoran Li
Zhennan Jiang
Haowei Wen
Dongbin Zhao
61
0
0
26 May 2025
Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games
Xiaoqing Zhang
Huabin Zheng
Ang Lv
Yuhan Liu
Zirui Song
Flood Sung
Xiuying Chen
Rui Yan
OffRL
ReLM
LRM
AI4CE
118
0
0
22 May 2025
ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations
Jiahui Zhang
Yusen Luo
Abrar Anwar
Sumedh Anand Sontakke
Joseph J Lim
Jesse Thomason
Erdem Biyik
Jesse Zhang
OffRL
LM&Ro
126
0
0
16 May 2025
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Wei Shen
Guanlin Liu
Zheng Wu
Ruofei Zhu
Qingping Yang
Chao Xin
Yu Yue
Lin Yan
154
14
0
28 Mar 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian Sun
Wei Ma
226
4
0
27 Mar 2025
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences
Anukriti Singh
Amisha Bhaskar
Peihong Yu
Souradip Chakraborty
Ruthwik Dasyam
Amrit Singh Bedi
Pratap Tokekar
106
0
0
18 Mar 2025
LLM-Mediated Guidance of MARL Systems
Philipp D. Siedler
Ian Gemp
99
0
0
16 Mar 2025
Controllable Complementarity: Subjective Preferences in Human-AI Collaboration
Chase McDonald
Cleotilde Gonzalez
97
0
0
07 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
117
3
0
03 Mar 2025
Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning
Adrià López Escoriza
Nicklas Hansen
Stone Tao
Tongzhou Mu
H. Su
OffRL
90
1
0
03 Mar 2025
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning
Sheila Schoepp
Masoud Jafaripour
Yingyue Cao
Tianpei Yang
Fatemeh Abdollahi
Shadan Golestan
Zahin Sufiyan
Osmar Zaiane
Matthew E. Taylor
OffRL
LM&Ro
123
1
0
24 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
142
1
0
17 Feb 2025
HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation
Yi Li
Yuquan Deng
Jing Zhang
Joel Jang
Marius Memme
...
Fabio Ramos
Dieter Fox
Anqi Li
Abhishek Gupta
Ankit Goyal
LM&Ro
194
17
0
08 Feb 2025
Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression
Kai Yoshida
M. Mizukami
Seiya Kawano
Canasai Kruengkrai
Hiroaki Sugiyama
Koichiro Yoshino
ALM
OffRL
134
1
0
28 Jan 2025
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
Hashmath Shaik
Alex Doboli
OffRL
ELM
467
0
0
31 Dec 2024
ELEMENTAL: Interactive Learning from Demonstrations and Vision-Language Models for Reward Design in Robotics
Letian Chen
Nina Moorman
Matthew C. Gombolay
OffRL
LM&Ro
191
1
0
27 Nov 2024
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies
Jiajie Yu
Yuhong Wang
Wei Ma
OffRL
166
2
0
14 Oct 2024
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu
Qiayuan Liao
Zhongyu Li
Koushil Sreenath
Negar Mehr
Negar Mehr
LM&Ro
358
4
0
27 Sep 2024
AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models
Yifei Yao
Wentao He
Chenyu Gu
Jiaheng Du
Fuwei Tan
Zhen Zhu
Junguo Lu
OffRL
129
2
0
13 Sep 2024
E
2
C
F
D
\mathrm{E^{2}CFD}
E
2
CFD
: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
89
0
0
08 Jul 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
100
1
0
09 Jun 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
226
57
0
02 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
129
70
0
30 Mar 2024
RAIL: Robot Affordance Imagination with Large Language Models
Ceng Zhang
Xin Meng
Dongchen Qi
Gregory Chirikjian
LM&Ro
68
3
0
28 Mar 2024
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
116
9
0
14 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
105
12
0
11 Mar 2024
Natural Language Reinforcement Learning
Xidong Feng
Bo Liu
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
102
5
0
11 Feb 2024
Institutional Platform for Secure Self-Service Large Language Model Exploration
V. Bumgardner
Mitchell A. Klusty
W. V. Logan
Samuel E. Armstrong
Caylin D. Hickey
Jeff Talbert
Caylin Hickey
Jeff Talbert
138
1
0
01 Feb 2024
Multi-granularity Knowledge Transfer for Continual Reinforcement Learning
Chaofan Pan
Lingfei Ren
Hao Wang
Linbo Xiong
Wei Wei
Yonghao Li
Xin Yang
134
3
0
25 Jan 2024
RePLan: Robotic Replanning with Perception and Language Models
Marta Skreta
Zihan Zhou
Jia Lin Yuan
Kourosh Darvish
Alán Aspuru-Guzik
Animesh Garg
LM&Ro
LRM
125
26
0
08 Jan 2024
Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents
Zihao Zhou
Bin-Bin Hu
Chenyang Zhao
Pu Zhang
Bin Liu
LLMAG
116
16
0
22 Nov 2023
Make a Donut: Hierarchical EMD-Space Planning for Zero-Shot Deformable Manipulation with Tools
Yang You
Bokui Shen
Congyue Deng
Haoran Geng
Songlin Wei
He Wang
Leonidas Guibas
60
1
0
05 Nov 2023
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu
Chao Yu
Fei Fang
Yu Wang
Yi Wu
LLMAG
125
95
0
29 Oct 2023
Learning Reward for Physical Skills using Large Language Model
Yuwei Zeng
Yiqing Xu
76
6
0
21 Oct 2023
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination
Xihuai Wang
Shao Zhang
Wenhao Zhang
Wentao Dong
Jingxiao Chen
Ying Wen
Weinan Zhang
93
11
0
08 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
263
179
0
04 Oct 2023
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
72
9
0
19 Sep 2023
LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying
Thommen George Karimpanal
Laknath Semage
Santu Rana
Hung Le
T. Tran
Sunil R. Gupta
Svetha Venkatesh
RALM
LLMAG
55
3
0
21 Aug 2023
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Wenlong Huang
Chen Wang
Ruohan Zhang
Yunzhu Li
Jiajun Wu
Li Fei-Fei
LM&Ro
132
519
0
12 Jul 2023
RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
Zhao Mandi
Shreeya Jain
Shuran Song
LM&Ro
LLMAG
75
141
0
10 Jul 2023
On Learning to Summarize with Large Language Models as References
Yixin Liu
Kejian Shi
Katherine S He
Longtian Ye
Alexander R. Fabbri
Pengfei Liu
Dragomir R. Radev
Arman Cohan
ELM
117
82
0
23 May 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
Hengyuan Hu
Dorsa Sadigh
LM&Ro
96
64
0
13 Apr 2023
Deanthropomorphising NLP: Can a Language Model Be Conscious?
Matthew Shardlow
Piotr Przybyła
64
7
0
21 Nov 2022
1