Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents

22 November 2023
Zihao Zhou
Bin-Bin Hu
Chenyang Zhao
Pu Zhang
Bin Liu
    LLMAG

Papers citing "Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents"

14 / 14 papers shown
Internet of Agents: Fundamentals, Applications, and Challenges
Yuntao Wang
Shaolong Guo
Yanghe Pan
Zhou Su
Fahao Chen
Tom H. Luan
Peng Li
Jiawen Kang
Dusit Niyato
LLMAG
LM&Ro
AI4CE
60
0
0
12 May 2025
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning
Sheila Schoepp
Masoud Jafaripour
Yingyue Cao
Tianpei Yang
Fatemeh Abdollahi
Shadan Golestan
Zahin Sufiyan
Osmar Zaiane
Matthew E. Taylor
OffRL
LM&Ro
46
0
0
24 Feb 2025
Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning
Jiaqi Liu
Chengkai Xu
Peng Hang
Jian Sun
Mingyu Ding
W. Zhan
Masayoshi Tomizuka
40
2
0
31 Oct 2024
How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?
Zuojin Tang
Bin-Bin Hu
Chenyang Zhao
De Ma
Gang Pan
Bin Liu
23
0
0
21 Oct 2024
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
Guibin Zhang
Xinfeng Li
Xiangguo Sun
Guancheng Wan
Miao Yu
Junfeng Fang
Kun Wang
Dawei Cheng
AAML
AI4CE
51
7
0
15 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
46
4
0
03 Oct 2024
Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Guibin Zhang
Xinfeng Li
Zhixun Li
Sukwon Yun
Guancheng Wan
Kun Wang
Dawei Cheng
Jeffrey Xu Yu
Tianlong Chen
34
9
0
03 Oct 2024
A Survey on the Integration of Generative AI for Critical Thinking in Mobile Networks
Athanasios Karapantelakis
Alexandros Nikou
Ajay Kattepur
Jean Martins
Leonid Mokrushin
S. Mohalik
Marin Orlic
Aneta Vulgarakis Feljan
29
1
0
10 Apr 2024
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
120
159
0
04 Oct 2023
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Bin-Bin Hu
Chenyang Zhao
Pushi Zhang
Zihao Zhou
Yuanhang Yang
Zenglin Xu
Bin Liu
LM&Ro
LLMAG
25
21
0
06 Jun 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
95
155
0
07 Mar 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
267
2,494
0
06 Oct 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
314
3,248
0
21 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
389
8,495
0
28 Jan 2022