Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.04907
Cited By
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning
7 December 2021
Zichuan Lin
Junyou Li
Jianing Shi
Deheng Ye
Qiang Fu
Wei Yang
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning"
22 / 22 papers shown
Title
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
Zehua Wang
Yurui Dong
Fuwen Luo
Minyuan Ruan
Zhili Cheng
Chong Chen
Peng Li
Yang Liu
LRM
89
0
0
13 Mar 2025
Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms
Minghe Gao
Wendong Bu
Bingchen Miao
Yang Wu
Yunfei Li
Juncheng Billy Li
Siliang Tang
Qi Wu
Yueting Zhuang
Meng Wang
LM&Ro
45
3
0
17 Nov 2024
Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation
Zhonghan Zhao
Kewei Chen
Dongxu Guo
Wenhao Chai
Tianbo Ye
Yanting Zhang
Gaoang Wang
64
21
0
13 Mar 2024
S-Agents: Self-organizing Agents in Open-ended Environments
Jia-Qing Chen
Yu-Gang Jiang
Jiachen Lu
Li Zhang
AIFin
LLMAG
LM&Ro
60
15
0
07 Feb 2024
Affordable Generative Agents
Yangbin Yu
Qin Zhang
Junyou Li
Qiang Fu
Deheng Ye
LLMAG
AI4CE
45
5
0
03 Feb 2024
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li
Xue Yang
Zhaokai Wang
Xizhou Zhu
Jie Zhou
Yu Qiao
Xiaogang Wang
Hongsheng Li
Lewei Lu
Jifeng Dai
43
32
0
14 Dec 2023
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
Yiran Qin
Enshen Zhou
Qichang Liu
Zhen-fei Yin
Lu Sheng
Ruimao Zhang
Yu Qiao
Jing Shao
LM&Ro
32
39
0
12 Dec 2023
See and Think: Embodied Agent in Virtual Environment
Zhonghan Zhao
Wenhao Chai
Xuan Wang
Li Boyi
Shengyu Hao
Shidong Cao
Tianbo Ye
Gaoang Wang
LM&Ro
LLMAG
34
34
0
26 Nov 2023
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Yilun Kong
Jingqing Ruan
Yihong Chen
Bin Zhang
Tianpeng Bao
...
Xiaoru Hu
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
42
37
0
19 Nov 2023
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models
Zihao Wang
Shaofei Cai
Guy Van den Broeck
Yonggang Jin
Jinbing Hou
...
Zhaofeng He
Zilong Zheng
Yaodong Yang
Xiaojian Ma
Yitao Liang
LLMAG
LM&Ro
40
96
0
10 Nov 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Guy Van den Broeck
Yitao Liang
83
26
0
12 Oct 2023
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Xizhou Zhu
Yuntao Chen
Hao Tian
Chenxin Tao
Weijie Su
...
Lewei Lu
Xiaogang Wang
Yu Qiao
Zhaoxiang Zhang
Jifeng Dai
LLMAG
LM&Ro
36
215
0
25 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
60
757
0
25 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
36
94
0
18 May 2023
CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft
Ziluo Ding
Hao Luo
Ke Li
Junpeng Yue
Tiejun Huang
Zongqing Lu
VLM
23
11
0
19 Mar 2023
Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Zichuan Lin
Xiapeng Wu
Mingfei Sun
Deheng Ye
Qiang Fu
Wei Yang
Wei Liu
18
3
0
05 Feb 2023
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
Zihao Wang
Shaofei Cai
Guanzhou Chen
Guy Van den Broeck
Xiaojian Ma
Yitao Liang
LM&Ro
LLMAG
60
318
0
03 Feb 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Shaofei Cai
Zihao Wang
Xiaojian Ma
Guy Van den Broeck
Yitao Liang
50
40
0
21 Jan 2023
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao
Jun Zhang
Deheng Ye
Jiancheng Cao
Xiao Han
Qiang Fu
Wei Yang
OffRL
31
9
0
04 Dec 2022
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization
Tiantian Zhang
Zichuan Lin
Yuxing Wang
Deheng Ye
Qiang Fu
Wei Yang
Xueqian Wang
Bin Liang
Bo Yuan
Xiu Li
CLL
35
10
0
01 Sep 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
45
288
0
23 Jun 2022
MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Anssi Kanervisto
Stephanie Milani
Karolis Ramanauskas
Nicholay Topin
Zichuan Lin
...
Franccois Fleuret
Alexander Nikulin
Yury Belousov
Oleg Svidchenko
A. Shpilman
OffRL
60
31
0
17 Feb 2022
1