ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.12808
  4. Cited By
Open-Ended Learning Leads to Generally Capable Agents

Open-Ended Learning Leads to Generally Capable Agents

27 July 2021
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
Jakob Bauer
Jakub Sygnowski
Maja Trebacz
Max Jaderberg
Michaël Mathieu
Nat McAleese
N. Bradley-Schmieg
Nathaniel Wong
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
ArXivPDFHTML

Papers citing "Open-Ended Learning Leads to Generally Capable Agents"

50 / 134 papers shown
Title
Large Sequence Models for Sequential Decision-Making: A Survey
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
Jun Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
37
35
0
24 Jun 2023
Reward-Free Curricula for Training Robust World Models
Reward-Free Curricula for Training Robust World Models
Marc Rigter
Minqi Jiang
Ingmar Posner
VLM
OffRL
31
6
0
15 Jun 2023
Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement
  Learning
Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning
E. Liu
S. Suri
Tong Mu
Allan Zhou
Chelsea Finn
LLMAG
LM&Ro
21
2
0
14 Jun 2023
Composing Efficient, Robust Tests for Policy Selection
Composing Efficient, Robust Tests for Policy Selection
Dustin Morrill
Thomas J. Walsh
D. Hernández
Peter R. Wurman
Peter Stone
20
0
0
12 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
30
20
0
08 Jun 2023
Generalization Across Observation Shifts in Reinforcement Learning
Generalization Across Observation Shifts in Reinforcement Learning
Anuj Mahajan
Amy Zhang
OOD
OffRL
11
0
0
07 Jun 2023
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
Bo Liu
Yifeng Zhu
Chongkai Gao
Yihao Feng
Qian Liu
Yuke Zhu
Peter Stone
CLL
35
115
0
05 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
24
13
0
05 Jun 2023
Operationalising the Definition of General Purpose AI Systems: Assessing
  Four Approaches
Operationalising the Definition of General Purpose AI Systems: Assessing Four Approaches
Risto Uuk
C. I. Gutierrez
Alex Tamkin
26
2
0
05 Jun 2023
OMNI: Open-endedness via Models of human Notions of Interestingness
OMNI: Open-endedness via Models of human Notions of Interestingness
Jenny Zhang
Joel Lehman
Kenneth O. Stanley
Jeff Clune
LRM
20
31
0
02 Jun 2023
Thought Cloning: Learning to Think while Acting by Imitating Human
  Thinking
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
Shengran Hu
Jeff Clune
LM&Ro
OffRL
LRM
AI4CE
35
27
0
01 Jun 2023
Adaptive Coordination in Social Embodied Rearrangement
Adaptive Coordination in Social Embodied Rearrangement
Andrew Szot
Unnat Jain
Dhruv Batra
Z. Kira
Ruta Desai
Akshara Rai
42
13
0
31 May 2023
Ghost in the Minecraft: Generally Capable Agents for Open-World
  Environments via Large Language Models with Text-based Knowledge and Memory
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Xizhou Zhu
Yuntao Chen
Hao Tian
Chenxin Tao
Weijie Su
...
Lewei Lu
Xiaogang Wang
Yu Qiao
Zhaoxiang Zhang
Jifeng Dai
LLMAG
LM&Ro
31
212
0
25 May 2023
Dynamics of niche construction in adaptable populations evolving in
  diverse environments
Dynamics of niche construction in adaptable populations evolving in diverse environments
Eleni Nisioti
Clément Moulin-Frier
29
2
0
16 May 2023
Open-ended search for environments and adapted agents using MAP-Elites
Open-ended search for environments and adapted agents using MAP-Elites
Emma Stensby Norstein
K. Ellefsen
K. Glette
16
4
0
02 May 2023
Dynamic Datasets and Market Environments for Financial Reinforcement
  Learning
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Xiao-Yang Liu
Ziyi Xia
Hongyang Yang
Jiechao Gao
Daochen Zha
Ming Zhu
Chris Wang
Zhaoran Wang
Jian Guo
OffRL
29
27
0
25 Apr 2023
Meta-Learned Models of Cognition
Meta-Learned Models of Cognition
Marcel Binz
Ishita Dasgupta
A. Jagadish
M. Botvinick
Jane X. Wang
Eric Schulz
30
25
0
12 Apr 2023
Skill Reinforcement Learning and Planning for Open-World Long-Horizon
  Tasks
Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
Haoqi Yuan
Chi Zhang
Hongchen Wang
Feiyang Xie
Penglin Cai
Hao Dong
Zongqing Lu
LM&Ro
LLMAG
20
18
0
29 Mar 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework
marl-jax: Multi-Agent Reinforcement Leaning Framework
K. Mehta
Anuj Mahajan
Kiran Ravish
23
3
0
24 Mar 2023
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement
  Learning
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning
Mikayel Samvelyan
Akbir Khan
Michael Dennis
Minqi Jiang
Jack Parker-Holder
Jakob N. Foerster
Roberta Raileanu
Tim Rocktaschel
54
24
0
06 Mar 2023
Augmented Language Models: a Survey
Augmented Language Models: a Survey
Grégoire Mialon
Roberto Dessì
Maria Lomeli
Christoforos Nalmpantis
Ramakanth Pasunuru
...
Jane Dwivedi-Yu
Asli Celikyilmaz
Edouard Grave
Yann LeCun
Thomas Scialom
LRM
KELM
44
367
0
15 Feb 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
26
22
0
09 Feb 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning
  and Adaptive Horizon Prediction
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Shaofei Cai
Zihao Wang
Xiaojian Ma
Anji Liu
Yitao Liang
47
40
0
21 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
38
108
0
18 Jan 2023
The Effectiveness of World Models for Continual Reinforcement Learning
The Effectiveness of World Models for Continual Reinforcement Learning
Samuel Kessler
M. Ostaszewski
Michal Bortkiewicz
M. Żarski
Maciej Wołczyk
Jack Parker-Holder
Stephen J. Roberts
Piotr Milo's
KELM
OffRL
CLL
27
7
0
29 Nov 2022
Powderworld: A Platform for Understanding Generalization via Rich Task
  Distributions
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
44
9
0
23 Nov 2022
Contextual Transformer for Offline Meta Reinforcement Learning
Contextual Transformer for Offline Meta Reinforcement Learning
Runji Lin
Ye Li
Xidong Feng
Zhaowei Zhang
Xian Hong Wu Fung
Haifeng Zhang
Jun Wang
Yali Du
Yaodong Yang
OffRL
18
6
0
15 Nov 2022
General Intelligence Requires Rethinking Exploration
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
29
17
0
15 Nov 2022
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication
  for Autonomous Drone Reforestation
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
P. D. Siedler
AI4CE
20
4
0
14 Nov 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated
  Worlds
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht
Abraham J. Fetterman
Bryden Fogelman
Ellie Kitanidis
Bartosz Wróblewski
...
Michael Rosenthal
Maksis Knutins
Zachary Polizzi
James B. Simon
Kanjun Qiu
OffRL
21
23
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
55
8
0
23 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
29
3
0
20 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
27
1
0
18 Oct 2022
Retrospectives on the Embodied AI Workshop
Retrospectives on the Embodied AI Workshop
Matt Deitke
Dhruv Batra
Yonatan Bisk
Tommaso Campari
Angel X. Chang
...
Jesse Thomason
Alexander Toshev
Joanne Truong
Luca Weihs
Jiajun Wu
LM&Ro
37
51
0
13 Oct 2022
How to Enable Uncertainty Estimation in Proximal Policy Optimization
How to Enable Uncertainty Estimation in Proximal Policy Optimization
Eugene Bykovets
Yannick Metz
Mennatallah El-Assady
Daniel A. Keim
J. M. Buhmann
UQCV
8
1
0
07 Oct 2022
VIMA: General Robot Manipulation with Multimodal Prompts
VIMA: General Robot Manipulation with Multimodal Prompts
Yunfan Jiang
Agrim Gupta
Zichen Zhang
Guanzhi Wang
Yongqiang Dou
Yanjun Chen
Li Fei-Fei
Anima Anandkumar
Yuke Zhu
Linxi Fan
LM&Ro
28
335
0
06 Oct 2022
Neural Distillation as a State Representation Bottleneck in
  Reinforcement Learning
Neural Distillation as a State Representation Bottleneck in Reinforcement Learning
Valentin Guillet
D. Wilson
Carlos Aguilar-Melchor
Emmanuel Rachelson
11
1
0
05 Oct 2022
Improving alignment of dialogue agents via targeted human judgements
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trkebacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
227
502
0
28 Sep 2022
Disentangling Transfer in Continual Reinforcement Learning
Disentangling Transfer in Continual Reinforcement Learning
Maciej Wołczyk
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
65
27
0
28 Sep 2022
Honor of Kings Arena: an Environment for Generalization in Competitive
  Reinforcement Learning
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning
Hua Wei
Jingxiao Chen
Xiyang Ji
Hongyang Qin
Minwen Deng
...
Lin Liu
Lanxiao Huang
Deheng Ye
Qiang Fu
Wei Yang
35
27
0
18 Sep 2022
The Alignment Problem from a Deep Learning Perspective
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
59
183
0
30 Aug 2022
Deep Reinforcement Learning for Multi-Agent Interaction
Deep Reinforcement Learning for Multi-Agent Interaction
I. Ahmed
Cillian Brewitt
Ignacio Carlucho
Filippos Christianos
Mhairi Dunion
...
Lukas Schafer
Massimiliano Tamborski
Giuseppe Vecchio
Cheng Wang
Stefano V. Albrecht
DRL
AI4CE
11
11
0
02 Aug 2022
Bayesian Generational Population-Based Training
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel
  Test Environments
Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments
John Tan Chong Min
Mehul Motani
32
1
0
13 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
43
13
0
11 Jul 2022
Improving Policy Optimization with Generalist-Specialist Learning
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
OffRL
29
24
0
26 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
36
285
0
23 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale
  Knowledge
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
48
348
0
17 Jun 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
38
235
0
14 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
32
25
0
02 Jun 2022
Previous
123
Next