ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.01588
  4. Cited By
Leveraging Procedural Generation to Benchmark Reinforcement Learning

Leveraging Procedural Generation to Benchmark Reinforcement Learning

3 December 2019
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
ArXivPDFHTML

Papers citing "Leveraging Procedural Generation to Benchmark Reinforcement Learning"

50 / 286 papers shown
Title
Using Offline Data to Speed-up Reinforcement Learning in Procedurally
  Generated Environments
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRL
OnRL
34
2
0
18 Apr 2023
Efficient Automation of Neural Network Design: A Survey on
  Differentiable Neural Architecture Search
Efficient Automation of Neural Network Design: A Survey on Differentiable Neural Architecture Search
Alexandre Heuillet
A. Nasser
Hichem Arioui
Hedi Tabia
AI4CE
27
11
0
11 Apr 2023
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via
  Geometry-aware Curriculum and Iterative Generalist-Specialist Learning
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning
Weikang Wan
Haoran Geng
Yun-Hai Liu
Zikang Shan
Yaodong Yang
Li Yi
He Wang
50
94
0
02 Apr 2023
POPGym: Benchmarking Partially Observable Reinforcement Learning
POPGym: Benchmarking Partially Observable Reinforcement Learning
Steven D. Morad
Ryan Kortvelesy
Matteo Bettini
Stephan Liwicki
Amanda Prorok
OffRL
22
38
0
03 Mar 2023
Self-supervised network distillation: an effective approach to
  exploration in sparse reward environments
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
Matej Pecháč
M. Chovanec
Igor Farkaš
32
3
0
22 Feb 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and
  Self-Play
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
26
17
0
15 Feb 2023
Policy-Induced Self-Supervision Improves Representation Finetuning in
  Visual RL
Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL
Sébastien M. R. Arnold
Fei Sha
SSL
21
0
0
12 Feb 2023
MarioGPT: Open-Ended Text2Level Generation through Large Language Models
MarioGPT: Open-Ended Text2Level Generation through Large Language Models
Shyam Sudhakaran
Miguel González Duque
Claire Glanois
Matthias Anton Freiberger
Elias Najarro
S. Risi
VLM
29
54
0
12 Feb 2023
Equivariant MuZero
Equivariant MuZero
Andreea Deac
T. Weber
George Papamakarios
16
3
0
09 Feb 2023
Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent
  Reinforcement Learning
Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement Learning
Claude Formanek
Asad Jeewa
Jonathan P. Shock
Arnu Pretorius
OffRL
43
1
0
01 Feb 2023
Scaling laws for single-agent reinforcement learning
Scaling laws for single-agent reinforcement learning
Jacob Hilton
Jie Tang
John Schulman
22
20
0
31 Jan 2023
Composing Task Knowledge with Modular Successor Feature Approximators
Composing Task Knowledge with Modular Successor Feature Approximators
Wilka Carvalho
Angelos Filos
Richard L. Lewis
Honglak Lee
Satinder Singh
17
7
0
28 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement
  Learning
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
29
8
0
26 Jan 2023
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning
  in Lifelong Reinforcement Learning
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning
Safa Alver
Doina Precup
OffRL
16
5
0
24 Jan 2023
The configurable tree graph (CT-graph): measurable problems in partially
  observable and distal reward environments for lifelong reinforcement learning
The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning
Andrea Soltoggio
Eseoghene Ben-Iwhiwhu
Christos Peridis
Pawel Ladosz
Jeffery Dick
Praveen K. Pilly
Soheil Kolouri
OffRL
32
3
0
21 Jan 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Generalization through Diversity: Improving Unsupervised Environment
  Design
Generalization through Diversity: Improving Unsupervised Environment Design
Wenjun Li
Pradeep Varakantham
Dexun Li
33
7
0
19 Jan 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning
  Systems
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
30
15
0
18 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
38
109
0
18 Jan 2023
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
G. Antoniol
30
7
0
13 Jan 2023
Mastering Diverse Domains through World Models
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
38
546
0
10 Jan 2023
Reinforcement Learning with Success Induced Task Prioritization
Reinforcement Learning with Success Induced Task Prioritization
Maria Nesterova
Alexey Skrynnik
Aleksandr I. Panov
8
2
0
30 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
27
25
0
29 Dec 2022
On Realization of Intelligent Decision-Making in the Real World: A
  Foundation Decision Model Perspective
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Bo Liu
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
Jun Wang
AI4CE
26
10
0
24 Dec 2022
Lifelong Reinforcement Learning with Modulating Masks
Lifelong Reinforcement Learning with Modulating Masks
Eseoghene Ben-Iwhiwhu
Saptarshi Nath
Praveen K. Pilly
Soheil Kolouri
Andrea Soltoggio
CLL
OffRL
32
20
0
21 Dec 2022
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement
  Learning
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
25
83
0
14 Dec 2022
Improving generalization in reinforcement learning through forked agents
Improving generalization in reinforcement learning through forked agents
Olivier Moulin
Vincent François-Lavet
Mark Hoogendoorn
AI4CE
28
0
0
13 Dec 2022
Tackling Visual Control via Multi-View Exploration Maximization
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
30
1
0
28 Nov 2022
A System for Morphology-Task Generalization via Unified Representation
  and Behavior Distillation
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
22
14
0
25 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
35
0
0
23 Nov 2022
Powderworld: A Platform for Understanding Generalization via Rich Task
  Distributions
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Kevin Frans
Phillip Isola
OffRL
47
9
0
23 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
18
2
0
12 Nov 2022
Active Task Randomization: Learning Robust Skills via Unsupervised
  Generation of Diverse and Feasible Tasks
Active Task Randomization: Learning Robust Skills via Unsupervised Generation of Diverse and Feasible Tasks
Kuan Fang
Toki Migimatsu
Ajay Mandlekar
Li Fei-Fei
Jeannette Bohg
44
2
0
11 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRL
OnRL
AI4CE
26
23
0
08 Nov 2022
Broken Neural Scaling Laws
Broken Neural Scaling Laws
Ethan Caballero
Kshitij Gupta
Irina Rish
David M. Krueger
30
74
0
26 Oct 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated
  Worlds
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht
Abraham J. Fetterman
Bryden Fogelman
Ellie Kitanidis
Bartosz Wróblewski
...
Michael Rosenthal
Maksis Knutins
Zachary Polizzi
James B. Simon
Kanjun Qiu
OffRL
29
23
0
24 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
21
21
0
24 Oct 2022
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon
  Manipulation
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
Shuo Cheng
Danfei Xu
54
37
0
23 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement
  Learning
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Z. Tu
OffRL
31
15
0
19 Oct 2022
Rethinking Value Function Learning for Generalization in Reinforcement
  Learning
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD
OffRL
21
16
0
18 Oct 2022
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement
  Learning
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
6
0
0
13 Oct 2022
Object-Category Aware Reinforcement Learning
Object-Category Aware Reinforcement Learning
Qi Yi
Rui Zhang
Shaohui Peng
Jiaming Guo
Xingui Hu
Zidong Du
Xishan Zhang
Qi Guo
Yunji Chen
CML
LRM
25
6
0
13 Oct 2022
Contrastive Retrospection: honing in on critical steps for rapid
  learning and generalization in RL
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL
Chen Sun
Wannan Yang
Thomas Jiralerspong
Dane Malenfant
Benjamin Alsbury-Nealy
Yoshua Bengio
Blake A. Richards
OffRL
19
2
0
12 Oct 2022
Exploration via Elliptical Episodic Bonuses
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
32
40
0
11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
29
10
0
11 Oct 2022
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Zifan Xu
Bo Liu
Xuesu Xiao
Anirudh Nair
Peter Stone
36
42
0
10 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement
  Learning
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
38
26
0
10 Oct 2022
Decomposed Mutual Information Optimization for Generalized Context in
  Meta-Reinforcement Learning
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning
Yao Mu
Yuzheng Zhuang
Fei Ni
Bin Wang
Jianyu Chen
Jianye Hao
Ping Luo
21
2
0
09 Oct 2022
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Matthew Aitchison
Penny Sweetser
Marcus Hutter
50
19
0
05 Oct 2022
Previous
123456
Next