Leveraging Procedural Generation to Benchmark Reinforcement Learning

3 December 2019
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
arXiv: 1912.01588

Papers citing "Leveraging Procedural Generation to Benchmark Reinforcement Learning"

50 / 286 papers shown
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals
Rohin Shah
Vikrant Varma
Ramana Kumar
Mary Phuong
Victoria Krakovna
J. Uesato
Zachary Kenton
40
68
0
04 Oct 2022
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
43
21
0
04 Oct 2022
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body
A. Chiappa
Alessandro Marin Vargas
Alexander Mathis
34
7
0
28 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
46
0
0
23 Sep 2022
A Generalist Neural Algorithmic Learner
Borja Ibarz
Vitaly Kurin
George Papamakarios
Kyriacos Nikiforou
Mehdi Abbana Bennani
...
Andreea Deac
Beatrice Bevilacqua
Yaroslav Ganin
Charles Blundell
Petar Veličković
OOD
32
53
0
22 Sep 2022
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
41
6
0
22 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
39
49
0
21 Sep 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo-wen Li
Ding Zhao
74
45
0
16 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
36
30
0
16 Sep 2022
Style-Agnostic Reinforcement Learning
Juyong Lee
Seokjun Ahn
Jaesik Park
25
4
0
31 Aug 2022
Continual Reinforcement Learning with TELLA
Neil Fendley
Cash Costello
Eric Q. Nguyen
Gino Perrotta
Corey Lowman
CLL
19
2
0
08 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Aleksandar Stanić
Yujin Tang
David R Ha
Jürgen Schmidhuber
ELM
29
13
0
05 Aug 2022
Unsupervised Frequent Pattern Mining for CEP
G. Shapira
Assaf Schuster
19
0
0
28 Jul 2022
Driver Dojo: A Benchmark for Generalizable Reinforcement Learning for Autonomous Driving
Sebastian Rietsch
S. Huang
G. Kontes
Axel Plinge
Christopher Mutschler
OOD
OffRL
24
5
0
23 Jul 2022
The Game of Hidden Rules: A New Kind of Benchmark Challenge for Machine Learning
Eric Pulick
S. Bharti
Yiding Chen
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
16
1
0
20 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
34
4
0
15 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktäschel
OnRL
38
4
0
13 Jul 2022
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Mhairi Dunion
Trevor A. McInroe
K. Luck
Josiah P. Hanna
Stefano V. Albrecht
OOD
DRL
18
18
0
12 Jul 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRL
OnRL
27
32
0
11 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
OffRL
32
24
0
26 Jun 2022
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
Junlin Wu
Yevgeniy Vorobeychik
24
21
0
21 Jun 2022
DNA: Proximal Policy Optimization with a Dual Network Architecture
Mathew H. Aitchison
Penny Sweetser
OffRL
25
4
0
20 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
28
24
0
20 Jun 2022
Deep Surrogate Assisted Generation of Environments
Varun Bhatt
Bryon Tjanaka
Matthew C. Fontaine
Stefanos Nikolaidis
56
35
0
09 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
31
33
0
07 Jun 2022
Learning Dynamics and Generalization in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
Marta Z. Kwiatkowska
Y. Gal
OOD
OffRL
28
12
0
05 Jun 2022
Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning
Bertrand Charpentier
Ransalu Senanayake
Mykel Kochenderfer
Stephan Günnemann
PER
UD
50
24
0
03 Jun 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
History Compression via Language Models in Reinforcement Learning
Fabian Paischer
Thomas Adler
Vihang Patil
Angela Bitto-Nemling
Markus Holzleitner
Sebastian Lehner
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
AI4TS
28
42
0
24 May 2022
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
32
30
0
22 May 2022
An Empirical Investigation of Representation Learning for Imitation
Xin Chen
Sam Toyer
Cody Wild
Scott Emmons
Ian S. Fischer
...
Steven H. Wang
Ping Luo
Stuart J. Russell
Pieter Abbeel
Rohin Shah
AI4TS
33
27
0
16 May 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
71
791
0
12 May 2022
Learning Generalized Policies Without Supervision Using GNNs
Simon Ståhlberg
Blai Bonet
Hector Geffner
OffRL
26
27
0
12 May 2022
Local Feature Swapping for Generalization in Reinforcement Learning
David Bertoin
Emmanuel Rachelson
OOD
21
14
0
13 Apr 2022
JORLDY: a fully customizable open source framework for reinforcement learning
Kyushik Min
Hyunho Lee
Kwansu Shin
Tae-woo Lee
Hojoon Lee
Jinwon Choi
Sung-Hyun Son
OnRL
16
0
0
11 Apr 2022
Dynamic Noises of Multi-Agent Environments Can Improve Generalization: Agent-based Models meets Reinforcement Learning
Mohamed Akrout
Amal Feriani
Bob McLeod
AI4CE
13
0
0
26 Mar 2022
The Sandbox Environment for Generalizable Agent Research (SEGAR)
R. Devon Hjelm
Bogdan Mazoure
Florian Golemo
Felipe Vieira Frujeri
Mihai Jalobeanu
Andrey Kolobov
LLMAG
LRM
24
1
0
19 Mar 2022
Evolving Curricula with Regret-Based Environment Design
Jack Parker-Holder
Minqi Jiang
Michael Dennis
Mikayel Samvelyan
Jakob N. Foerster
Edward Grefenstette
Tim Rocktäschel
31
117
0
02 Mar 2022
Reliable validation of Reinforcement Learning Benchmarks
Matthias Muller-Brockhausen
Aske Plaat
Mike Preuss
OffRL
11
1
0
02 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
46
11
0
01 Mar 2022
GraphWorld: Fake Graphs Bring Real Insights for GNNs
John Palowitch
Anton Tsitsulin
Brandon Mayer
Bryan Perozzi
GNN
195
68
0
28 Feb 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
19
10
0
23 Feb 2022
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning
Zhecheng Yuan
Guozheng Ma
Yao Mu
Bo Xia
Bo Yuan
Xueqian Wang
Ping Luo
Huazhe Xu
33
28
0
21 Feb 2022
A Survey of Explainable Reinforcement Learning
Stephanie Milani
Nicholay Topin
Manuela Veloso
Fei Fang
XAI
LRM
22
52
0
17 Feb 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma
Kelvin Xu
Nikhil Sardana
Abhishek Gupta
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
47
26
0
17 Dec 2021
Invariance Through Latent Alignment
Takuma Yoneda
Ge Yang
Matthew R. Walter
Bradly C. Stadie
OOD
21
9
0
15 Dec 2021