2111.01587
Cited By
Procedural Generalization by Planning with Self-Supervised World Models
2 November 2021
Ankesh Anand
Jacob Walker
Yazhe Li
Eszter Vértes
Julian Schrittwieser
Sherjil Ozair
T. Weber
Jessica B. Hamrick
Papers citing
"Procedural Generalization by Planning with Self-Supervised World Models"
38 / 38 papers shown
Title
Vector Quantized Models for Planning
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
61
50
0
08 Jun 2021
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Bogdan Mazoure
Ahmed M. Ahmed
Patrick MacAlpine
R. Devon Hjelm
Andrey Kolobov
49
27
0
04 Jun 2021
Learning and Planning in Complex Action Spaces
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
67
79
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
64
66
0
13 Apr 2021
Decoupling Value and Policy for Generalization in Reinforcement Learning
Roberta Raileanu
Rob Fergus
DRL
OffRL
51
98
0
20 Feb 2021
The Value Equivalence Principle for Model-Based Reinforcement Learning
Christopher Grimm
André Barreto
Satinder Singh
David Silver
OffRL
40
85
0
06 Nov 2020
Representation Learning via Invariant Causal Mechanisms
Jovana Mitrović
Brian McWilliams
Jacob Walker
Lars Buesing
Charles Blundell
CML
OOD
SSL
60
249
0
15 Oct 2020
Prioritized Level Replay
Minqi Jiang
Edward Grefenstette
Tim Rocktäschel
OffRL
55
158
0
08 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
95
852
0
05 Oct 2020
Phasic Policy Gradient
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
51
158
0
09 Sep 2020
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
74
75
0
24 Jul 2020
On Robustness and Transferability of Convolutional Neural Networks
Josip Djolonga
Jessica Yung
Michael Tschannen
Rob Romijnders
Lucas Beyer
...
D. Moldovan
Sylvain Gelly
N. Houlsby
Xiaohua Zhai
Mario Lucic
OOD
55
155
0
16 Jul 2020
Deep Reinforcement and InfoMax Learning
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
58
109
0
12 Jun 2020
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
74
660
0
30 Apr 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
175
142
0
30 Apr 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
66
34
0
07 Feb 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
165
1,211
0
19 Jan 2020
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
72
555
0
03 Dec 2019
Contrastive Learning of Structured World Models
Thomas Kipf
Elise van der Pol
Max Welling
OCL
DRL
69
284
0
27 Nov 2019
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
Maximilian Igl
K. Ciosek
Yingzhen Li
Sebastian Tschiatschek
Cheng Zhang
Sam Devlin
Katja Hofmann
OffRL
55
173
0
28 Oct 2019
Improving Generalization in Meta Reinforcement Learning using Learned Objectives
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
OffRL
69
119
0
09 Oct 2019
Benchmarking Model-Based Reinforcement Learning
Tingwu Wang
Xuchan Bao
I. Clavera
Jerrick Hoang
Yeming Wen
Eric D. Langlois
Matthew Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
OffRL
59
363
0
03 Jul 2019
Shaping Belief States with Generative Environment Models for RL
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRL
AI4CE
82
119
0
21 Jun 2019
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
89
950
0
19 Jun 2019
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
51
255
0
19 Jun 2019
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
117
860
0
01 Mar 2019
Neural Predictive Belief Representations
Z. Guo
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Rémi Munos
SSL
45
81
0
15 Nov 2018
Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang
Yuxin Wu
Joelle Pineau
OffRL
OOD
56
69
0
14 Nov 2018
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
127
1,877
0
01 Aug 2018
Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation
Niels Justesen
R. Torrado
Philip Bontrager
Ahmed Khalifa
Julian Togelius
S. Risi
110
184
0
28 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
129
531
0
14 Jun 2018
Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning
Anusha Nagabandi
I. Clavera
Simin Liu
R. Fearing
Pieter Abbeel
Sergey Levine
Chelsea Finn
108
548
0
30 Mar 2018
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
238
2,964
0
20 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
811
11,894
0
09 Mar 2017
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
97
978
0
17 Nov 2016
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
81
1,018
0
09 Nov 2016
Deep Visual Foresight for Planning Robot Motion
Chelsea Finn
Sergey Levine
111
784
0
03 Oct 2016
Learning Deconvolution Network for Semantic Segmentation
Hyeonwoo Noh
Seunghoon Hong
Bohyung Han
SSeg
229
4,173
0
17 May 2015