Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1903.00374
Cited By
v1
v2
v3
v4
v5 (latest)
Model-Based Reinforcement Learning for Atari
1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Model-Based Reinforcement Learning for Atari"
50 / 521 papers shown
Title
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
Yevgen Chebotar
Karol Hausman
Yao Lu
Ted Xiao
Dmitry Kalashnikov
...
A. Irpan
Benjamin Eysenbach
Ryan Julian
Chelsea Finn
Sergey Levine
SSL
OffRL
85
153
0
15 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
111
66
0
13 Apr 2021
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
Philip J. Ball
Cong Lu
Jack Parker-Holder
Stephen J. Roberts
OffRL
112
45
0
12 Apr 2021
Adaptive Variants of Optimal Feedback Policies
B. Lopez
Jean-Jacques E. Slotine
OffRL
74
4
0
06 Apr 2021
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
Allan Zhou
Archit Sharma
Chelsea Finn
OffRL
67
3
0
24 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
47
17
0
15 Mar 2021
Adapting User Interfaces with Model-based Reinforcement Learning
Kashyap Todi
G. Bailly
Luis A. Leiva
Antti Oulasvirta
85
89
0
11 Mar 2021
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
142
207
0
08 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
77
11
0
04 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
138
4
0
03 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
83
26
0
24 Feb 2021
Greedy-Step Off-Policy Reinforcement Learning
Yuhui Wang
Qingyuan Wu
Pengcheng He
Xiaoyang Tan
OffRL
59
1
0
23 Feb 2021
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning
DiJia Su
Jason D. Lee
John M. Mulvey
H. Vincent Poor
OffRL
62
6
0
23 Feb 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu
Wei Shen
Li Zhao
Tao Qin
Jinhua Zhu
Jian Li
Nenghai Yu
Tie-Yan Liu
SSL
OffRL
100
48
0
22 Feb 2021
Sim-Env: Decoupling OpenAI Gym Environments from Simulation Models
Andreas Schuderer
Stefano Bromuri
M. V. Eekelen
AI4CE
43
2
0
19 Feb 2021
Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
Lucas Liebenwein
Ryan M Sander
S. Karaman
Daniela Rus
BDL
85
37
0
19 Feb 2021
Neuro-algorithmic Policies enable Fast Combinatorial Generalization
Marin Vlastelica
Michal Rolínek
Georg Martius
74
17
0
15 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
76
18
0
13 Feb 2021
Planning and Learning Using Adaptive Entropy Tree Search
Piotr Kozakowski
Mikolaj Pacek
Piotr Milo's
45
2
0
12 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Piotr Kozakowski
Lukasz Kaiser
Henryk Michalewski
Afroz Mohiuddin
Katarzyna Kañska
OffRL
81
5
0
12 Feb 2021
Machine Learning for Mechanical Ventilation Control
Daniel Suo
Naman Agarwal
Wenhan Xia
Xinyi Chen
Udaya Ghai
...
J. LaChance
Tom Zadjel
Manuel Schottdorf
Daniel J. Cohen
Elad Hazan
OOD
AI4CE
138
10
0
12 Feb 2021
Multi-Task Reinforcement Learning with Context-based Representations
Shagun Sodhani
Amy Zhang
Joelle Pineau
98
192
0
11 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
Model-Augmented Q-learning
Youngmin Oh
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
47
1
0
07 Feb 2021
Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning
Matthew Lyle Olson
Roli Khanna
Lawrence Neal
Fuxin Li
Weng-Keen Wong
CML
92
75
0
29 Jan 2021
Prior Preference Learning from Experts:Designing a Reward with Active Inference
Jinyoung Shin
Cheolhyeong Kim
H. Hwang
100
9
0
22 Jan 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
123
75
0
01 Jan 2021
Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
Chris Cundy
Rishi Desai
Stefano Ermon
OffRL
127
4
0
30 Dec 2020
Causal World Models by Unsupervised Deconfounding of Physical Dynamics
Minne Li
Mengyue Yang
Furui Liu
Xu Chen
Zhitang Chen
Jun Wang
SyDa
CML
50
13
0
28 Dec 2020
A Tutorial on Sparse Gaussian Processes and Variational Inference
Felix Leibfried
Vincent Dutordoir
S. T. John
N. Durrande
GP
176
52
0
27 Dec 2020
Stochastic Action Prediction for Imitation Learning
S. Venkatesh
N. Rathod
Shishir Kolathaya
B. Amrutur
37
0
0
26 Dec 2020
Planning from Pixels in Atari with Learned Symbolic Representations
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
94
11
0
16 Dec 2020
Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning
Mohammad Babaeizadeh
M. Saffar
Danijar Hafner
Harini Kannan
Chelsea Finn
Sergey Levine
D. Erhan
VLM
65
9
0
08 Dec 2020
Planning from Pixels using Inverse Dynamics Models
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
BDL
75
41
0
04 Dec 2020
Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp
Junfan Lin
Zhongzhan Huang
Keze Wang
Xiaodan Liang
Weiwei Chen
Liang Lin
30
11
0
30 Nov 2020
Minimax Sample Complexity for Turn-based Stochastic Game
Qiwen Cui
Lin F. Yang
89
23
0
29 Nov 2020
Unsupervised Object Keypoint Learning using Local Spatial Predictability
Anand Gopalakrishnan
Sjoerd van Steenkiste
Jürgen Schmidhuber
SSL
68
21
0
25 Nov 2020
Safely Learning Dynamical Systems from Short Trajectories
Amir Ali Ahmadi
A. Chaudhry
Vikas Sindhwani
Stephen Tu
61
5
0
24 Nov 2020
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Thomas Mesnard
T. Weber
Fabio Viola
S. Thakoor
Alaa Saade
...
A. Guez
Éric Moulines
Marcus Hutter
Lars Buesing
Rémi Munos
CML
OffRL
111
58
0
18 Nov 2020
Explaining Conditions for Reinforcement Learning Behaviors from Real and Imagined Data
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
OffRL
50
4
0
17 Nov 2020
Distilling a Hierarchical Policy for Planning and Control via Representation and Reinforcement Learning
Jung-Su Ha
Young-Jin Park
Hyeok-Joo Chae
Soon-Seo Park
Han-Lim Choi
126
3
0
16 Nov 2020
Reinforcement Learning with Dual-Observation for General Video Game Playing
Chengpeng Hu
Ziqi Wang
Tianye Shu
Hao Tong
Julian Togelius
Xinghu Yao
Jialin Liu
OffRL
79
9
0
11 Nov 2020
Deep Reinforcement Learning for Navigation in AAA Video Games
Eloi Alonso
Maxim Peter
David Goumard
Joshua Romoff
59
37
0
09 Nov 2020
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
112
66
0
08 Nov 2020
Learning World Transition Model for Socially Aware Robot Navigation
Yuxiang Cui
Haodong Zhang
Yue Wang
R. Xiong
76
17
0
08 Nov 2020
Representation Matters: Improving Perception and Exploration for Robotics
Markus Wulfmeier
Arunkumar Byravan
Tim Hertweck
I. Higgins
Ankush Gupta
...
Malcolm Reynolds
Denis Teplyashin
Roland Hafner
Thomas Lampe
Martin Riedmiller
98
16
0
03 Nov 2020
Sample-efficient reinforcement learning using deep Gaussian processes
Charles W. L. Gadd
Markus Heinonen
Harri Lähdesmäki
Samuel Kaski
GP
BDL
66
4
0
02 Nov 2020
Fast Reinforcement Learning with Incremental Gaussian Mixture Models
R. Pinto
24
1
0
02 Nov 2020
Low-Variance Policy Gradient Estimation with World Models
Michal Nauman
Floris den Hengst
OffRL
51
1
0
29 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
47
0
0
24 Oct 2020
Previous
1
2
3
...
10
11
7
8
9
Next