Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.01906
Cited By
Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks
6 September 2018
Felix Leibfried
Peter Vrancx
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks"
21 / 21 papers shown
Title
Deep Reinforcement Learning with Model Learning and Monte Carlo Tree Search in Minecraft
Stephan Alaniz
14
16
0
22 Mar 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
84
238
0
25 Feb 2018
Learning and Querying Fast Generative Models for Reinforcement Learning
Lars Buesing
T. Weber
S. Racanière
S. M. Ali Eslami
Danilo Jimenez Rezende
...
Fabio Viola
F. Besse
Karol Gregor
Demis Hassabis
Daan Wierstra
OffRL
50
134
0
08 Feb 2018
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
94
2,255
0
06 Oct 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
65
552
0
19 Jul 2017
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
65
332
0
11 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
96
2,416
0
15 May 2017
Recurrent Environment Simulators
Silvia Chiappa
S. Racanière
Daan Wierstra
S. Mohamed
44
207
0
07 Apr 2017
A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games
Felix Leibfried
Nate Kushman
Katja Hofmann
127
43
0
21 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
41
1,225
0
16 Nov 2016
Learning Continuous Control Policies by Stochastic Value Gradients
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
80
560
0
30 Oct 2015
Deep Spatial Autoencoders for Visuomotor Learning
Chelsea Finn
X. Tan
Yan Duan
Trevor Darrell
Sergey Levine
Pieter Abbeel
SSL
37
551
0
21 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
74
852
0
31 Jul 2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
Bradly C. Stadie
Sergey Levine
Pieter Abbeel
71
502
0
03 Jul 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
44
839
0
24 Jun 2015
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
212
3,418
0
02 Apr 2015
From Pixels to Torques: Policy Learning with Deep Dynamical Models
Niklas Wahlström
Thomas B. Schon
M. Deisenroth
46
189
0
08 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
776
149,474
0
22 Dec 2014
Learning to Generate Chairs, Tables and Cars with Convolutional Networks
Alexey Dosovitskiy
Jost Tobias Springenberg
Maxim Tatarchenko
Thomas Brox
GAN
106
676
0
21 Nov 2014
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
70
2,992
0
19 Jul 2012
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
59
203
0
13 Jun 2012
1