ResearchTrend.AI
Model-Based Reinforcement Learning for Atari

1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
    OffRL

Papers citing "Model-Based Reinforcement Learning for Atari"

50 / 521 papers shown
Will we ever have Conscious Machines?
P. Krauss
Andreas Maier
79
30
0
31 Mar 2020
Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari
Kacper Kielak
OffRL
51
8
0
23 Mar 2020
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
127
113
0
18 Mar 2020
Active Perception and Representation for Robotic Manipulation
Youssef Y. Zaky
Gaurav Paruthi
B. Tripp
James Bergstra
86
16
0
15 Mar 2020
An Adversarial Objective for Scalable Exploration
Bernadette Bucher
Karl Schmeckpeper
Nikolai Matni
Kostas Daniilidis
65
2
0
13 Mar 2020
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation
Wilson Yan
Ashwin Vangipuram
Pieter Abbeel
Lerrel Pinto
108
191
0
11 Mar 2020
Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning
William Agnew
Pedro M. Domingos
OffRL
95
3
0
03 Mar 2020
Predictive Coding for Locally-Linear Control
Rui Shu
Tung D. Nguyen
Yinlam Chow
Tu Pham
Khoat Than
Mohammad Ghavamzadeh
Stefano Ermon
Hung Bui
OffRL BDL
104
25
0
02 Mar 2020
PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference
Masashi Okada
Norio Kosaka
T. Taniguchi
68
43
0
01 Mar 2020
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts
Michael Gimelfarb
Scott Sanner
Chi-Guhn Lee
42
1
0
29 Feb 2020
Reinforcement Learning through Active Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
AI4CE
88
72
0
28 Feb 2020
Hallucinative Topological Memory for Zero-Shot Visual Planning
Kara Liu
Thanard Kurutach
Christine Tung
Pieter Abbeel
Aviv Tamar
89
48
0
27 Feb 2020
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
Elise van der Pol
Thomas Kipf
F. Oliehoek
Max Welling
81
80
0
27 Feb 2020
Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
Yuanyi Zhong
Alex Schwing
Jian Peng
DRL
121
5
0
21 Feb 2020
Frequency-based Search-control in Dyna
Yangchen Pan
Jincheng Mei
Amir-massoud Farahmand
51
15
0
14 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
113
34
0
07 Feb 2020
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
99
49
0
07 Feb 2020
Neuro-evolutionary Frameworks for Generalized Learning Agents
Thommen George Karimpanal
22
1
0
04 Feb 2020
SEERL: Sample Efficient Ensemble Reinforcement Learning
Rohan Saphal
Balaraman Ravindran
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
65
19
0
15 Jan 2020
A Probabilistic Simulator of Spatial Demand for Product Allocation
Porter Jenkins
Hua Wei
J. S. Jenkins
Z. Li
23
6
0
09 Jan 2020
Learning Predictive Models From Observation and Interaction
Karl Schmeckpeper
Annie Xie
Oleh Rybkin
Stephen Tian
Kostas Daniilidis
Sergey Levine
Chelsea Finn
DRL
91
60
0
30 Dec 2019
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRL OffRL
72
63
0
23 Dec 2019
Uncertainty-sensitive Learning and Planning with Ensembles
Piotr Miłoś
Łukasz Kuciński
K. Czechowski
Piotr Kozakowski
Maciek Klimek
OffRL
95
8
0
19 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit order book
Jacobo Roa-Vicens
Yuanbo Wang
Virgile Mison
Y. Gal
Ricardo M. A. Silva
51
3
0
09 Dec 2019
Reinforcement Learning-based Visual Navigation with Information-Theoretic Regularization
Qiaoyun Wu
Kai Xu
Jun Wang
Mingliang Xu
Xiaoxi Gong
Tianyi Zhou
87
30
0
09 Dec 2019
Combining Q-Learning and Search with Amortized Value Estimates
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
T. Weber
Lars Buesing
Peter W. Battaglia
OffRL
87
48
0
05 Dec 2019
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
87
77
0
05 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
232
1,378
0
03 Dec 2019
Simulation-based reinforcement learning for real-world autonomous driving
B. Osinski
Adam Jakubowski
Piotr Milos
Pawel Ziecina
Christopher Galias
S. Homoceanu
Henryk Michalewski
112
122
0
29 Nov 2019
Contrastive Learning of Structured World Models
Thomas Kipf
Elise van der Pol
Max Welling
OCL DRL
140
285
0
27 Nov 2019
Biologically inspired architectures for sample-efficient deep reinforcement learning
Pierre Harvey Richemond
Arinbjorn Kolbeinsson
Yike Guo
59
2
0
25 Nov 2019
Scaling active inference
Alexander Tschantz
Manuel Baltieri
A. Seth
Christopher L. Buckley
BDL AI4CE
73
69
0
24 Nov 2019
Planning with Goal-Conditioned Policies
Soroush Nasiriany
Vitchyr H. Pong
Steven Lin
Sergey Levine
OffRL
163
219
0
19 Nov 2019
Learning Representations in Reinforcement Learning: An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
76
10
0
12 Nov 2019
High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks
Ruben Villegas
Arkanath Pathak
Harini Kannan
D. Erhan
Quoc V. Le
Honglak Lee
VGen
80
139
0
05 Nov 2019
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
C. Freeman
Luke Metz
David R Ha
89
36
0
29 Oct 2019
Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics
Shuo Li
Osbert Bastani
77
86
0
24 Oct 2019
Zero-shot Policy Learning with Spatial Temporal Reward Decomposition on Contingency-aware Observation
Huazhe Xu
Boyuan Chen
Yang Gao
Trevor Darrell
OffRL
37
2
0
17 Oct 2019
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
168
305
0
16 Oct 2019
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
88
41
0
09 Oct 2019
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books
Haoran Wei
Yuanbo Wang
L. Mangu
Keith S. Decker
67
25
0
09 Oct 2019
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions
Petros Christodoulou
R. T. Lange
A. Shafti
A. Faisal
67
1
0
07 Oct 2019
Structured Object-Aware Physics Prediction for Video Modeling and Planning
Jannik Kossen
Karl Stelzner
Marcel Hussing
C. Voelcker
Kristian Kersting
OCL
113
70
0
06 Oct 2019
Making sense of sensory input
Maciej Wołczyk
Jacek Tabor
Johannes Welbl
Szymon Maszke
Marek Sergot
92
53
0
05 Oct 2019
Zero Shot Learning on Simulated Robots
Robert Kwiatkowski
Hod Lipson
46
0
0
04 Oct 2019
Mathematical Reasoning in Latent Space
Dennis Lee
Christian Szegedy
M. Rabe
Sarah M. Loos
Kshitij Bansal
83
34
0
26 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
80
68
0
25 Sep 2019
Gradient-Aware Model-based Policy Search
P. D'Oro
Alberto Maria Metelli
Andrea Tirinzoni
Matteo Papini
Marcello Restelli
93
36
0
09 Sep 2019
Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control
Nir Levine
Yinlam Chow
Rui Shu
Ang Li
Mohammad Ghavamzadeh
Hung Bui
67
30
0
04 Sep 2019
Reusing Convolutional Activations from Frame to Frame to Speed up Training and Inference
Arno Khachatourian
28
0
0
02 Sep 2019