ResearchTrend.AI
Model-Based Reinforcement Learning for Atari

1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
    OffRL

Papers citing "Model-Based Reinforcement Learning for Atari"

50 / 521 papers shown
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
69
7
0
20 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
348
707
0
20 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. D'Oro
Pierre-Luc Bacon
Rameswar Panda
OnRL
153
198
0
16 May 2022
RLFlow: Optimising Neural Network Subgraph Transformation with World Models
Sean Parker
Sami Alabed
Eiko Yoneki
34
0
0
03 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
96
12
0
02 May 2022
Deep Reinforcement Learning Using a Low-Dimensional Observation Filter for Visual Complex Video Game Playing
V. A. Kich
J. C. Jesus
Ricardo B. Grando
A. H. Kolling
Gabriel V. Heisler
R. S. Guerra
OffRL
42
2
0
24 Apr 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
72
8
0
20 Apr 2022
Separating the World and Ego Models for Self-Driving
Vlad Sobal
A. Canziani
Nicolas Carion
Kyunghyun Cho
Yann LeCun
79
5
0
14 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
112
123
0
25 Mar 2022
Graph Neural Networks for Relational Inductive Bias in Vision-based Deep Reinforcement Learning of Robot Control
Marco Oliva
Soubarna Banik
Josip Josifovski
Alois Knoll
77
5
0
11 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
105
256
0
09 Mar 2022
Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation
Alex Long
Alan Blair
H. V. Hoof
41
3
0
07 Mar 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
109
97
0
19 Feb 2022
Online Decision Transformer
Qinqing Zheng
Amy Zhang
Aditya Grover
OffRL
95
209
0
11 Feb 2022
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning
Hassam Sheikh
Kizza M Nandyose Frisbee
Mariano Phielipp
91
8
0
31 Jan 2022
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation
Martín Bertrán
Walter A. Talbott
Nitish Srivastava
J. Susskind
95
3
0
28 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
105
45
0
28 Jan 2022
Tracking and Planning with Spatial World Models
Baris Kayalibay
Atanas Mirchev
Patrick van der Smagt
Justin Bayer
80
2
0
25 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
77
2
0
21 Jan 2022
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
K. Khamaru
Eric Xia
Martin J. Wainwright
Michael I. Jordan
92
6
0
21 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
116
107
0
11 Jan 2022
Learning Contraction Policies from Offline Data
Navid Rezazadeh
Maxwell Kolarich
Solmaz S. Kia
Negar Mehr
OffRL
72
7
0
11 Dec 2021
Learning Generalizable Behavior via Visual Rewrite Rules
Yiheng Xie
Mingxuan Li
Shangqun Yu
Michael Littman
DRL
21
1
0
09 Dec 2021
A Review for Deep Reinforcement Learning in Atari:Benchmarks, Challenges, and Solutions
Jiajun Fan
OffRL
85
21
0
08 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
93
2
0
06 Dec 2021
Inducing Functions through Reinforcement Learning without Task Specification
Junmo Cho
Dong-hwan Lee
Young-Gyu Yoon
31
2
0
23 Nov 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
61
12
0
19 Nov 2021
ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control
Xingshuai Huang
Di Wu
M. Jenkin
Benoit Boulet
67
15
0
15 Nov 2021
Learning Representations for Pixel-based Control: What Matters and Why?
Manan Tomar
Utkarsh Aashu Mishra
Amy Zhang
Matthew E. Taylor
SSL
OffRL
101
26
0
15 Nov 2021
Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning
Robin Schiewer
Laurenz Wiskott
19
3
0
15 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
38
1
0
12 Nov 2021
BOiLS: Bayesian Optimisation for Logic Synthesis
Antoine Grosnit
C. Malherbe
Rasul Tutunov
Xingchen Wan
Jun Wang
H. Ammar
118
32
0
11 Nov 2021
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
72
20
0
03 Nov 2021
Procedural Generalization by Planning with Self-Supervised World Models
Ankesh Anand
Jacob Walker
Yazhe Li
Eszter Vértes
Julian Schrittwieser
Sherjil Ozair
T. Weber
Jessica B. Hamrick
89
31
0
02 Nov 2021
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
166
242
0
30 Oct 2021
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
25
4
0
28 Oct 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng
Ingook Jang
Sungjin Ahn
VLM
84
62
0
27 Oct 2021
Dream to Explore: Adaptive Simulations for Autonomous Systems
Z. Sheikhbahaee
Dongshu Luo
Blake Vanberlo
S. Yun
A. Safron
Jesse Hoey
DRL
51
0
0
27 Oct 2021
The Efficiency Misnomer
Mostafa Dehghani
Anurag Arnab
Lucas Beyer
Ashish Vaswani
Yi Tay
112
103
0
25 Oct 2021
Self-Consistent Models and Values
Gregory Farquhar
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
95
8
0
25 Oct 2021
DiffSRL: Learning Dynamical State Representation for Deformable Object Manipulation with Differentiable Simulator
Sirui Chen
Yunhao Liu
Jialong Li
Shang Wen Yao
Tingxiang Fan
Jia Pan
AI4CE
72
10
0
24 Oct 2021
Model-based Reinforcement Learning for Service Mesh Fault Resiliency in a Web Application-level
Fanfei Meng
L. Jagadeesan
M. Thottan
AI4CE
29
13
0
21 Oct 2021
Contrastive Active Inference
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
80
26
0
19 Oct 2021
On-Policy Model Errors in Reinforcement Learning
Lukas P. Fröhlich
Maksym Lefarov
Melanie Zeilinger
Felix Berkenkamp
OnRL
81
6
0
15 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task
Chuang Gan
Abhishek Bhandwaldar
Antonio Torralba
J. Tenenbaum
Phillip Isola
LRM
205
4
0
13 Oct 2021
Action-Sufficient State Representation Learning for Control with Structural Constraints
Biwei Huang
Chaochao Lu
Liu Leqi
José Miguel Hernández-Lobato
Clark Glymour
Bernhard Schölkopf
Kun Zhang
98
36
0
12 Oct 2021
Neural Algorithmic Reasoners are Implicit Planners
Andreea Deac
Petar Veličković
Ognjen Milinković
Pierre-Luc Bacon
Jian Tang
Mladen Nikolic
OffRL
72
24
0
11 Oct 2021
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
85
2
0
06 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
72
15
0
05 Oct 2021
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning
Jing Bi
Jiebo Luo
Chenliang Xu
126
49
0
05 Oct 2021