ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.00374
  4. Cited By
Model-Based Reinforcement Learning for Atari
v1v2v3v4v5 (latest)

Model-Based Reinforcement Learning for Atari

1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Model-Based Reinforcement Learning for Atari"

50 / 521 papers shown
Title
Deep Reinforcement Learning with Vector Quantized Encoding
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
58
2
0
12 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRLOnRLAI4CE
87
23
0
08 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
The Benefits of Model-Based Generalization in Reinforcement Learning
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
95
13
0
04 Nov 2022
On Many-Actions Policy Gradient
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
70
0
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
112
10
0
23 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
73
9
0
21 Oct 2022
Trust Region Policy Optimization with Optimal Transport Discrepancies:
  Duality and Algorithm for Continuous Actions
Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions
Antonio Terpin
Nicolas Lanzetti
Batuhan Yardim
Florian Dorfler
Giorgia Ramponi
70
5
0
20 Oct 2022
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu
Tom Zahavy
Volodymyr Mnih
Satinder Singh
SSL
120
7
0
19 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement
  Learning
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
81
17
0
19 Oct 2022
The Impact of Task Underspecification in Evaluating Deep Reinforcement
  Learning
The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning
Vindula Jayawardana
Catherine Tang
Sirui Li
Da Suo
Cathy Wu
OffRL
108
13
0
16 Oct 2022
Maximum entropy exploration in contextual bandits with neural networks
  and energy based models
Maximum entropy exploration in contextual bandits with neural networks and energy based models
A. Elwood
Marco Leonardi
A. Mohamed
A. Rozza
59
1
0
12 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement
  Learning
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
87
28
0
10 Oct 2022
On Neural Consolidation for Transfer in Reinforcement Learning
On Neural Consolidation for Transfer in Reinforcement Learning
Valentin Guillet
Dennis G. Wilson
Carlos Aguilar-Melchor
Emmanuel Rachelson
CLL
54
0
0
05 Oct 2022
Hyperbolic Deep Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
93
22
0
04 Oct 2022
Learning Parsimonious Dynamics for Generalization in Reinforcement
  Learning
Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum
Eric Schulz
58
1
0
29 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space
  Models, and Policies with One Objective
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
106
27
0
18 Sep 2022
Game-theoretic Objective Space Planning
Game-theoretic Objective Space Planning
Hongrui Zheng
Zhijun Zhuang
Johannes Betz
Rahul Mangharam
66
6
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble
  of Deep Networks
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
89
18
0
16 Sep 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
96
30
0
15 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image
  Generator
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
70
29
0
15 Sep 2022
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Augustine N. Mavor-Parker
Matthew J. Sargent
Christian Pehle
Andrea Banino
Lewis D. Griffin
Caswell Barry
73
1
0
14 Sep 2022
Skip Training for Multi-Agent Reinforcement Learning Controller for
  Industrial Wave Energy Converters
Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters
Soumyendu Sarkar
Vineet Gundecha
Sahand Ghorbanpour
Alexander Shmakov
Ashwin Ramesh Babu
Alexandre Frederic Julien Pichard
Mathieu Cocho
55
16
0
13 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid
  generalization
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
55
1
0
07 Sep 2022
On the Origins of Self-Modeling
On the Origins of Self-Modeling
Robert Kwiatkowski
Yuhang Hu
Boyuan Chen
Hod Lipson
64
4
0
05 Sep 2022
Variational Inference for Model-Free and Model-Based Reinforcement
  Learning
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
78
0
0
04 Sep 2022
Transformers are Sample-Efficient World Models
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLMOffRL
195
189
0
01 Sep 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A
  Review
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSLOffRL
105
13
0
27 Aug 2022
Light-weight probing of unsupervised representations for Reinforcement
  Learning
Light-weight probing of unsupervised representations for Reinforcement Learning
Wancong Zhang
Anthony GX-Chen
Vlad Sobal
Yann LeCun
Nicolas Carion
SSLOffRL
76
14
0
25 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning
  in Online Reinforcement Learning
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Shuang Qiu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSLOffRL
143
32
0
29 Jul 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary
  Differential Equations
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations
Achkan Salehi
Steffen Rühl
Stéphane Doncieux
AI4CE
90
2
0
25 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning
  Perspective
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRLAI4CE
70
33
0
13 Jul 2022
Unsupervised learning of observation functions in state-space models by
  nonparametric moment methods
Unsupervised learning of observation functions in state-space models by nonparametric moment methods
Qi An
Yannis G. Kevrekidis
Fei Lu
Mauro Maggioni
44
2
0
12 Jul 2022
Multi-objective Optimization of Notifications Using Offline
  Reinforcement Learning
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
60
6
0
07 Jul 2022
Compositional Generalization in Grounded Language Learning via Induced
  Model Sparsity
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury
Alexander Ilin
76
8
0
06 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
118
38
0
03 Jul 2022
Depth-CUPRL: Depth-Imaged Contrastive Unsupervised Prioritized
  Representations in Reinforcement Learning for Mapless Navigation of Unmanned
  Aerial Vehicles
Depth-CUPRL: Depth-Imaged Contrastive Unsupervised Prioritized Representations in Reinforcement Learning for Mapless Navigation of Unmanned Aerial Vehicles
J. C. Jesus
V. A. Kich
A. H. Kolling
Ricardo B. Grando
R. S. Guerra
P. Drews
SSL
146
18
0
30 Jun 2022
Visual Foresight With a Local Dynamics Model
Visual Foresight With a Local Dynamics Model
Colin Kohler
Robert Platt
62
1
0
29 Jun 2022
Masked World Models for Visual Control
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
186
149
0
28 Jun 2022
Value-Consistent Representation Learning for Data-Efficient
  Reinforcement Learning
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Yang Yue
Bingyi Kang
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
99
13
0
25 Jun 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via
  Contrastive Learning
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
Haoqi Yuan
Zongqing Lu
SSLOffRL
91
42
0
21 Jun 2022
Relative Policy-Transition Optimization for Fast Policy Transfer
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
47
0
0
13 Jun 2022
A Relational Intervention Approach for Unsupervised Dynamics
  Generalization in Model-Based Reinforcement Learning
A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning
Jinpei Guo
Biwei Huang
Dacheng Tao
66
20
0
09 Jun 2022
Generalized Data Distribution Iteration
Generalized Data Distribution Iteration
Jiajun Fan
Changnan Xiao
OffRL
53
13
0
07 Jun 2022
Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL
Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL
Abhinav Bhatia
Philip S. Thomas
S. Zilberstein
OffRL
38
3
0
06 Jun 2022
Beyond Value: CHECKLIST for Testing Inferences in Planning-Based RL
Beyond Value: CHECKLIST for Testing Inferences in Planning-Based RL
Kin-Ho Lam
Delyar Tabatabai
Jed Irvine
Donald Bertucci
Anita Ruangrotsakun
Minsuk Kahng
Alan Fern
OffRL
56
1
0
04 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
41
2
0
31 May 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement
  Learning
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Remo Sasso
M. Sabatelli
M. Wiering
106
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in
  World Models
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Geng Chen
Yunbo Wang
Xiaokang Yang
106
43
0
27 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
90
27
0
25 May 2022
Flexible Diffusion Modeling of Long Videos
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian D. Weilbach
Frank Wood
DiffMBDLVGen
246
298
0
23 May 2022
Previous
123456...91011
Next