arXiv:1903.00374
Model-Based Reinforcement Learning for Atari

1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
    OffRL

Papers citing "Model-Based Reinforcement Learning for Atari"

50 / 521 papers shown
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
183
43
0
13 Feb 2024
Improving Token-Based World Models with Parallel Observation Prediction
Lior Cohen
Kaixin Wang
Bingyi Kang
Shie Mannor
84
6
0
08 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
135
20
0
05 Feb 2024
Locality Sensitive Sparse Encoding for Learning World Models Online
Zi-Yan Liu
Chao Du
Wee Sun Lee
Min Lin
KELM, CLL, OffRL
92
11
0
23 Jan 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS, AI4CE
96
29
0
17 Jan 2024
Solving Continual Offline Reinforcement Learning with Decision Transformer
Kaixin Huang
Li Shen
Chen Zhao
Chun Yuan
Dacheng Tao
CLL, OffRL
95
5
0
16 Jan 2024
CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design
Zeji Yi
Chaoyi Pan
Guanqi He
Guannan Qu
Guanya Shi
79
10
0
14 Jan 2024
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Quentin Delfosse
Sebastian Sztwiertnia
M. Rothermel
Wolfgang Stammer
Kristian Kersting
139
20
0
11 Jan 2024
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
Zhaohui Jiang
Paul Weng
OffRL
71
0
0
10 Jan 2024
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Zhiming Zheng
97
0
0
30 Dec 2023
AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI
Lihang Pan
Bowen Wang
Chun Yu
Yuxuan Chen
Xiangyu Zhang
Yuanchun Shi
84
3
0
26 Dec 2023
Improve Robustness of Reinforcement Learning against Observation Perturbations via $l_\infty$ Lipschitz Policy Networks
Buqing Nie
Jingtian Ji
Yangqing Fu
Yue Gao
84
4
0
14 Dec 2023
World Models via Policy-Guided Trajectory Diffusion
Marc Rigter
Jun Yamada
Ingmar Posner
110
21
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL, OOD
197
5
0
13 Dec 2023
Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning
Zhiting Hu
Tianmin Shu
LLMAG, LM&Ro, LRM
163
37
0
08 Dec 2023
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
103
1
0
08 Dec 2023
CODEX: A Cluster-Based Method for Explainable Reinforcement Learning
Timothy K. Mathes
Jessica Inman
Andrés Colón
Simon Khan
OffRL
40
1
0
07 Dec 2023
Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models
Xingyuan Zhang
Philip Becker-Ehmck
Patrick van der Smagt
Maximilian Karl
80
6
0
04 Dec 2023
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion
Lunjun Zhang
Yuwen Xiong
Ze Yang
Sergio Casas
Rui Hu
R. Urtasun
106
60
0
02 Nov 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Hoang Trung-Dung
Guy Van den Broeck
Yitao Liang
OffRL
136
1
0
31 Oct 2023
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
65
2
0
30 Oct 2023
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Kyowoon Lee
Seongun Kim
Jaesik Choi
DiffM
81
11
0
30 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Zhongjian Qiao
Jiafei Lyu
Xiu Li
70
3
0
23 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning
Weipu Zhang
Gang Wang
Jian Sun
Yetian Yuan
Gao Huang
107
45
0
14 Oct 2023
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Guozheng Ma
Lu Li
Sen Zhang
Zixuan Liu
Zhen Wang
Yixin Chen
Li Shen
Xueqian Wang
Dacheng Tao
OffRL
97
21
0
11 Oct 2023
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
Paul Mattes
Rainer Schlosser
R. Herbrich
73
5
0
08 Oct 2023
Unifying Foundation Models with Quadrotor Control for Visual Tracking Beyond Object Categories
Alessandro Saviolo
P. Rao
Vivek Radhakrishnan
Jiuhong Xiao
Giuseppe Loianno
93
6
0
07 Oct 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
104
19
0
05 Oct 2023
Differentially Encoded Observation Spaces for Perceptive Reinforcement Learning
Lev Grossman
Brian Plancher
OffRL
49
0
0
03 Oct 2023
HarmonyDream: Task Harmonization Inside World Models
Haoyu Ma
Jialong Wu
Ningya Feng
Chenjun Xiao
Dong Li
Jianye Hao
Jianmin Wang
Mingsheng Long
80
8
0
30 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
136
253
0
29 Sep 2023
Deep Learning in Deterministic Computational Mechanics
L. Herrmann
Stefan Kollmannsberger
AI4CE, PINN
118
0
0
27 Sep 2023
Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation
Zihang Wang
Maowei Jiang
AI4CE
82
0
0
25 Sep 2023
MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation
Patrick E. Lancaster
Nicklas Hansen
Aravind Rajeswaran
Vikash Kumar
LM&Ro
93
16
0
25 Sep 2023
Deep Reinforcement Learning for the Heat Transfer Control of Pulsating Impinging Jets
Sajad Salavatidezfouli
G. Stabile
G. Rozza
AI4CE
23
4
0
25 Sep 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
Xiao-Yin Liu
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL, OOD
86
5
0
16 Sep 2023
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
Homanga Bharadhwaj
Jay Vakil
Mohit Sharma
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
LM&Ro
118
132
0
05 Sep 2023
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Tianchi Cai
Shenliao Bao
Jiyan Jiang
Shiji Zhou
Wenpeng Zhang
Lihong Gu
Jinjie Gu
Guannan Zhang
OffRL
65
2
0
25 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
89
32
0
14 Aug 2023
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
Omer Veysel Cagatan
Barış Akgün
BDL, OffRL
98
4
0
08 Aug 2023
Elastic Decision Transformer
Yueh-hua Wu
Xiaolong Wang
Masashi Hamaya
OffRL
135
43
0
05 Jul 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
117
0
0
05 Jul 2023
Curious Replay for Model-based Adaptation
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
68
12
0
28 Jun 2023
Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models
N. Botteghi
Federico Califano
M. Poel
C. Brune
DiffM, AI4CE
72
11
0
27 Jun 2023
Introspective Action Advising for Interpretable Transfer Learning
Joseph Campbell
Yue (Sophie) Guo
Fiona Xie
Simon Stepputtis
Katia Sycara
105
2
0
21 Jun 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
157
30
0
19 Jun 2023
Optimal Exploration for Model-Based RL in Nonlinear Systems
Andrew Wagenmaker
Guanya Shi
Kevin Jamieson
87
15
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
86
1
0
15 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Johannes Czech
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
108
18
0
14 Jun 2023
Robust Reinforcement Learning through Efficient Adversarial Herding
Juncheng Dong
Hao-Lun Hsu
Qitong Gao
Vahid Tarokh
Miroslav Pajic
88
4
0
12 Jun 2023