ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.00374
  4. Cited By
Model-Based Reinforcement Learning for Atari
v1v2v3v4v5 (latest)

Model-Based Reinforcement Learning for Atari

1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Model-Based Reinforcement Learning for Atari"

50 / 521 papers shown
Title
On the Importance of Feature Decorrelation for Unsupervised
  Representation Learning in Reinforcement Learning
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning
Hojoon Lee
Ko-tik Lee
Dongyoon Hwang
Hyunho Lee
ByungKun Lee
Jaegul Choo
SSLOOD
60
5
0
09 Jun 2023
Finding Counterfactually Optimal Action Sequences in Continuous State
  Spaces
Finding Counterfactually Optimal Action Sequences in Continuous State Spaces
Stratis Tsirtsis
Manuel Gomez Rodriguez
CMLOffRL
113
11
0
06 Jun 2023
Model-Based Reinforcement Learning with Multi-Task Offline Pretraining
Model-Based Reinforcement Learning with Multi-Task Offline Pretraining
Minting Pan
Yitao Zheng
Yunbo Wang
Xiaokang Yang
OffRL
80
0
0
06 Jun 2023
Active Vision Reinforcement Learning under Limited Visual Observability
Active Vision Reinforcement Learning under Limited Visual Observability
Jinghuan Shang
Michael S. Ryoo
88
0
0
01 Jun 2023
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
Wei Xiao
Tsun-Hsuan Wang
Chuang Gan
Daniela Rus
DiffM
66
32
0
31 May 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
124
102
0
30 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for
  Reinforcement Learning
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
84
33
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
66
9
0
29 May 2023
Deep Reinforcement Learning with Plasticity Injection
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
69
52
0
24 May 2023
KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning
  World Models in Autonomous Driving Tasks
KARNet: Kalman Filter Augmented Recurrent Neural Network for Learning World Models in Autonomous Driving Tasks
Hemanth Manjunatha
A. Pak
Dimitar Filev
Panagiotis Tsiotras
77
5
0
24 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRLMU
82
6
0
22 May 2023
A Generalist Dynamics Model for Control
A Generalist Dynamics Model for Control
Ingmar Schubert
Jingwei Zhang
Jake Bruce
Sarah Bechtle
Emilio Parisotto
Martin Riedmiller
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
N. Heess
AI4CE
95
33
0
18 May 2023
Explainable Reinforcement Learning via a Causal World Model
Explainable Reinforcement Learning via a Causal World Model
Zhongwei Yu
Jingqing Ruan
Dengpeng Xing
CML
104
16
0
04 May 2023
Posterior Sampling for Deep Reinforcement Learning
Posterior Sampling for Deep Reinforcement Learning
Remo Sasso
Michelangelo Conserva
Paulo E. Rauber
OffRLBDL
71
7
0
30 Apr 2023
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs
  Transformation
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation
Guoliang He
Sean Parker
Eiko Yoneki
56
4
0
28 Apr 2023
Proto-Value Networks: Scaling Representation Learning with Auxiliary
  Tasks
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Jesse Farebrother
Joshua Greaves
Rishabh Agarwal
Charline Le Lan
Ross Goroshin
Pablo Samuel Castro
Marc G. Bellemare
106
29
0
25 Apr 2023
A Cookbook of Self-Supervised Learning
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDaFedMLSSL
166
285
0
24 Apr 2023
Hierarchical State Abstraction Based on Structural Information
  Principles
Hierarchical State Abstraction Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
Chunyang Liu
Lifang He
Philip S. Yu
64
20
0
24 Apr 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential
  Decision Making
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núnez-Molina
Pablo Mesejo
Juan Fernández-Olivares
130
3
0
20 Apr 2023
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient
  Multi-Agent Reinforcement Learning
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning
Aravind Venugopal
Stephanie Milani
Fei Fang
Balaraman Ravindran
OffRL
64
1
0
12 Apr 2023
Habits and goals in synergy: a variational Bayesian framework for
  behavior
Habits and goals in synergy: a variational Bayesian framework for behavior
Dongqi Han
Kenji Doya
Dongsheng Li
Jun Tani
BDL
78
215
0
11 Apr 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Geng Chen
Yitao Zheng
Yunbo Wang
Xiaokang Yang
91
0
0
27 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
78
1
0
23 Mar 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
55
3
0
17 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
115
88
0
13 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
59
1
0
09 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and
  Opportunities
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&RoOffRLLRMAI4CE
203
172
0
07 Mar 2023
Ensemble Reinforcement Learning: A Survey
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
100
41
0
05 Mar 2023
RePreM: Representation Pre-training with Masked Model for Reinforcement
  Learning
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning
Yuanying Cai
Wei Shen
Wei Shen
Xuyun Zhang
Wenjie Ruan
Longbo Huang
OffRL
99
5
0
03 Mar 2023
Data-efficient, Explainable and Safe Box Manipulation: Illustrating the
  Advantages of Physical Priors in Model-Predictive Control
Data-efficient, Explainable and Safe Box Manipulation: Illustrating the Advantages of Physical Priors in Model-Predictive Control
Achkan Salehi
Stéphane Doncieux
OffRL
89
2
0
02 Mar 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
117
100
0
24 Feb 2023
CERiL: Continuous Event-based Reinforcement Learning
CERiL: Continuous Event-based Reinforcement Learning
Celyn Walters
Simon Hadfield
OffRL
54
2
0
15 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
119
9
0
14 Feb 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
129
10
0
11 Feb 2023
Learning Interaction-aware Motion Prediction Model for Decision-making
  in Autonomous Driving
Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Wenhui Huang
Chen Lv
79
18
0
08 Feb 2023
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for
  Dynamic Environments
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments
Tan Chong Min John
Mehul Motani
43
2
0
31 Jan 2023
Neural Episodic Control with State Abstraction
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
88
16
0
27 Jan 2023
Sample-Efficient Multi-Objective Learning via Generalized Policy
  Improvement Prioritization
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization
L. N. Alegre
A. Bazzan
D. Roijers
Ann Nowé
Bruno C. da Silva
80
30
0
18 Jan 2023
World Models and Predictive Coding for Cognitive and Developmental
  Robotics: Frontiers and Challenges
World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges
T. Taniguchi
Shingo Murata
Masahiro Suzuki
D. Ognibene
Pablo Lanillos
...
L. Jamone
Tomoaki Nakamura
Alejandra Ciria
B. Lara
G. Pezzulo
103
57
0
14 Jan 2023
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics
  Models
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics Models
Yi Liu
Gaurav Datta
Ellen R. Novoseller
Daniel S. Brown
104
24
0
11 Jan 2023
Mastering Diverse Domains through World Models
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
97
617
0
10 Jan 2023
Exploration in Model-based Reinforcement Learning with Randomized Reward
Exploration in Model-based Reinforcement Learning with Randomized Reward
Lingxiao Wang
Ping Li
72
0
0
09 Jan 2023
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via
  Sequence Modeling
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling
Yilin Wen
Biao Luo
Yuqian Zhao
50
1
0
06 Jan 2023
Risk-Averse MDPs under Reward Ambiguity
Risk-Averse MDPs under Reward Ambiguity
Haolin Ruan
Zhi Chen
C. Ho
93
2
0
03 Jan 2023
New Challenges in Reinforcement Learning: A Survey of Security and
  Privacy
New Challenges in Reinforcement Learning: A Survey of Security and Privacy
Yunjiao Lei
Dayong Ye
Sheng Shen
Yulei Sui
Tianqing Zhu
Wanlei Zhou
135
20
0
31 Dec 2022
Symbolic Visual Reinforcement Learning: A Scalable Framework with
  Object-Level Abstraction and Differentiable Expression Search
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
102
10
0
30 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
148
30
0
29 Dec 2022
PrefRec: Recommender Systems with Human Preferences for Reinforcing
  Long-term User Engagement
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement
Wanqi Xue
Qingpeng Cai
Zhenghai Xue
Shuo Sun
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
62
28
0
06 Dec 2022
The Effectiveness of World Models for Continual Reinforcement Learning
The Effectiveness of World Models for Continual Reinforcement Learning
Samuel Kessler
M. Ostaszewski
Michal Bortkiewicz
M. Żarski
Maciej Wołczyk
Jack Parker-Holder
Stephen J. Roberts
Piotr Milo's
KELMOffRLCLL
85
8
0
29 Nov 2022
A Reinforcement Learning Approach for Process Parameter Optimization in
  Additive Manufacturing
A Reinforcement Learning Approach for Process Parameter Optimization in Additive Manufacturing
Susheel Dharmadhikari
Nandana Menon
A. Basak
OffRLAI4CE
50
30
0
17 Nov 2022
Previous
12345...91011
Next