Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1903.00374
Cited By
v1
v2
v3
v4
v5 (latest)
Model-Based Reinforcement Learning for Atari
1 March 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
K. Czechowski
D. Erhan
Chelsea Finn
Piotr Kozakowski
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Model-Based Reinforcement Learning for Atari"
50 / 521 papers shown
Title
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Li Shen
Pierre-Luc Bacon
Dacheng Tao
OffRL
27
0
0
20 Jun 2025
Different Questions, Different Models: Fine-Grained Evaluation of Uncertainty and Calibration in Clinical QA with LLMs
Alberto Testoni
Iacer Calixto
ELM
118
0
0
12 Jun 2025
Time-Aware World Model for Adaptive Prediction and Control
Anh N. Nhu
Sanghyun Son
Ming-Chyuan Lin
AI4TS
TTA
38
0
0
10 Jun 2025
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
Jan Robine
Marc Höftmann
Stefan Harmeling
DRL
OCL
71
1
0
03 Jun 2025
State-Covering Trajectory Stitching for Diffusion Planners
Kyowoon Lee
Jaesik Choi
OffRL
44
0
0
01 Jun 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
100
0
0
29 May 2025
Normalizing Flows are Capable Models for RL
Raj Ghugare
Benjamin Eysenbach
OffRL
AI4CE
90
0
0
29 May 2025
Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective
Yang Zhang
Xinran Li
Jianing Ye
Delin Qu
Shuang Qiu
Chongjie Zhang
Xiu Li
Chenjia Bai
52
0
0
27 May 2025
OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation
Raktim Gautam Goswami
Prashanth Krishnamurthy
Yann LeCun
Farshad Khorrami
VGen
OffRL
58
0
0
26 May 2025
Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
Ghada Sokar
Pablo Samuel Castro
59
0
0
23 May 2025
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach
Xiaoran Yin
Xu Luo
Hao Wu
Lianli Gao
Jingkuan Song
95
0
0
22 May 2025
RLVR-World: Training World Models with Reinforcement Learning
Jialong Wu
Shaofeng Yin
Ningya Feng
Mingsheng Long
OffRL
VGen
87
2
0
20 May 2025
TD-GRPC: Temporal Difference Learning with Group Relative Policy Constraint for Humanoid Locomotion
Khang Nguyen
Khai Nguyen
An T. Le
Jan Peters
Manfred Huber
Ngo Anh Vien
Minh Nhat Vu
68
0
0
19 May 2025
Building spatial world models from sparse transitional episodic memories
Zizhan He
Maxime Daigle
Pouya Bashivan
KELM
60
0
0
19 May 2025
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
Jun Guo
Xiaojian Ma
Yikai Wang
Min Yang
Huaping Liu
Qing Li
VGen
78
0
0
15 May 2025
Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles
Matteo Gallici
Ivan Masmitja
Mario Martin
OffRL
69
0
0
13 May 2025
Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning
Feiyu Lu
Mengyu Chen
Hsiang Hsu
Pranav Deshpande
Cheng Yao Wang
Blair MacIntyre
86
4
0
30 Apr 2025
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
223
4
0
29 Apr 2025
Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator
Chenhao Li
Andreas Krause
Marco Hutter
OffRL
56
0
0
23 Apr 2025
Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning
Tao He
Lizi Liao
Ming Liu
Bing Qin
88
1
0
18 Apr 2025
Neural Motion Simulator: Pushing the Limit of World Models in Reinforcement Learning
Chenjie Hao
Weyl Lu
Yifan Xu
Yubei Chen
48
0
0
09 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
209
1
0
26 Mar 2025
Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion
Chang Chen
Hany Hamed
Doojin Baek
Taegu Kang
Yoshua Bengio
Sungjin Ahn
117
1
0
25 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
193
8
0
24 Mar 2025
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean
Evangelos Chataroulas
Jordan Terry
Isaac Woungang
Nariman Farsad
Pablo Samuel Castro
LRM
156
1
0
07 Mar 2025
Knowledge Retention for Continual Model-Based Reinforcement Learning
Yixiang Sun
Haotian Fu
M. L. Littman
George Konidaris
OffRL
CLL
VLM
114
0
0
06 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
128
2
0
06 Mar 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
135
4
0
26 Feb 2025
GLAM: Global-Local Variation Awareness in Mamba-based World Model
Qian He
Wenqi Liang
Chunhui Hao
Gan Sun
Jiandong Tian
135
0
0
21 Jan 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
127
0
0
17 Jan 2025
Highway Graph to Accelerate Reinforcement Learning
Zidu Yin
Zhen Zhang
Dong Gong
Stefano V. Albrecht
J. Q. Shi
OffRL
75
0
0
08 Jan 2025
Contrastive Representation for Interactive Recommendation
Jingyu Li
Zhiyong Feng
Dongxiao He
Hongqi Chen
Qinghang Gao
Guoli Wu
83
0
0
24 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
221
0
0
14 Dec 2024
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
Miles Hutson
Isaac Kauvar
Nick Haber
159
0
0
08 Dec 2024
World Models: The Safety Perspective
Zifan Zeng
Chongzhe Zhang
Feng Liu
Joseph Sifakis
Qunli Zhang
Shiming Liu
Peng Wang
KELM
LLMAG
86
2
0
12 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
196
14
0
08 Nov 2024
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
68
0
0
31 Oct 2024
EconoJax: A Fast & Scalable Economic Simulation in Jax
Koen Ponse
Aske Plaat
Niki van Stein
Thomas M. Moerland
94
1
0
29 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
150
7
0
25 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
116
0
0
22 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Yue Yang
81
5
0
18 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
77
0
0
17 Oct 2024
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang
Ivana Dusparic
Yucheng Shi
Ke Zhang
Vinny Cahill
Mamba
466
1
0
11 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
247
0
0
10 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
199
0
0
07 Oct 2024
Compositional Diffusion Models for Powered Descent Trajectory Generation with Flexible Constraints
Julia Briden
Yilun Du
Enrico M. Zucchelli
Richard Linares
77
0
0
05 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
112
3
0
03 Oct 2024
Focus On What Matters: Separated Models For Visual-Based RL Generalization
Di Zhang
Bowen Lv
Hai Zhang
Feifan Yang
Junqiao Zhao
Hang Yu
Chang Huang
Hongtu Zhou
Chen Ye
Changjun Jiang
92
3
0
29 Sep 2024
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models
Viet Dung Nguyen
Zhizhuo Yang
Christopher L. Buckley
Alexander Ororbia
93
4
0
21 Sep 2024
One-shot World Models Using a Transformer Trained on a Synthetic Prior
Fabio Ferreira
Moreno Schlageter
Raghu Rajan
André Biedenkapp
Frank Hutter
94
0
0
21 Sep 2024
1
2
3
4
...
9
10
11
Next