2111.00210
Cited By
Mastering Atari Games with Limited Data
30 October 2021
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
Papers citing
"Mastering Atari Games with Limited Data"
50 / 163 papers shown
Title
Residual Q-Learning: Offline and Online Policy Customization without Value
Chenran Li
Chen Tang
Haruki Nishimura
Jean Mercat
Masayoshi Tomizuka
Wei Zhan
OffRL
51
6
0
15 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
Agents Explore the Environment Beyond Good Actions to Improve Their Model for Better Decisions
Matthias Unverzagt
LLMAG
22
0
0
06 Jun 2023
Model-Based Reinforcement Learning with Multi-Task Offline Pretraining
Minting Pan
Yitao Zheng
Yunbo Wang
Xiaokang Yang
OffRL
32
0
0
06 Jun 2023
MA2CL: Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning
Haolin Song
Ming Feng
Wen-gang Zhou
Houqiang Li
OffRL
25
6
0
03 Jun 2023
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
34
9
0
01 Jun 2023
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
33
4
0
01 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
85
0
30 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
36
25
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
Reinforcement Learning with Partial Parametric Model Knowledge
Shuyuan Wang
Philip D. Loewen
Nathan P. Lawrence
M. Forbes
R. Bhushan Gopaluni
KELM
21
0
0
26 Apr 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
50
275
0
24 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
13
0
0
14 Apr 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
35
1
0
23 Mar 2023
Transformer Models for Type Inference in the Simply Typed Lambda Calculus: A Case Study in Deep Learning for Code
Brando Miranda
Avraham Shinnar
V. Pestun
B. Trager
22
3
0
15 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
27
71
0
13 Mar 2023
Real-time scheduling of renewable power systems through planning-based reinforcement learning
Shao-Wei Liu
Jinbo Liu
Weirui Ye
Nan Yang
Guanglu Zhang
...
C. Kang
Qirong Jiang
Xuri Song
Fangchun Di
Yang Gao
47
4
0
09 Mar 2023
Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL
Sébastien M. R. Arnold
Fei Sha
SSL
21
0
0
12 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Gal Dalal
Assaf Hallak
Gugan Thoppe
Shie Mannor
Gal Chechik
34
3
0
30 Jan 2023
Hierarchical Imitation Learning with Vector Quantized Models
Kalle Kujanpää
Joni Pajarinen
Alexander Ilin
24
12
0
30 Jan 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
38
559
0
10 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
30
25
0
29 Dec 2022
Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction
Kaiyuan Yang
Houjing Huang
Olafs Vandans
A. Murali
Fujia Tian
R. Yap
Liang Dai
22
10
0
27 Nov 2022
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
25
5
0
23 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
41
0
0
23 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
38
4
0
12 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
28
12
0
04 Nov 2022
Will we run out of data? Limits of LLM scaling based on human-generated data
Pablo Villalobos
A. Ho
J. Sevilla
T. Besiroglu
Lennart Heim
Marius Hobbhahn
ALM
49
114
0
26 Oct 2022
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions
Weirui Ye
Pieter Abbeel
Yang Gao
46
5
0
23 Oct 2022
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu
Tom Zahavy
Volodymyr Mnih
Satinder Singh
SSL
43
7
0
19 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Zhuowen Tu
OffRL
36
16
0
19 Oct 2022
Transformers Learn Shortcuts to Automata
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
OffRL
LRM
48
158
0
19 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
33
21
0
18 Oct 2022
Bridging the Gap between Artificial Intelligence and Artificial General Intelligence: A Ten Commandment Framework for Human-Like Intelligence
Ananta Nair
F. Kashani
34
2
0
17 Oct 2022
Visual Reinforcement Learning with Self-Supervised 3D Representations
Yanjie Ze
Nicklas Hansen
Yinbo Chen
Mohit Jain
Xiaolong Wang
SSL
32
49
0
13 Oct 2022
Continuous Monte Carlo Graph Search
Kalle Kujanpää
Amin Babadi
Yi Zhao
Arno Solin
Alexander Ilin
Joni Pajarinen
LRM
183
2
0
04 Oct 2022
Mastering Spatial Graph Prediction of Road Networks
Sotiris Anagnostidis
Aurelien Lucchi
Thomas Hofmann
GNN
27
1
0
03 Oct 2022
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
52
28
0
15 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
27
1
0
07 Sep 2022
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
François Fleuret
VLM
OffRL
19
163
0
01 Sep 2022
Light-weight probing of unsupervised representations for Reinforcement Learning
Wancong Zhang
Anthony GX-Chen
Vlad Sobal
Yann LeCun
Nicolas Carion
SSL
OffRL
46
13
0
25 Aug 2022
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
37
3
0
24 Aug 2022
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktäschel
Edward Grefenstette
Yuandong Tian
OffRL
24
37
0
22 Aug 2022
Towards Situation Awareness and Attention Guidance in a Multiplayer Environment using Augmented Reality and Carcassonne
D. Kadish
Arezoo Sarkheyli-Hägele
J. Font
D. Niehorster
Thomas Pederson
26
2
0
18 Aug 2022
The Curse of Low Task Diversity: On the Failure of Transfer Learning to Outperform MAML and Their Empirical Equivalence
Brando Miranda
P. Yu
Yu-xiong Wang
Oluwasanmi Koyejo
39
10
0
02 Aug 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
46
24
0
17 Jul 2022
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
93
147
0
28 Jun 2022
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Yang Yue
Bingyi Kang
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
38
13
0
25 Jun 2022
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
40
8
0
09 Jun 2022