ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.00210
  4. Cited By
Mastering Atari Games with Limited Data

Mastering Atari Games with Limited Data

30 October 2021
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
    VLM
ArXivPDFHTML

Papers citing "Mastering Atari Games with Limited Data"

50 / 163 papers shown
Title
Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities
Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities
Jinyang Wu
Chonghua Liao
Mingkuan Feng
Shuai Zhang
Zhengqi Wen
Pengpeng Shao
Huazhe Xu
Jianhua Tao
OffRL
LRM
12
0
0
21 May 2025
Hadamax Encoding: Elevating Performance in Model-Free Atari
Hadamax Encoding: Elevating Performance in Model-Free Atari
Jacob E. Kooi
Zhao Yang
Vincent François-Lavet
12
0
0
21 May 2025
TD-GRPC: Temporal Difference Learning with Group Relative Policy Constraint for Humanoid Locomotion
TD-GRPC: Temporal Difference Learning with Group Relative Policy Constraint for Humanoid Locomotion
Khang Nguyen
Khai Nguyen
An T. Le
Jan Peters
Manfred Huber
Ngo Anh Vien
Minh Nhat Vu
12
0
0
19 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
71
0
0
04 May 2025
Rulebook: bringing co-routines to reinforcement learning environments
Rulebook: bringing co-routines to reinforcement learning environments
Massimo Fioravanti
Samuele Pasini
Giovanni Agosta
33
0
0
28 Apr 2025
Trust-Region Twisted Policy Improvement
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
35
0
0
08 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
Xuguang Lan
40
0
0
05 Apr 2025
Bootstrapped Model Predictive Control
Bootstrapped Model Predictive Control
Yuhang Wang
Hanwei Guo
Sizhe Wang
Long Qian
Xuguang Lan
56
0
0
24 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
72
0
0
06 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
39
0
0
04 Mar 2025
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo
Ava Pettet
Aron Laszka
A. Dubey
Ayan Mukhopadhyay
OffRL
48
1
0
28 Feb 2025
Implicit Search via Discrete Diffusion: A Study on Chess
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye
Zhenyu Wu
Jiahui Gao
Zhiyong Wu
Xin Jiang
Zhiyu Li
Lingpeng Kong
DiffM
52
2
0
27 Feb 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
82
1
0
26 Feb 2025
OptionZero: Planning with Learned Options
OptionZero: Planning with Learned Options
Po-Wei Huang
Pei-Chiun Peng
Hung Guei
Ti-Rong Wu
57
1
0
23 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
226
0
0
17 Feb 2025
Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos
Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos
Weirui Ye
Fangchen Liu
Z. Ding
Yang Gao
Oleh Rybkin
Pieter Abbeel
VGen
OffRL
88
3
0
14 Feb 2025
DMWM: Dual-Mind World Model with Long-Term Imagination
DMWM: Dual-Mind World Model with Long-Term Imagination
Lingyi Wang
Rashed Shelim
Walid Saad
Naren Ramakrishnan
LRM
223
1
0
11 Feb 2025
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Ruihan Jin
Feihu Che
Zengqi Wen
J. Tao
LRM
68
8
0
04 Feb 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. DÓro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
46
3
0
28 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
96
2
0
22 Jan 2025
GLAM: Global-Local Variation Awareness in Mamba-based World Model
GLAM: Global-Local Variation Awareness in Mamba-based World Model
Qian He
Wenqi Liang
Chunhui Hao
Gan Sun
Jiandong Tian
66
0
0
21 Jan 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
79
0
0
17 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
73
0
0
03 Jan 2025
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree
  Search and Progress Reward Modeling
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling
Junyi Li
Hwee Tou Ng
LRM
97
1
0
19 Dec 2024
Digital Twin-Empowered Voltage Control for Power Systems
Digital Twin-Empowered Voltage Control for Power Systems
Jiachen Xu
Yushuai Li
Torben Bach Pedersen
Yuqiang He
Kim Guldstrand Larsen
Tianyi Li
67
0
0
09 Dec 2024
Policy-shaped prediction: avoiding distractions in model-based
  reinforcement learning
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
Miles Hutson
Isaac Kauvar
Nick Haber
75
0
0
08 Dec 2024
Decision Transformer vs. Decision Mamba: Analysing the Complexity of
  Sequential Decision Making in Atari Games
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
Ke Yan
75
0
0
01 Dec 2024
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context
  Learning via MCTS
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Feihu Che
Zengqi Wen
J. Tao
ReLM
LRM
115
10
0
27 Nov 2024
Interpreting the Learned Model in MuZero Planning
Interpreting the Learned Model in MuZero Planning
Hung Guei
Yan-Ru Ju
Wei-Yu Chen
Ti-Rong Wu
30
1
0
07 Nov 2024
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
119
2
0
23 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
41
0
0
22 Oct 2024
Reward-free World Models for Online Imitation Learning
Reward-free World Models for Online Imitation Learning
Shangzhe Li
Zhiao Huang
H. Su
OffRL
67
1
0
17 Oct 2024
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
35
2
0
15 Oct 2024
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary
  Space with Tree Search
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Chenglin Li
Qianglong Chen
Zhi Li
Feng Tao
Yicheng Li
Hao Chen
Fei Yu
Yin Zhang
SyDa
33
0
0
14 Oct 2024
Development and Validation of Heparin Dosing Policies Using an Offline
  Reinforcement Learning Algorithm
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm
Yooseok Lim
Inbeom Park
Sujee Lee
OffRL
28
0
0
24 Sep 2024
No Saved Kaleidosope: an 100% Jitted Neural Network Coding Language with
  Pythonic Syntax
No Saved Kaleidosope: an 100% Jitted Neural Network Coding Language with Pythonic Syntax
Augusto Seben da Rosa
Marlon Daniel Angeli
Jorge Aikes Junior
Alef Iury Ferreira
L. Gris
Anderson da Silva Soares
Arnaldo Candido Junior
Frederico Santos de Oliveira
Gabriel Trevisan Damke
Rafael Teixeira Sousa
33
0
0
17 Sep 2024
Enhancing Reinforcement Learning Through Guided Search
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
99
0
0
19 Aug 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang
Erdun Gao
Fan Feng
Xinyue Wang
Shikui Tu
Lei Xu
CML
OOD
TTA
43
1
0
30 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
34
0
0
08 Jul 2024
Combining AI Control Systems and Human Decision Support via Robustness
  and Criticality
Combining AI Control Systems and Human Decision Support via Robustness and Criticality
Walt Woods
Alexander Grushin
Simon Khan
Alvaro Velasquez
35
1
0
03 Jul 2024
Efficient World Models with Context-Aware Tokenization
Efficient World Models with Context-Aware Tokenization
Vincent Micheli
Eloi Alonso
François Fleuret
OffRL
VLM
34
6
0
27 Jun 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement
  Learning
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
34
1
0
21 Jun 2024
CoDreamer: Communication-Based Decentralised World Models
CoDreamer: Communication-Based Decentralised World Models
Edan Toledo
Amanda Prorok
48
0
0
19 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
51
1
0
15 Jun 2024
iQRL -- Implicitly Quantized Representations for Sample-efficient
  Reinforcement Learning
iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
Aidan Scannell
Kalle Kujanpää
Yi Zhao
Mohammadreza Nakhaei
Dieter Büchler
Joni Pajarinen
SSL
55
5
0
04 Jun 2024
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in
  Offline Reinforcement Learning
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning
Jiahang Cao
Qiang Zhang
Ziqing Wang
Jiaxu Wang
Hao Cheng
Yecheng Shao
Wen Zhao
Gang Han
Yijie Guo
Renjing Xu
Mamba
59
2
0
04 Jun 2024
Learning to Play Atari in a World of Tokens
Learning to Play Atari in a World of Tokens
Pranav Agarwal
Sheldon Andrews
Samira Ebrahimi Kahou
OffRL
38
1
0
03 Jun 2024
Value Improved Actor Critic Algorithms
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
Wendelin Bohmer
OffRL
33
0
0
03 Jun 2024
Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned
  Action Abstraction
Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction
Yunhyeok Kwak
Inwoo Hwang
Dooyoung Kim
Sanghack Lee
Byoung-Tak Zhang
38
0
0
02 Jun 2024
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Nicklas Hansen
V. JyothirS
Vlad Sobal
Yann LeCun
Xiaolong Wang
Hao Su
VGen
54
10
0
28 May 2024
1234
Next