Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.11052
Cited By
On Bonus-Based Exploration Methods in the Arcade Learning Environment
22 September 2021
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On Bonus-Based Exploration Methods in the Arcade Learning Environment"
40 / 40 papers shown
Title
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
14
0
0
16 May 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
103
2
0
24 Mar 2025
β
\beta
β
-DQN: Improving Deep Q-Learning By Evolving the Behavior
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
43
2
0
03 Jan 2025
Deterministic Exploration via Stationary Bellman Error Maximization
Sebastian Griesbach
Carlo DÉramo
35
0
0
31 Oct 2024
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
38
0
0
31 Oct 2024
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
Yixin Tan
Yu Yang
Qingfeng Lan
Jianfeng Lu
A. Rupam Mahmood
Doina Precup
Pan Xu
37
4
0
18 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
40
0
0
29 May 2024
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
34
14
0
05 Oct 2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Martin Klissarov
P. DÓro
Shagun Sodhani
Roberta Raileanu
Pierre-Luc Bacon
Pascal Vincent
Amy Zhang
Mikael Henaff
LRM
LLMAG
37
55
0
29 Sep 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
Wenke Huang
Filippos Christianos
Zhibin Li
44
8
0
28 Sep 2023
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
34
20
0
08 Jun 2023
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
Sam Lobel
Akhil Bagaria
George Konidaris
34
16
0
05 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
30
20
0
29 May 2023
Simple Noisy Environment Augmentation for Reinforcement Learning
Raad Khraishi
Ramin Okhrati
OffRL
21
1
0
04 May 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Minchan Jeong
Se-Young Yun
38
1
0
03 Mar 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
26
17
0
15 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
16
2
0
09 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
52
6
0
24 Jan 2023
Cell-Free Latent Go-Explore
Quentin Gallouedec
Emmanuel Dellandrea
19
1
0
31 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Shuang Qiu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSL
OffRL
26
32
0
29 Jul 2022
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
29
9
0
23 May 2022
Multi-Stage Episodic Control for Strategic Exploration in Text Games
Jens Tuyls
Shunyu Yao
Sham Kakade
Karthik Narasimhan
38
25
0
04 Jan 2022
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
28
12
0
19 Nov 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration
Chenjia Bai
Lingxiao Wang
Lei Han
Animesh Garg
Jianye Hao
Peng Liu
Zhaoran Wang
32
29
0
20 Oct 2021
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang
Wenze Chen
Longfei Zhang
Shizhen Xu
Ziyang Li
Fengming Zhu
Deheng Ye
Tingling Chen
Jun Zhu
OffRL
45
25
0
09 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
38
93
0
14 Sep 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Iou-Jen Liu
Unnat Jain
Raymond A. Yeh
Alex Schwing
42
104
0
23 Jul 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRL
OnRL
44
162
0
09 Jun 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
21
38
0
13 May 2021
State-Aware Variational Thompson Sampling for Deep Q-Networks
Siddharth Aravindan
W. Lee
22
6
0
07 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Evaluating Agents without Rewards
Brendon Matusch Jimmy Ba
Jimmy Ba
Danijar Hafner
32
12
0
21 Dec 2020
Perturbation-based exploration methods in deep reinforcement learning
Sneha Aenugu
8
0
0
10 Nov 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
53
819
0
05 Oct 2020
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
30
43
0
28 Sep 2020
Show me the Way: Intrinsic Motivation from Demonstrations
Léonard Hussenot
Robert Dadashi
M. Geist
Olivier Pietquin
22
9
0
23 Jun 2020
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
33
0
02 Jun 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
351
0
27 Apr 2020
Reward Prediction Error as an Exploration Objective in Deep RL
Riley Simmons-Edler
Ben Eisner
Daniel Yang
Anthony Bisulco
E. Mitchell
Sebastian Seung
Daniel D. Lee
31
5
0
19 Jun 2019
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
272
2,553
0
25 Jan 2016
1