On Bonus-Based Exploration Methods in the Arcade Learning Environment

22 September 2021

Aaron Courville

Papers citing "On Bonus-Based Exploration Methods in the Arcade Learning Environment"

40 / 40 papers shown

Title
Exploration by Random Distribution Distillation Zhirui Fang Kai Yang Jian Tao Jiafei Lyu Lusong Li Li Shen Xiu Li 14 0 0 16 May 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning Yongshuai Liu Xin Liu GAN 103 2 0 24 Mar 2025
$\beta$ -DQN: Improving Deep Q-Learning By Evolving the Behavior Hongming Zhang Fengshuo Bai Chenjun Xiao Chao Gao Bo Xu Martin Müller OffRL 43 2 0 03 Jan 2025
Deterministic Exploration via Stationary Bellman Error Maximization Sebastian Griesbach Carlo DÉramo 35 0 0 31 Oct 2024
CALE: Continuous Arcade Learning Environment Jesse Farebrother Pablo Samuel Castro ELM 38 0 0 31 Oct 2024
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling Haque Ishfaq Yixin Tan Yu Yang Qingfeng Lan Jianfeng Lu A. Rupam Mahmood Doina Precup Pan Xu 37 4 0 18 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning Mingqi Yuan Roger Creus Castanyer Bo Li Xin Jin Glen Berseth Wenjun Zeng 40 0 0 29 May 2024
Small batch deep reinforcement learning J. Obando-Ceron Marc G. Bellemare Pablo Samuel Castro VLM 34 14 0 05 Oct 2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov P. DÓro Shagun Sodhani Roberta Raileanu Pierre-Luc Bacon Pascal Vincent Amy Zhang Mikael Henaff LRM LLMAG 37 55 0 29 Sep 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks Wenke Huang Filippos Christianos Zhibin Li 44 8 0 28 Sep 2023
On the Importance of Exploration for Generalization in Reinforcement Learning Yiding Jiang J. Zico Kolter Roberta Raileanu UQCV OffRL 34 20 0 08 Jun 2023
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning Sam Lobel Akhil Bagaria George Konidaris 34 16 0 05 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo Haque Ishfaq Qingfeng Lan Pan Xu A. R. Mahmood Doina Precup Anima Anandkumar Kamyar Azizzadenesheli BDL OffRL 30 20 0 29 May 2023
Simple Noisy Environment Augmentation for Reinforcement Learning Raad Khraishi Ramin Okhrati OffRL 21 1 0 04 May 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning Ji-Yun Oh Joonkee Kim Minchan Jeong Se-Young Yun 38 1 0 03 Mar 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play Fanqing Lin Shiyu Huang Tim Pearce Wenze Chen Weijuan Tu 26 17 0 15 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals Akhil Bagaria Ray Jiang Ramana Kumar Tom Schaul LRM 16 2 0 09 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments Yash Chandak Shiv Shankar Nathaniel D. Bastian Bruno Castro da Silva Emma Brunskil Philip S. Thomas OffRL 52 6 0 24 Jan 2023
Cell-Free Latent Go-Explore Quentin Gallouedec Emmanuel Dellandrea 19 1 0 31 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Shuang Qiu Lingxiao Wang Chenjia Bai Zhuoran Yang Zhaoran Wang SSL OffRL 26 32 0 29 Jul 2022
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments Alain Andres Esther Villar-Rodriguez Javier Del Ser 29 9 0 23 May 2022
Multi-Stage Episodic Control for Strategic Exploration in Text Games Jens Tuyls Shunyu Yao Sham Kakade Karthik Narasimhan 38 25 0 04 Jan 2022
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari Dominik Schmidt Thomas Schmied OffRL 28 12 0 19 Nov 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration Chenjia Bai Lingxiao Wang Lei Han Animesh Garg Jianye Hao Peng Liu Zhaoran Wang 32 29 0 20 Oct 2021
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations Shiyu Huang Wenze Chen Longfei Zhang Shizhen Xu Ziyang Li Fengming Zhu Deheng Ye Tingling Chen Jun Zhu OffRL 45 25 0 09 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 38 93 0 14 Sep 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning Iou-Jen Liu Unnat Jain Raymond A. Yeh Alex Schwing 42 104 0 23 Jul 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning Tengyang Xie Nan Jiang Huan Wang Caiming Xiong Yu Bai OffRL OnRL 44 162 0 09 Jun 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction Chenjia Bai Lingxiao Wang Lei Han Jianye Hao Animesh Garg Peng Liu Zhaoran Wang OffRL 21 38 0 13 May 2021
State-Aware Variational Thompson Sampling for Deep Q-Networks Siddharth Aravindan W. Lee 22 6 0 07 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning William F. Whitney Michael Bloesch Jost Tobias Springenberg A. Abdolmaleki Kyunghyun Cho Martin Riedmiller OffRL 29 13 0 23 Jan 2021
Evaluating Agents without Rewards Brendon Matusch Jimmy Ba Jimmy Ba Danijar Hafner 32 12 0 21 Dec 2020
Perturbation-based exploration methods in deep reinforcement learning Sneha Aenugu 8 0 0 10 Nov 2020
Mastering Atari with Discrete World Models Danijar Hafner Timothy Lillicrap Mohammad Norouzi Jimmy Ba DRL 53 819 0 05 Oct 2020
Novelty Search in Representational Space for Sample Efficient Exploration Ruo Yu Tao Vincent François-Lavet Joelle Pineau 30 43 0 28 Sep 2020
Show me the Way: Intrinsic Motivation from Demonstrations Léonard Hussenot Robert Dadashi M. Geist Olivier Pietquin 22 9 0 23 Jun 2020
Temporally-Extended ε-Greedy Exploration Will Dabney Georg Ostrovski André Barreto 22 33 0 02 Jun 2020
First return, then explore Adrien Ecoffet Joost Huizinga Joel Lehman Kenneth O. Stanley Jeff Clune 47 351 0 27 Apr 2020
Reward Prediction Error as an Exploration Objective in Deep RL Riley Simmons-Edler Ben Eisner Daniel Yang Anthony Bisulco E. Mitchell Sebastian Seung Daniel D. Lee 31 5 0 19 Jun 2019
Pixel Recurrent Neural Networks Aaron van den Oord Nal Kalchbrenner Koray Kavukcuoglu SSeg GAN 272 2,553 0 25 Jan 2016