Agent57: Outperforming the Atari Human Benchmark

30 March 2020

Adria Puigdomenech Badia

Bilal Piot

Papers citing "Agent57: Outperforming the Atari Human Benchmark"

50 / 106 papers shown

Title
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning Durgesh Kalwar Omkar Shelke Somjit Nath Hardik Meisheri H. Khadilkar 19 1 0 02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share? Alain Andres Esther Villar-Rodriguez Javier Del Ser 22 9 0 24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics Honghu Xue Benedikt Hein M. Bakr Georg Schildbach Bengt Abel Elmar Rueckert 16 15 0 23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 36 9 0 23 Feb 2022
Regularized Q-learning Han-Dong Lim Donghwan Lee 21 10 0 11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks Maja Franz Lucas Wolf Maniraman Periyasamy Christian Ufrecht Daniel D. Scherer Axel Plinge Christopher Mutschler Wolfgang Mauerer 30 29 0 10 Feb 2022
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs Leonardo Lucio Custode Giovanni Iacca 27 13 0 10 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error Scott Fujimoto D. Meger Doina Precup Ofir Nachum S. Gu 30 32 0 28 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems Jack Parker-Holder Raghunandan Rajan Xingyou Song André Biedenkapp Yingjie Miao ... Vu-Linh Nguyen Roberto Calandra Aleksandra Faust Frank Hutter Marius Lindauer AI4CE 33 100 0 11 Jan 2022
Variational Quantum Soft Actor-Critic Qingfeng Lan 22 20 0 20 Dec 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration Lu Zheng Jiarui Chen Jianhao Wang Jiamin He Yujing Hu Yingfeng Chen Changjie Fan Yang Gao Chongjie Zhang 16 82 0 22 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning Sindre Benjamin Remman Inga Strümke A. Lekkas CML 15 7 0 04 Nov 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations Sindre Benjamin Remman A. Lekkas 23 14 0 07 Oct 2021
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning C. Benjamins Theresa Eimer Frederik Schubert André Biedenkapp Bodo Rosenhahn Frank Hutter Marius Lindauer OffRL 41 23 0 05 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey Amjad Yousef Majid Serge Saaybi Tomas van Rietbergen Vincent François-Lavet R. V. Prasad Chris Verhoeven OffRL 60 54 0 28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research Mikayel Samvelyan Robert Kirk Vitaly Kurin Jack Parker-Holder Minqi Jiang Eric Hambro Fabio Petroni Heinrich Küttler Edward Grefenstette Tim Rocktaschel OffRL 238 89 0 27 Sep 2021
Integrating Deep Reinforcement and Supervised Learning to Expedite Indoor Mapping E. Zwecher Eran Iceland Sean R. Levy S. Hayoun O. Gal Ariel Barel 49 10 0 17 Sep 2021
Evolutionary Self-Replication as a Mechanism for Producing Artificial Intelligence Samuel Schmidgall Joe Hays 41 1 0 16 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns Prasanth Buddareddygari Travis Zhang Yezhou Yang Yi Ren AAML 37 13 0 16 Sep 2021
Benchmarking the Spectrum of Agent Capabilities Danijar Hafner ELM 33 127 0 14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 36 93 0 14 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods Bo Zhou Kejiao Li Hongsheng Zeng Fan Wang Hao Tian OffRL 27 1 0 08 Sep 2021
Variational Quantum Reinforcement Learning via Evolutionary Optimization Samuel Yen-Chi Chen Chih-Min Huang Chia-Wei Hsing H. Goan Y. Kao 38 82 0 01 Sep 2021
Interactive Machine Comprehension with Dynamic Knowledge Graphs Xingdi Yuan 34 3 0 31 Aug 2021
APS: Active Pretraining with Successor Features Hao Liu Pieter Abbeel 47 119 0 31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice Rishabh Agarwal Max Schwarzer Pablo Samuel Castro Aaron Courville Marc G. Bellemare OffRL 59 637 0 30 Aug 2021
When should agents explore? Miruna Pislar David Szepesvari Georg Ostrovski Diana Borsa Tom Schaul 40 22 0 26 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training T. Gulrez W. Mansell 16 0 0 04 Aug 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning Pedro Tsividis J. Loula Jake Burga Nathan Foss Andres Campero Thomas Pouncy S. Gershman J. Tenenbaum LM&Ro 24 43 0 27 Jul 2021
Reasoning-Modulated Representations Petar Velivcković Matko Bovsnjak Thomas Kipf Alexander Lerchner R. Hadsell Razvan Pascanu Charles Blundell OCL OOD SSL 18 15 0 19 Jul 2021
Explore and Control with Adversarial Surprise Arnaud Fickinger Natasha Jaques Samyak Parajuli Michael Chang Nicholas Rhinehart Glen Berseth Stuart J. Russell Sergey Levine 40 8 0 12 Jul 2021
Convergent and Efficient Deep Q Network Algorithm Zhikang T. Wang Masahito Ueda 27 12 0 29 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers Kazuki Irie Imanol Schlag Róbert Csordás Jürgen Schmidhuber 33 57 0 11 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning Changnan Xiao Haosen Shi Jiajun Fan Shihong Deng 18 5 0 01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning Oriol Corcoll Youssef Mohamed Raul Vicente 18 3 0 01 Jun 2021
A brain basis of dynamical intelligence for AI and computational neuroscience J. Monaco Kanaka Rajan Grace M. Hwang AI4CE 26 6 0 15 May 2021
Behavior From the Void: Unsupervised Active Pre-Training Hao Liu Pieter Abbeel VLM SSL 41 195 0 08 Mar 2021
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control Jacopo Panerati Hehui Zheng Siqi Zhou James Xu Amanda Prorok Angela P. Schoellig University of Toronto Institute for A Studies AI4CE 22 155 0 03 Mar 2021
Neural Production Systems: Learning Rule-Governed Visual Dynamics Anirudh Goyal Aniket Didolkar Nan Rosemary Ke Charles Blundell Philippe Beaudoin N. Heess Michael C. Mozer Yoshua Bengio OCL 50 82 0 02 Mar 2021
Learning to run a Power Network Challenge: a Retrospective Analysis Antoine Marot Benjamin Donnot Gabriel Dulac-Arnold A. Kelly A. O'Sullivan J. Viebahn M. Awad Isabelle M Guyon P. Panciatici Camilo Romero 14 77 0 02 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning Victor Campos Pablo Sprechmann Steven Hansen André Barreto Steven Kapturowski Alex Vitvitskyi Adria Puigdomenech Badia Charles Blundell OffRL OnRL 38 25 0 24 Feb 2021
Geometric Entropic Exploration Z. Guo M. G. Azar Alaa Saade S. Thakoor Bilal Piot Bernardo Avila-Pires Michal Valko Thomas Mesnard Tor Lattimore Rémi Munos 38 30 0 06 Jan 2021
Planning from Pixels in Atari with Learned Symbolic Representations Andrea Dittadi Frederik K. Drachmann Thomas Bolander 26 11 0 16 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions Tianjun Zhang Huazhe Xu Xiaolong Wang Yi Wu Kurt Keutzer Joseph E. Gonzalez Yuandong Tian 36 40 0 15 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments Daniel Dugas Juan I. Nieto Roland Siegwart Jen Jen Chung SSL 24 51 0 08 Dec 2020
Exploration-Exploitation in Multi-Agent Learning: Catastrophe Theory Meets Game Theory Stefanos Leonardos Georgios Piliouras 31 40 0 05 Dec 2020
Distributed Deep Reinforcement Learning: An Overview Mohammad Reza Samsami Hossein Alimadad OffRL 14 27 0 22 Nov 2020
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning Jonáš Kulhánek Erik Derner Robert Babuška 31 40 0 21 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models Cédric Colas B. Hejblum S. Rouillon R. Thiébaut Pierre-Yves Oudeyer Clément Moulin-Frier M. Prague 13 22 0 09 Oct 2020
Mastering Atari with Discrete World Models Danijar Hafner Timothy Lillicrap Mohammad Norouzi Jimmy Ba DRL 48 814 0 05 Oct 2020