arXiv: 2003.13350
Agent57: Outperforming the Atari Human Benchmark
30 March 2020
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
Papers citing
"Agent57: Outperforming the Atari Human Benchmark"
50 / 106 papers shown
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
19
1
0
02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Regularized Q-learning
Han-Dong Lim
Donghwan Lee
21
10
0
11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
30
29
0
10 Feb 2022
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs
Leonardo Lucio Custode
Giovanni Iacca
27
13
0
10 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Variational Quantum Soft Actor-Critic
Qingfeng Lan
22
20
0
20 Dec 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
15
7
0
04 Nov 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations
Sindre Benjamin Remman
A. Lekkas
23
14
0
07 Oct 2021
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
41
23
0
05 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
60
54
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktäschel
OffRL
238
89
0
27 Sep 2021
Integrating Deep Reinforcement and Supervised Learning to Expedite Indoor Mapping
E. Zwecher
Eran Iceland
Sean R. Levy
S. Hayoun
O. Gal
Ariel Barel
49
10
0
17 Sep 2021
Evolutionary Self-Replication as a Mechanism for Producing Artificial Intelligence
Samuel Schmidgall
Joe Hays
41
1
0
16 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns
Prasanth Buddareddygari
Travis Zhang
Yezhou Yang
Yi Ren
AAML
37
13
0
16 Sep 2021
Benchmarking the Spectrum of Agent Capabilities
Danijar Hafner
ELM
33
127
0
14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
ADER: Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
27
1
0
08 Sep 2021
Variational Quantum Reinforcement Learning via Evolutionary Optimization
Samuel Yen-Chi Chen
Chih-Min Huang
Chia-Wei Hsing
H. Goan
Y. Kao
38
82
0
01 Sep 2021
Interactive Machine Comprehension with Dynamic Knowledge Graphs
Xingdi Yuan
34
3
0
31 Aug 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
47
119
0
31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
59
637
0
30 Aug 2021
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
40
22
0
26 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
16
0
0
04 Aug 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
24
43
0
27 Jul 2021
Reasoning-Modulated Representations
Petar Veličković
Matko Bošnjak
Thomas Kipf
Alexander Lerchner
R. Hadsell
Razvan Pascanu
Charles Blundell
OCL
OOD
SSL
18
15
0
19 Jul 2021
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
40
8
0
12 Jul 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
27
12
0
29 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
33
57
0
11 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
18
5
0
01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
18
3
0
01 Jun 2021
A brain basis of dynamical intelligence for AI and computational neuroscience
J. Monaco
Kanaka Rajan
Grace M. Hwang
AI4CE
26
6
0
15 May 2021
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
41
195
0
08 Mar 2021
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control
Jacopo Panerati
Hehui Zheng
Siqi Zhou
James Xu
Amanda Prorok
Angela P. Schoellig
AI4CE
22
155
0
03 Mar 2021
Neural Production Systems: Learning Rule-Governed Visual Dynamics
Anirudh Goyal
Aniket Didolkar
Nan Rosemary Ke
Charles Blundell
Philippe Beaudoin
N. Heess
Michael C. Mozer
Yoshua Bengio
OCL
50
82
0
02 Mar 2021
Learning to run a Power Network Challenge: a Retrospective Analysis
Antoine Marot
Benjamin Donnot
Gabriel Dulac-Arnold
A. Kelly
A. O'Sullivan
J. Viebahn
M. Awad
Isabelle M Guyon
P. Panciatici
Camilo Romero
14
77
0
02 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
38
25
0
24 Feb 2021
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
38
30
0
06 Jan 2021
Planning from Pixels in Atari with Learned Symbolic Representations
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
26
11
0
16 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
36
40
0
15 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments
Daniel Dugas
Juan I. Nieto
Roland Siegwart
Jen Jen Chung
SSL
24
51
0
08 Dec 2020
Exploration-Exploitation in Multi-Agent Learning: Catastrophe Theory Meets Game Theory
Stefanos Leonardos
Georgios Piliouras
31
40
0
05 Dec 2020
Distributed Deep Reinforcement Learning: An Overview
Mohammad Reza Samsami
Hossein Alimadad
OffRL
14
27
0
22 Nov 2020
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning
Jonáš Kulhánek
Erik Derner
Robert Babuška
31
40
0
21 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
13
22
0
09 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
48
814
0
05 Oct 2020