ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.13350
  4. Cited By
Agent57: Outperforming the Atari Human Benchmark

Agent57: Outperforming the Atari Human Benchmark

30 March 2020
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
    OffRL
ArXivPDFHTML

Papers citing "Agent57: Outperforming the Atari Human Benchmark"

50 / 106 papers shown
Title
Follow your Nose: Using General Value Functions for Directed Exploration
  in Reinforcement Learning
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
19
1
0
02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Regularized Q-learning
Regularized Q-learning
Han-Dong Lim
Donghwan Lee
21
10
0
11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
30
29
0
10 Feb 2022
Interpretable pipelines with evolutionarily optimized modules for RL
  tasks with visual inputs
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs
Leonardo Lucio Custode
Giovanni Iacca
27
13
0
10 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement
  for Value Error
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Variational Quantum Soft Actor-Critic
Variational Quantum Soft Actor-Critic
Qingfeng Lan
22
20
0
20 Dec 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven
  Exploration
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation
  Controlled using Deep Reinforcement Learning
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
15
7
0
04 Nov 2021
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley
  Additive Explanations
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations
Sindre Benjamin Remman
A. Lekkas
23
14
0
07 Oct 2021
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
41
23
0
05 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
60
54
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
238
89
0
27 Sep 2021
Integrating Deep Reinforcement and Supervised Learning to Expedite
  Indoor Mapping
Integrating Deep Reinforcement and Supervised Learning to Expedite Indoor Mapping
E. Zwecher
Eran Iceland
Sean R. Levy
S. Hayoun
O. Gal
Ariel Barel
49
10
0
17 Sep 2021
Evolutionary Self-Replication as a Mechanism for Producing Artificial
  Intelligence
Evolutionary Self-Replication as a Mechanism for Producing Artificial Intelligence
Samuel Schmidgall
Joe Hays
41
1
0
16 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual
  Patterns
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns
Prasanth Buddareddygari
Travis Zhang
Yezhou Yang
Yi Ren
AAML
37
13
0
16 Sep 2021
Benchmarking the Spectrum of Agent Capabilities
Benchmarking the Spectrum of Agent Capabilities
Danijar Hafner
ELM
33
127
0
14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic
  Methods
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
27
1
0
08 Sep 2021
Variational Quantum Reinforcement Learning via Evolutionary Optimization
Variational Quantum Reinforcement Learning via Evolutionary Optimization
Samuel Yen-Chi Chen
Chih-Min Huang
Chia-Wei Hsing
H. Goan
Y. Kao
38
82
0
01 Sep 2021
Interactive Machine Comprehension with Dynamic Knowledge Graphs
Interactive Machine Comprehension with Dynamic Knowledge Graphs
Xingdi Yuan
34
3
0
31 Aug 2021
APS: Active Pretraining with Successor Features
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
47
119
0
31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
59
637
0
30 Aug 2021
When should agents explore?
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
40
22
0
26 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual
  Control Architecture Without Training
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
16
0
0
04 Aug 2021
Human-Level Reinforcement Learning through Theory-Based Modeling,
  Exploration, and Planning
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
24
43
0
27 Jul 2021
Reasoning-Modulated Representations
Reasoning-Modulated Representations
Petar Velivcković
Matko Bovsnjak
Thomas Kipf
Alexander Lerchner
R. Hadsell
Razvan Pascanu
Charles Blundell
OCL
OOD
SSL
18
15
0
19 Jul 2021
Explore and Control with Adversarial Surprise
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
40
8
0
12 Jul 2021
Convergent and Efficient Deep Q Network Algorithm
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
27
12
0
29 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
33
57
0
11 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement
  Learning
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
18
5
0
01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in
  reinforcement learning
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
18
3
0
01 Jun 2021
A brain basis of dynamical intelligence for AI and computational
  neuroscience
A brain basis of dynamical intelligence for AI and computational neuroscience
J. Monaco
Kanaka Rajan
Grace M. Hwang
AI4CE
26
6
0
15 May 2021
Behavior From the Void: Unsupervised Active Pre-Training
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
41
195
0
08 Mar 2021
Learning to Fly -- a Gym Environment with PyBullet Physics for
  Reinforcement Learning of Multi-agent Quadcopter Control
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control
Jacopo Panerati
Hehui Zheng
Siqi Zhou
James Xu
Amanda Prorok
Angela P. Schoellig University of Toronto Institute for A Studies
AI4CE
22
155
0
03 Mar 2021
Neural Production Systems: Learning Rule-Governed Visual Dynamics
Neural Production Systems: Learning Rule-Governed Visual Dynamics
Anirudh Goyal
Aniket Didolkar
Nan Rosemary Ke
Charles Blundell
Philippe Beaudoin
N. Heess
Michael C. Mozer
Yoshua Bengio
OCL
50
82
0
02 Mar 2021
Learning to run a Power Network Challenge: a Retrospective Analysis
Learning to run a Power Network Challenge: a Retrospective Analysis
Antoine Marot
Benjamin Donnot
Gabriel Dulac-Arnold
A. Kelly
A. O'Sullivan
J. Viebahn
M. Awad
Isabelle M Guyon
P. Panciatici
Camilo Romero
14
77
0
02 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
38
25
0
24 Feb 2021
Geometric Entropic Exploration
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
38
30
0
06 Jan 2021
Planning from Pixels in Atari with Learned Symbolic Representations
Planning from Pixels in Atari with Learned Symbolic Representations
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
26
11
0
16 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
36
40
0
15 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot
  Navigation in Dynamic Human Environments
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments
Daniel Dugas
Juan I. Nieto
Roland Siegwart
Jen Jen Chung
SSL
24
51
0
08 Dec 2020
Exploration-Exploitation in Multi-Agent Learning: Catastrophe Theory
  Meets Game Theory
Exploration-Exploitation in Multi-Agent Learning: Catastrophe Theory Meets Game Theory
Stefanos Leonardos
Georgios Piliouras
31
40
0
05 Dec 2020
Distributed Deep Reinforcement Learning: An Overview
Distributed Deep Reinforcement Learning: An Overview
Mohammad Reza Samsami
Hossein Alimadad
OffRL
14
27
0
22 Nov 2020
Visual Navigation in Real-World Indoor Environments Using End-to-End
  Deep Reinforcement Learning
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning
Jonáš Kulhánek
Erik Derner
Robert Babuška
31
40
0
21 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in
  Epidemiological Models
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
13
22
0
09 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
48
814
0
05 Oct 2020
Previous
123
Next