ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.12894
  4. Cited By
Exploration by Random Network Distillation

Exploration by Random Network Distillation

30 October 2018
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
ArXivPDFHTML

Papers citing "Exploration by Random Network Distillation"

50 / 290 papers shown
Title
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
238
89
0
27 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
18
58
0
22 Sep 2021
Active inference, Bayesian optimal design, and expected utility
Active inference, Bayesian optimal design, and expected utility
Noor Sajid
Lancelot Da Costa
Thomas Parr
Karl J. Friston
24
16
0
21 Sep 2021
Is Curiosity All You Need? On the Utility of Emergent Behaviours from
  Curious Exploration
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Oliver Groth
Markus Wulfmeier
Giulia Vezzani
Vibhavari Dasagi
Tim Hertweck
Roland Hafner
N. Heess
Martin Riedmiller
LRM
41
20
0
17 Sep 2021
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Roberto Bigazzi
Federico Landi
S. Cascianelli
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
OffRL
29
13
0
14 Sep 2021
Benchmarking the Spectrum of Agent Capabilities
Benchmarking the Spectrum of Agent Capabilities
Danijar Hafner
ELM
33
127
0
14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic
  Reinforcement Learning and Global Convergence of Policy Gradient Methods
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
28
6
0
13 Sep 2021
When should agents explore?
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
40
22
0
26 Aug 2021
Imitation Learning by Reinforcement Learning
Imitation Learning by Reinforcement Learning
K. Ciosek
22
18
0
10 Aug 2021
A Pragmatic Look at Deep Imitation Learning
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
29
9
0
04 Aug 2021
Strategically Efficient Exploration in Competitive Multi-agent
  Reinforcement Learning
Strategically Efficient Exploration in Competitive Multi-agent Reinforcement Learning
R. Loftin
Aadirupa Saha
Sam Devlin
Katja Hofmann
30
5
0
30 Jul 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Iou-Jen Liu
Unnat Jain
Raymond A. Yeh
A. Schwing
42
104
0
23 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
149
3
0
13 Jul 2021
Explore and Control with Adversarial Surprise
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
40
8
0
12 Jul 2021
Backprop-Free Reinforcement Learning with Active Neural Generative
  Coding
Backprop-Free Reinforcement Learning with Active Neural Generative Coding
Alexander Ororbia
A. Mali
41
15
0
10 Jul 2021
Robust Out-of-Distribution Detection on Deep Probabilistic Generative
  Models
Robust Out-of-Distribution Detection on Deep Probabilistic Generative Models
Jaemoo Choi
Changyeon Yoon
Jeongwoo Bae
Myung-joo Kang
OODD
30
4
0
15 Jun 2021
Offline Reinforcement Learning as Anti-Exploration
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
OffRL
51
51
0
11 Jun 2021
Learning Markov State Abstractions for Deep Reinforcement Learning
Learning Markov State Abstractions for Deep Reinforcement Learning
Cameron Allen
Neev Parikh
Omer Gottesman
George Konidaris
BDL
OffRL
31
36
0
08 Jun 2021
Exploration and preference satisfaction trade-off in reward-free
  learning
Exploration and preference satisfaction trade-off in reward-free learning
Noor Sajid
P. Tigas
Alexey Zakharov
Z. Fountas
Karl J. Friston
22
20
0
08 Jun 2021
Did I do that? Blame as a means to identify controlled effects in
  reinforcement learning
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
18
3
0
01 Jun 2021
A brain basis of dynamical intelligence for AI and computational
  neuroscience
A brain basis of dynamical intelligence for AI and computational neuroscience
J. Monaco
Kanaka Rajan
Grace M. Hwang
AI4CE
26
6
0
15 May 2021
Learning on a Budget via Teacher Imitation
Learning on a Budget via Teacher Imitation
Ercüment Ilhan
Jeremy Gow
Diego Perez-Liebana
OffRL
27
2
0
17 Apr 2021
Rapid Exploration for Open-World Navigation with Latent Goal Models
Rapid Exploration for Open-World Navigation with Latent Goal Models
Dhruv Shah
Benjamin Eysenbach
G. Kahn
Nicholas Rhinehart
Sergey Levine
29
70
0
12 Apr 2021
BR-NS: an Archive-less Approach to Novelty Search
BR-NS: an Archive-less Approach to Novelty Search
Achkan Salehi
Alexandre Coninx
Stéphane Doncieux
28
6
0
08 Apr 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
An Information-Theoretic Perspective on Credit Assignment in
  Reinforcement Learning
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
24
17
0
10 Mar 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
38
25
0
24 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
16
516
0
04 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Asymmetric self-play for automatic goal discovery in robotic
  manipulation
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
82
76
0
13 Jan 2021
Bridging In- and Out-of-distribution Samples for Their Better
  Discriminability
Bridging In- and Out-of-distribution Samples for Their Better Discriminability
Engkarat Techapanurak
Anh-Chuong Dang
Takayuki Okatani
OODD
25
3
0
07 Jan 2021
Geometric Entropic Exploration
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
38
30
0
06 Jan 2021
BeBold: Exploration Beyond the Boundary of Explored Regions
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
36
40
0
15 Dec 2020
Meta Automatic Curriculum Learning
Meta Automatic Curriculum Learning
Rémy Portelas
Clément Romac
Katja Hofmann
Pierre-Yves Oudeyer
35
8
0
16 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with
  Interaction
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
16
105
0
12 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via
  Reset-Games
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
33
33
0
10 Nov 2020
TAMPC: A Controller for Escaping Traps in Novel Environments
TAMPC: A Controller for Escaping Traps in Novel Environments
Sheng Zhong
Zhenyuan Zhang
Nima Fazeli
Dmitry Berenson
28
7
0
23 Oct 2020
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced
  Reinforcement Learning
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning
Shen Ren
Qianxiao Li
Liye Zhang
Zheng Qin
Bo Yang
26
0
0
22 Oct 2020
Online Safety Assurance for Deep Reinforcement Learning
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
36
5
0
07 Oct 2020
Latent World Models For Intrinsically Motivated Exploration
Latent World Models For Intrinsically Motivated Exploration
Aleksandr Ermolov
N. Sebe
25
25
0
05 Oct 2020
Novelty Search in Representational Space for Sample Efficient
  Exploration
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
30
43
0
28 Sep 2020
Robust and Generalizable Visual Representation Learning via Random
  Convolutions
Robust and Generalizable Visual Representation Learning via Random Convolutions
Zhenlin Xu
Deyi Liu
Junlin Yang
Colin Raffel
Marc Niethammer
OOD
AAML
53
190
0
25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
25
4
0
24 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward
  Transfer
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
27
5
0
12 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State
  Entropy Estimate
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
11
19
0
09 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control
Information Theoretic Regret Bounds for Online Nonlinear Control
Sham Kakade
A. Krishnamurthy
Kendall Lowrey
Motoya Ohnishi
Wen Sun
31
117
0
22 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
38
125
0
22 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
27
52
0
16 Jun 2020
Previous
123456
Next