Exploration by Random Network Distillation

30 October 2018
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
arXiv:1810.12894

Papers citing "Exploration by Random Network Distillation"

50 / 290 papers shown
Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in Reinforcement Learning
Marius Lindegaard
Hjalmar Jacob Vinje
Odin Severinsen
30
2
0
06 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
30
41
0
03 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Igor Mordatch
Joel Z Leibo
26
2
0
02 Feb 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
22
3
0
02 Feb 2023
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin
S. Thiagarajan
N. Lazić
Nived Rajaraman
Botao Hao
Csaba Szepesvári
25
4
0
29 Jan 2023
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
L. Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
23
14
0
27 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
16
19
0
26 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
29
8
0
26 Jan 2023
A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots
Peixin Chang
Shuijing Liu
Tianchen Ji
Neeloy Chakraborty
Kaiwen Hong
Katherine Driggs-Campbell
51
3
0
23 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
44
18
0
05 Jan 2023
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
47
5
0
05 Jan 2023
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
37
14
0
24 Dec 2022
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu
Zheyuan Hu
Ria Doshi
Aaron Rovinsky
Vikash Kumar
Abhishek Gupta
Sergey Levine
32
19
0
19 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
28
13
0
01 Dec 2022
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have
Nadia M. Ady
R. Shariff
J. Günther
P. Pilarski
16
0
0
01 Dec 2022
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
30
1
0
28 Nov 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
28
1
0
28 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou
Xijun Li
Qingyu Qu
OffRL
27
1
0
15 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
36
24
0
14 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
45
7
0
09 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
26
4
0
09 Nov 2022
Reward Shaping Using Convolutional Neural Network
Hani Sami
Hadi Otrok
Jamal Bentahar
Azzam Mourad
Ernesto Damiani
29
3
0
30 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
57
9
0
23 Oct 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning
Andrew Zhao
Matthieu Lin
Yangguang Li
Yong Liu
Gao Huang
28
13
0
13 Oct 2022
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktäschel
OffRL
35
40
0
11 Oct 2022
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward
Zixian Ma
Rose E. Wang
Li Fei-Fei
Michael S. Bernstein
Ranjay Krishna
26
16
0
09 Oct 2022
Learning Social Navigation from Demonstrations with Conditional Neural Processes
Yigit Yildirim
Emre Ugur
24
8
0
07 Oct 2022
Exploration via Planning for Information about the Optimal Trajectory
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
W. Neiswanger
OffRL
27
6
0
06 Oct 2022
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
35
0
0
05 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
37
35
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
36
12
0
19 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
88
0
0
09 Sep 2022
Learning to Deceive in Multi-Agent Hidden Role Games
Matthew Aitchison
L. Benke
Penny Sweetser
OffRL
30
5
0
04 Sep 2022
Go-Explore Complex 3D Game Environments for Automated Reachability Testing
Cong Lu
Raluca Georgescu
J. Verwey
27
7
0
01 Sep 2022
Cell-Free Latent Go-Explore
Quentin Gallouedec
Emmanuel Dellandrea
14
1
0
31 Aug 2022
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Zijian Gao
Yiying Li
Kele Xu
Yuanzhao Zhai
Dawei Feng
Bo Ding
Xinjun Mao
Huaimin Wang
38
0
0
24 Aug 2022
Entropy Augmented Reinforcement Learning
Jianfei Ma
36
0
0
19 Aug 2022
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations
Xufeng Zhao
C. Weber
Muhammad Burhan Hafez
S. Wermter
25
8
0
04 Aug 2022
Uncertainty-aware Multi-modal Learning via Cross-modal Random Network Prediction
Hu Wang
Jianpeng Zhang
Yuanhong Chen
Congbo Ma
Jodie Avery
Louise Hull
G. Carneiro
UQCV
19
18
0
22 Jul 2022
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
34
4
0
15 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Towards Semantic Communication Protocols: A Probabilistic Logic Perspective
Sejin Seo
Jihong Park
Seung-Woo Ko
Jinho Choi
M. Bennis
Seong-Lyun Kim
30
22
0
08 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
31
22
0
24 Jun 2022