Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.06038
Cited By
Never Give Up: Learning Directed Exploration Strategies
14 February 2020
Adria Puigdomenech Badia
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Bilal Piot
Steven Kapturowski
O. Tieleman
Martín Arjovsky
Alexander Pritzel
Andew Bolt
Charles Blundell
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Never Give Up: Learning Directed Exploration Strategies"
50 / 76 papers shown
Title
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
103
2
0
24 Mar 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
58
0
0
30 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
261
0
0
22 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
73
0
0
03 Jan 2025
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
38
0
0
06 Nov 2024
Diversity-Aware Reinforcement Learning for de novo Drug Design
Hampus Gummesson Svensson
C. Tyrchan
Ola Engkvist
M. Chehreghani
34
2
0
14 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
43
1
0
07 Oct 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
46
6
0
06 Aug 2024
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm
Abdeslam Boularias
28
1
0
07 Jul 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
51
1
0
15 Jun 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
40
0
0
29 May 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
31
0
0
19 Apr 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
55
11
0
22 Feb 2024
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
Maxime Toquebiau
Nicolas Bredeche
F. Benamar
Jae-Yun Jun
36
1
0
06 Feb 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
46
9
0
18 Sep 2023
Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach
KB Vimal
Saketh Bachu
Tanmay Garg
Niveditha Lakshmi Narasimhan
Raghavan Konuru
Vineeth N. Balasubramanian
42
2
0
05 Sep 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
24
2
0
01 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
27
3
0
30 Apr 2023
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
Matej Pecháč
M. Chovanec
Igor Farkaš
32
3
0
22 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
45
13
0
13 Feb 2023
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin
S. Thiagarajan
N. Lazić
Nived Rajaraman
Botao Hao
Csaba Szepesvári
25
4
0
29 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
32
8
0
26 Jan 2023
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
47
5
0
05 Jan 2023
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
33
1
0
28 Nov 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
31
1
0
28 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
45
7
0
09 Nov 2022
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
27
0
0
31 Oct 2022
Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning
H. Khadilkar
Hardik Meisheri
OffRL
34
1
0
28 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
39
35
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
36
12
0
19 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
91
0
0
09 Sep 2022
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View
Han-Dong Lim
Dong-hwan Lee
32
1
0
25 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
33
22
0
24 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Nuclear Norm Maximization Based Curiosity-Driven Learning
Chao Chen
Zijian Gao
Kele Xu
Sen Yang
Yiying Li
Bo Ding
Dawei Feng
Huaimin Wang
187
5
0
21 May 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D. Tiapkin
Denis Belomestny
Eric Moulines
A. Naumov
S. Samsonov
Yunhao Tang
Michal Valko
Pierre Menard
31
17
0
16 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
28
324
0
02 May 2022
Automatically Learning Fallback Strategies with Model-Free Reinforcement Learning in Safety-Critical Driving Scenarios
Ugo Lecerf
Christelle Yemdji Tchassi
S. Aubert
Pietro Michiardi
26
0
0
11 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
41
67
0
08 Apr 2022
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning
Mingqi Yuan
Man-On Pun
Dong Wang
27
23
0
08 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
30
1
0
02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
36
29
0
10 Feb 2022
Chaining Value Functions for Off-Policy Learning
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
OffRL
25
2
0
17 Jan 2022
Physical Derivatives: Computing policy gradients by physical forward-propagation
Arash Mehrjou
Ashkan Soleymani
Stefan Bauer
Bernhard Schölkopf
38
0
0
15 Jan 2022
1
2
Next