ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.04717
  4. Cited By
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
    OffRL
ArXivPDFHTML

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 152 papers shown
Title
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
31
5
0
05 Oct 2022
Query The Agent: Improving sample efficiency through epistemic
  uncertainty estimation
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
35
0
0
05 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing
  Plausible Novel States
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
26
3
0
01 Oct 2022
An information-theoretic perspective on intrinsic motivation in
  reinforcement learning: a survey
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
37
35
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in
  Reinforcement Learning
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
36
12
0
19 Sep 2022
Go-Explore Complex 3D Game Environments for Automated Reachability
  Testing
Go-Explore Complex 3D Game Environments for Automated Reachability Testing
Cong Lu
Raluca Georgescu
J. Verwey
27
7
0
01 Sep 2022
Annealed Training for Combinatorial Optimization on Graphs
Annealed Training for Combinatorial Optimization on Graphs
Haoran Sun
E. Guha
H. Dai
30
18
0
23 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Towards Understanding How Machines Can Learn Causal Overhypotheses
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CML
AI4CE
28
18
0
16 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on
  Exploration and Performance
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
41
17
0
08 Jun 2022
Off-Beat Multi-Agent Reinforcement Learning
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
R. Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
29
2
0
27 May 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement
  Learning
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
Xinran Liang
Katherine Shu
Kimin Lee
Pieter Abbeel
21
58
0
24 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically
  Simulated Characters
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters
Xue Bin Peng
Yunrong Guo
L. Halper
Sergey Levine
Sanja Fidler
28
15
0
04 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
26
324
0
02 May 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human
  Demonstrations at Scale
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
35
109
0
07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in
  Challenging Environments
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments
Yafei Hu
Junyi Geng
Chen Wang
John Keller
Sebastian Scherer
OffRL
28
15
0
07 Apr 2022
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects
Yujie Lu
Jianren Wang
Vikash Kumar
31
4
0
31 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
118
0
25 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration
  in Reinforcement Learning
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
24
1
0
02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and
  Computational Models
Learning Causal Overhypotheses through Exploration in Children and Computational Models
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
22
9
0
21 Feb 2022
Generative Adversarial Exploration for Reinforcement Learning
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
39
7
0
27 Jan 2022
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Zhiwei Jia
Kaixiang Lin
Yizhou Zhao
Qiaozi Gao
Govind Thattai
Gaurav Sukhatme
LM&Ro
31
15
0
24 Jan 2022
On the Unreasonable Efficiency of State Space Clustering in
  Personalization Tasks
On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Anton Dereventsov
R. Vatsavai
Clayton Webster
33
5
0
24 Dec 2021
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Simone Parisi
Victoria Dean
Deepak Pathak
Abhinav Gupta
LM&Ro
40
50
0
25 Nov 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven
  Exploration
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
Agent Spaces
Agent Spaces
John C. Raisbeck
M. W. Allen
Hakho Lee
30
1
0
11 Nov 2021
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash
Cyril Zhang
Surbhi Goel
A. Krishnamurthy
Sham Kakade
43
6
0
21 Oct 2021
Hierarchical Skills for Efficient Exploration
Hierarchical Skills for Efficient Exploration
Jonas Gehring
Gabriel Synnaeve
Andreas Krause
Nicolas Usunier
28
40
0
20 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
62
54
0
28 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
18
58
0
22 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic
  Methods
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
38
1
0
08 Sep 2021
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Iou-Jen Liu
Unnat Jain
Raymond A. Yeh
A. Schwing
42
104
0
23 Jul 2021
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement
  Learning Agents
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents
Grgur Kovač
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
ALM
27
6
0
02 Jul 2021
Offline Reinforcement Learning as Anti-Exploration
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
OffRL
51
51
0
11 Jun 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
Reinforcement Learning, Bit by Bit
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
30
70
0
06 Mar 2021
DEUP: Direct Epistemic Uncertainty Prediction
DEUP: Direct Epistemic Uncertainty Prediction
Salem Lahlou
Moksh Jain
Hadi Nekoei
V. Butoi
Paul Bertin
Jarrid Rector-Brooks
Maksym Korablyov
Yoshua Bengio
PER
UQLM
UQCV
UD
210
81
0
16 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning
  for Decentralized Traffic Signal Control
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control
Liwen Zhu
Peixi Peng
Zongqing Lu
Xiangqian Wang
Yonghong Tian
13
20
0
04 Jan 2021
Policy Manifold Search for Improving Diversity-based Neuroevolution
Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
27
0
0
15 Dec 2020
Model-based Reinforcement Learning for Continuous Control with Posterior
  Sampling
Model-based Reinforcement Learning for Continuous Control with Posterior Sampling
Ying Fan
Yifei Ming
27
17
0
20 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep
  Reinforcement Learning Research
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
20
105
0
20 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via
  Reset-Games
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
33
33
0
10 Nov 2020
TAMPC: A Controller for Escaping Traps in Novel Environments
TAMPC: A Controller for Escaping Traps in Novel Environments
Sheng Zhong
Zhenyuan Zhang
Nima Fazeli
Dmitry Berenson
28
7
0
23 Oct 2020
Previous
1234
Next