ResearchTrend.AI
Noisy Networks for Exploration

30 June 2017
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
Alex Graves
Vlad Mnih
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
arXiv: 1706.10295

Papers citing "Noisy Networks for Exploration"

50 / 165 papers shown
Title
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
40
31
0
10 Jun 2022
Hub-Pathway: Transfer Learning from A Hub of Pre-trained Models
Yang Shu
Zhangjie Cao
Ziyang Zhang
Jianmin Wang
Mingsheng Long
22
4
0
08 Jun 2022
Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
Yoshinari Motokawa
T. Sugawara
30
2
0
19 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
34
12
0
02 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
31
324
0
02 May 2022
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks
Fan Wu
Linyi Li
Chejian Xu
Huan Zhang
B. Kailkhura
K. Kenthapadi
Ding Zhao
Bo Li
AAML
OffRL
34
34
0
16 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
30
1
0
02 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
46
11
0
01 Mar 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
18
0
0
14 Feb 2022
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs
Leonardo Lucio Custode
Giovanni Iacca
27
13
0
10 Feb 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
26
44
0
28 Jan 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei Xu
Haonan Yu
38
10
0
24 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
38
100
0
11 Jan 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
40
34
0
05 Jan 2022
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning
Wanggang Shen
Xun Huan
13
40
0
28 Oct 2021
Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement
Tianyu Shi
Dong Chen
Kaian Chen
Zhaojian Li
OffRL
36
31
0
13 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
24
33
0
05 Oct 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
24
58
0
22 Sep 2021
Carl-Lead: Lidar-based End-to-End Autonomous Driving with Contrastive Deep Reinforcement Learning
Peide Cai
Sukai Wang
Hengli Wang
Ming Liu
AI4TS
26
15
0
17 Sep 2021
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
13
4
0
16 Sep 2021
Evolutionary Self-Replication as a Mechanism for Producing Artificial Intelligence
Samuel Schmidgall
Joe Hays
43
1
0
16 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
41
93
0
14 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
37
80
0
01 Sep 2021
DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks
Peide Cai
Hengli Wang
Yuxiang Sun
Ming Liu
GNN
35
39
0
11 Aug 2021
A Survey on Deep Reinforcement Learning for Data Processing and Analytics
Qingpeng Cai
Can Cui
Yiyuan Xiong
Wei Wang
Zhongle Xie
Meihui Zhang
OffRL
21
29
0
10 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
24
0
0
04 Aug 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
44
135
0
01 Jul 2021
Zoo-Tuning: Adaptive Transfer from a Zoo of Models
Yang Shu
Zhi Kou
Zhangjie Cao
Jianmin Wang
Mingsheng Long
29
44
0
29 Jun 2021
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq
Qiwen Cui
V. Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin F. Yang
37
43
0
15 Jun 2021
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
42
15
0
09 Jun 2021
Denoising Noisy Neural Networks: A Bayesian Approach with Compensation
Yulin Shao
Soung Chang Liew
Deniz Gunduz
61
14
0
22 May 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
21
184
0
17 May 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction
Chenjia Bai
Lingxiao Wang
Lei Han
Jianye Hao
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
26
38
0
13 May 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
54
0
11 May 2021
A Deep Reinforcement Learning Approach for the Meal Delivery Problem
H. Jahanshahi
Aysun Bozanta
Mucahit Cevik
E. M. Kavuk
Ayse Tosun Misirli
Sibel B. Sonuc
Bilgin Kosucu
Ayse Basar
48
28
0
24 Apr 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
31
14
0
18 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
24
4
0
24 Dec 2020
Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
29
0
0
15 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
36
40
0
15 Dec 2020
Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks
Apostolos Avranas
Marios Kountouris
P. Ciblat
24
7
0
27 Nov 2020
Large-Scale Multi-Agent Deep FBSDEs
T. Chen
Ziyi Wang
Ioannis Exarchos
Evangelos A. Theodorou
37
4
0
21 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
20
105
0
20 Nov 2020
Proximal Policy Optimization via Enhanced Exploration Efficiency
Junwei Zhang
Zhenghao Zhang
Shuai Han
Shuai Lu
34
41
0
11 Nov 2020
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration
Priyank Agrawal
Jinglin Chen
Nan Jiang
30
18
0
23 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
34
55
0
15 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
53
823
0
05 Oct 2020