1602.01783
Cited By
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing "Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Single-Agent Policy Tree Search With Guarantees
Laurent Orseau
Levi H. S. Lelis
Tor Lattimore
T. Weber
71
37
0
27 Nov 2018
Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning
Tianlin Li
Tao Sun
Rui Yang
Chenglong Li
Bin Luo
Jin Tang
30
2
0
27 Nov 2018
Genetic-Gated Networks for Deep Reinforcement Learning
Simyung Chang
John Yang
Jaeseok Choi
Nojun Kwak
AI4CE
51
17
0
26 Nov 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
84
8
0
26 Nov 2018
InstaNAS: Instance-aware Neural Architecture Search
A. Cheng
Chieh Hubert Lin
Da-Cheng Juan
Wei Wei
Min Sun
73
48
0
26 Nov 2018
A Model-Based Reinforcement Learning Approach for a Rare Disease Diagnostic Task
R. Besson
E. L. Pennec
S. Allassonnière
J. Stirnemann
E. Spaggiari
A. Neuraz
OffRL
29
1
0
25 Nov 2018
Object-oriented Targets for Visual Navigation using Rich Semantic Representations
Jean-Benoit Delbrouck
Stéphane Dupont
82
3
0
22 Nov 2018
Urban Driving with Multi-Objective Deep Reinforcement Learning
Changjian Li
Krzysztof Czarnecki
80
73
0
21 Nov 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
124
421
0
19 Nov 2018
Policy Optimization with Model-based Explorations
Feiyang Pan
Qingpeng Cai
Anxiang Zeng
C. Pan
Qing Da
Hua-Lin He
Qing He
Pingzhong Tang
78
11
0
18 Nov 2018
Improving Automatic Source Code Summarization via Deep Reinforcement Learning
Yao Wan
Zhou Zhao
Min Yang
Guandong Xu
Haochao Ying
Jian Wu
Philip S. Yu
86
392
0
17 Nov 2018
Parameter Sharing Reinforcement Learning Architecture for Multi Agent Driving Behaviors
Meha Kaushik
S. Phaniteja
K. M. Krishna
AI4CE
41
11
0
17 Nov 2018
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
131
398
0
15 Nov 2018
Orthogonal Policy Gradient and Autonomous Driving Application
Mincong Luo
Yin Tong
Jiachi Liu
38
2
0
15 Nov 2018
Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang
Yuxin Wu
Joelle Pineau
OffRL
OOD
69
69
0
14 Nov 2018
Blindfold Baselines for Embodied QA
Ankesh Anand
Eugene Belilovsky
Kyle Kastner
Hugo Larochelle
Aaron Courville
104
45
0
12 Nov 2018
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning
L. Nguyen
Zhou Yang
Jiazhen Zhu
Jia Ming Li
Fang Jin
44
23
0
12 Nov 2018
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
188
1,452
0
12 Nov 2018
Agent Embeddings: A Latent Representation for Pole-Balancing Networks
Oscar Chang
Robert Kwiatkowski
Siyuan Chen
Hod Lipson
147
6
0
12 Nov 2018
An initial attempt of combining visual selective attention with deep reinforcement learning
Liu Yuezhang
Ruohan Zhang
D. Ballard
80
20
0
11 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional β-VAE for Deep Transparent Reinforcement Learning
John Yang
Gyujeong Lee
Minsung Hyun
Simyung Chang
Nojun Kwak
65
3
0
11 Nov 2018
Fully Convolutional Network with Multi-Step Reinforcement Learning for Image Processing
Ryosuke Furuta
Naoto Inoue
T. Yamasaki
86
53
0
10 Nov 2018
Modular Architecture for StarCraft II with Deep Reinforcement Learning
Dennis Lee
Mizanur Rahman
Jeffrey O. Zhang
Huazhe Xu
Jerome McClendon
Pieter Abbeel
120
55
0
08 Nov 2018
Memory-based Deep Reinforcement Learning for Obstacle Avoidance in UAV with Limited Environment Knowledge
Hassan Dashtian
Sindhu Padakandla
M. Sahimi
79
210
0
08 Nov 2018
Correlation Filter Selection for Visual Tracking Using Reinforcement Learning
Yanchun Xie
Jimin Xiao
Hassan Jameel Asghar
Jeyarajan Thiyagalingam
Dali Kaafar
45
21
0
08 Nov 2018
Deep Reinforcement Learning for Green Security Games with Real-Time Information
Yufei Wang
Zheyuan Ryan Shi
Lantao Yu
Yi Wu
Rohit Singh
Lucas Joppa
Fei Fang
72
73
0
06 Nov 2018
QUOTA: The Quantile Option Architecture for Reinforcement Learning
Fengxiang Yang
Zhun Zhong
Shaozi Li
Sheng Lian
OffRL
106
30
0
05 Nov 2018
Managing engineering systems with large state and action spaces through deep reinforcement learning
Varun Chandrasekaran
K. Papakonstantinou
AI4CE
80
164
0
05 Nov 2018
Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control
Kendall Lowrey
Aravind Rajeswaran
Sham Kakade
G. Haro
Igor Mordatch
OffRL
79
229
0
05 Nov 2018
Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder
Junjie Zeng
C. Alberti
Yue Hu
Cong Hu
Quanjun Yin
28
4
0
05 Nov 2018
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi
Yijie Guo
Marcin Moczulski
Junhyuk Oh
Neal Wu
Mohammad Norouzi
Honglak Lee
80
73
0
05 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
127
56
0
03 Nov 2018
Deep Counterfactual Regret Minimization
Noam Brown
Adam Lerer
Sam Gross
Tuomas Sandholm
189
215
0
01 Nov 2018
Differentiable MPC for End-to-end Planning and Control
Brandon Amos
I. D. Rodriguez
Jacob Sacks
Byron Boots
J. Zico Kolter
110
378
0
31 Oct 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
163
1,347
0
30 Oct 2018
Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning
Mahammad Humayoo
Xueqi Cheng
BDL
OffRL
47
5
0
30 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
Basel Alomair
OffRL
128
239
0
29 Oct 2018
Watch the Unobserved: A Simple Approach to Parallelizing Monte Carlo Tree Search
Hoang Trung-Dung
Jianshu Chen
Mingze Yu
Yu Zhai
Xuewen Zhou
Ji Liu
80
33
0
28 Oct 2018
Stability-certified reinforcement learning: A control-theoretic perspective
Ming Jin
Javad Lavaei
57
87
0
26 Oct 2018
Deep Intrinsically Motivated Continuous Actor-Critic for Efficient Robotic Visuomotor Skill Learning
Muhammad Burhan Hafez
C. Weber
Matthias Kerzel
S. Wermter
49
22
0
26 Oct 2018
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
204
130
0
26 Oct 2018
Differential Variable Speed Limits Control for Freeway Recurrent Bottlenecks via Deep Reinforcement learning
Yuankai Wu
Huachun Tan
B. Ran
AI4CE
43
17
0
25 Oct 2018
Reconciling λ-Returns with Experience Replay
Brett Daley
Chris Amato
64
4
0
23 Oct 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning
Vahid Behzadan
Arslan Munir
88
27
0
23 Oct 2018
A general learning system based on neuron bursting and tonic firing
H. Lui
25
0
0
22 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
85
149
0
21 Oct 2018
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
Natasha Jaques
Angeliki Lazaridou
Edward Hughes
Çağlar Gülçehre
Pedro A. Ortega
D. Strouse
Joel Z Leibo
Nando de Freitas
114
57
0
19 Oct 2018
Fast deep reinforcement learning using online adjustments from the past
Steven Hansen
Pablo Sprechmann
Alexander Pritzel
André Barreto
Charles Blundell
TTA
OffRL
OnRL
82
43
0
18 Oct 2018
Finding the best design parameters for optical nanostructures using reinforcement learning
Iman Sajedian
Trevon Badloe
J. Rho
29
12
0
18 Oct 2018
Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
Nguyen Cong Luong
D. Hoang
Shimin Gong
Dusit Niyato
Ping Wang
Ying-Chang Liang
Dong In Kim
OffRL
110
1,447
0
18 Oct 2018