Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Multi-Agent Deep Reinforcement Learning for Liquidation Strategy Analysis
Wenhang Bao
Xiao-Yang Liu
AIFin
87
43
0
24 Jun 2019
Planning Robot Motion using Deep Visual Prediction
Meenakshi Sarkar
Prabhu Pradhan
Debasish Ghose
67
6
0
24 Jun 2019
Optimal Use of Experience in First Person Shooter Environments
Matthew Aitchison
37
2
0
24 Jun 2019
Ranking Policy Gradient
Kaixiang Lin
Jiayu Zhou
OffRL
67
7
0
24 Jun 2019
Learning Belief Representations for Imitation Learning in POMDPs
Tanmay Gangwani
Joel Lehman
Qiang Liu
Jian Peng
59
37
0
22 Jun 2019
Reinforcement Learning with Convex Constraints
Sobhan Miryoosefi
Kianté Brantley
Hal Daumé
Miroslav Dudík
Robert Schapire
61
93
0
21 Jun 2019
Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction
Fengda Zhu
Xiaojun Chang
Runhao Zeng
Mingkui Tan
CLL
52
3
0
21 Jun 2019
Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks
Roberto Martín-Martín
Michelle A. Lee
Rachel Gardner
Silvio Savarese
Jeannette Bohg
Animesh Garg
86
198
0
20 Jun 2019
Finding Needles in a Moving Haystack: Prioritizing Alerts with Adversarial Reinforcement Learning
Liang Tong
Aron Laszka
Chao Yan
Ning Zhang
Yevgeniy Vorobeychik
AAML
54
18
0
20 Jun 2019
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Kai Zhang
Alec Koppel
Haoqi Zhu
Tamer Basar
114
191
0
19 Jun 2019
Reward Prediction Error as an Exploration Objective in Deep RL
Riley Simmons-Edler
Ben Eisner
Daniel Yang
Anthony Bisulco
E. Mitchell
Sebastian Seung
Daniel D. Lee
67
5
0
19 Jun 2019
Directed Exploration for Reinforcement Learning
Z. Guo
Emma Brunskill
71
12
0
18 Jun 2019
Towards White-box Benchmarks for Algorithm Control
André Biedenkapp
H. Bozkurt
Frank Hutter
Marius Lindauer
179
7
0
18 Jun 2019
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning
Tadashi Kozuno
Dongqi Han
Kenji Doya
OffRL
50
2
0
18 Jun 2019
NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations
Qiaoyun Wu
Tianyi Zhou
Jun Wang
Kai Xu
162
15
0
17 Jun 2019
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
94
58
0
17 Jun 2019
A Survey of Optimization Methods from a Machine Learning Perspective
Shiliang Sun
Zehui Cao
Han Zhu
Jing Zhao
90
566
0
17 Jun 2019
Injecting Prior Knowledge for Transfer Learning into Reinforcement Learning Algorithms using Logic Tensor Networks
Samy Badreddine
Michael Spranger
OffRL
51
15
0
15 Jun 2019
Epistemic Risk-Sensitive Reinforcement Learning
Hannes Eriksson
Christos Dimitrakakis
127
29
0
14 Jun 2019
Multigrid Neural Memory
T. Huynh
Michael Maire
Matthew R. Walter
64
10
0
13 Jun 2019
Deep Reinforcement Learning for Cyber Security
Thanh Thi Nguyen
Vijay Janapa Reddi
OffRL
AI4CE
117
335
0
13 Jun 2019
Imitation Learning of Neural Spatio-Temporal Point Processes
Shixiang Zhu
Shuang Li
Zhigang Peng
Yao Xie
3DPC
AI4TS
46
6
0
13 Jun 2019
Conditioning of Reinforcement Learning Agents and its Policy Regularization Application
Arip Asadulaev
Igor Kuznetsov
Gideon Stein
Andrey Filchenkov
16
0
0
13 Jun 2019
Neural Graph Evolution: Towards Efficient Automatic Robot Design
Tingwu Wang
Yuhao Zhou
Sanja Fidler
Jimmy Ba
65
63
0
12 Jun 2019
Active Learning of Dynamics for Data-Driven Control Using Koopman Operators
Ian Abraham
Todd Murphey
71
166
0
12 Jun 2019
Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang
Shipra Agrawal
Yuri Faenza
AI4CE
111
174
0
11 Jun 2019
A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression
N. Savioli
GAN
23
2
0
11 Jun 2019
Causal Discovery with Reinforcement Learning
Shengyu Zhu
Ignavier Ng
Zhitang Chen
CML
114
242
0
11 Jun 2019
Learning Powerful Policies by Using Consistent Dynamics Model
Shagun Sodhani
Anirudh Goyal
T. Deleu
Yoshua Bengio
Sergey Levine
Jian Tang
OffRL
45
5
0
11 Jun 2019
Write, Execute, Assess: Program Synthesis with a REPL
Kevin Ellis
Maxwell Nye
Yewen Pu
Felix Sosa
J. Tenenbaum
Armando Solar-Lezama
109
169
0
09 Jun 2019
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran
Joshua Romoff
Nicolas Ballas
Joelle Pineau
Michael G. Rabbat
64
33
0
09 Jun 2019
Transfer Learning by Modeling a Distribution over Policies
Disha Shrivastava
Eeshan Gunesh Dhekane
Riashat Islam
OOD
OffRL
27
0
0
09 Jun 2019
Planning With Uncertain Specifications (PUnS)
Ankit J. Shah
Shen Li
J. Shah
87
25
0
07 Jun 2019
Clustered Reinforcement Learning
Xiao Ma
Shen-Yi Zhao
Wu-Jun Li
OffRL
64
6
0
06 Jun 2019
Ease-of-Teaching and Language Structure from Emergent Communication
Fushan Li
Michael Bowling
193
102
0
06 Jun 2019
Deep Reinforcement Learning for Multi-objective Optimization
Kaiwen Li
Tao Zhang
Rui Wang
AI4CE
88
273
0
06 Jun 2019
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Devansh Arpit
Victor Campos
Yoshua Bengio
83
59
0
05 Jun 2019
Global Optimality Guarantees For Policy Gradient Methods
Jalaj Bhandari
Daniel Russo
115
193
0
05 Jun 2019
Reinforcement Learning with Low-Complexity Liquid State Machines
Wachirawit Ponghiran
G. Srinivasan
Kaushik Roy
52
14
0
04 Jun 2019
Load Balancing for Ultra-Dense Networks: A Deep Reinforcement Learning Based Approach
Yue Xu
Wenjun Xu
Zhi Wang
Jiaru Lin
Shuguang Cui
41
70
0
03 Jun 2019
Neural Replicator Dynamics
Daniel Hennes
Dustin Morrill
Shayegan Omidshafiei
Rémi Munos
Julien Perolat
...
A. Gruslys
Jean-Baptiste Lespiau
Paavo Parmas
Edgar A. Duénez-Guzmán
K. Tuyls
74
16
0
01 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kai Zhang
Zhuoran Yang
Tamer Basar
111
128
0
31 May 2019
Entropy Minimization In Emergent Languages
Eugene Kharitonov
Rahma Chaabouni
Diane Bouchacourt
Marco Baroni
101
3
0
31 May 2019
Interval timing in deep reinforcement learning agents
B. Deverett
Ryan Faulkner
Meire Fortunato
Greg Wayne
Joel Z Leibo
47
14
0
31 May 2019
Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning
Yang Liu
Yunan Luo
Yuanyi Zhong
Xi Chen
Qiang Liu
Jian-wei Peng
74
36
0
31 May 2019
Combating the Compounding-Error Problem with a Multi-step Model
Kavosh Asadi
Dipendra Kumar Misra
Seungchan Kim
Michel L. Littman
LRM
83
55
0
30 May 2019
Effective Medical Test Suggestions Using Deep Reinforcement Learning
Yang Chen
Kai-Fu Tang
Yu-Shao Peng
Edward Y. Chang
OffRL
OOD
13
6
0
30 May 2019
Don't Forget Your Teacher: A Corrective Reinforcement Learning Framework
M. Nazari
Majid Jahani
L. Snyder
Martin Takáč
OffRL
OnRL
25
1
0
30 May 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation
Byung Hoon Ahn
Prannoy Pilligundla
H. Esmaeilzadeh
71
20
0
30 May 2019
On the Generalization Gap in Reparameterizable Reinforcement Learning
Huan Wang
Stephan Zheng
Caiming Xiong
R. Socher
117
41
0
29 May 2019
Previous
1
2
3
...
53
54
55
...
70
71
72
Next