Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Extragradient with player sampling for faster Nash equilibrium finding
Carles Domingo Enrich
Samy Jelassi
Carles Domingo-Enrich
Damien Scieur
A. Mensch
Joan Bruna
18
1
0
29 May 2019
CopyCAT: Taking Control of Neural Policies with Constant Attacks
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
AAML
54
31
0
29 May 2019
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Vihan Jain
Gabriel Ilharco
Alexander Ku
Ashish Vaswani
Eugene Ie
Jason Baldridge
LM&Ro
92
182
0
29 May 2019
Recurrent Existence Determination Through Policy Optimization
Baoxiang Wang
45
1
0
29 May 2019
Snooping Attacks on Deep Reinforcement Learning
Matthew J. Inkawhich
Yiran Chen
Hai Helen Li
AAML
68
25
0
28 May 2019
Beyond Exponentially Discounted Sum: Automatic Learning of Return Function
Yufei Wang
Qiwei Ye
Tie-Yan Liu
OffRL
64
16
0
28 May 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning
Lu Chen
Zhi Chen
Bowen Tan
Sishan Long
Milica Gasic
Kai Yu
79
35
0
27 May 2019
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning
Caleb Chuck
Supawit Chockchowwat
S. Niekum
67
14
0
27 May 2019
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video
Miguel Jaques
Michael G. Burke
Timothy M. Hospedales
VGen
PINN
133
45
0
27 May 2019
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
35
2
0
27 May 2019
Policy Search by Target Distribution Learning for Continuous Control
Wei Shen
Yuanqi Li
Jian Li
60
6
0
27 May 2019
A Kernel Loss for Solving the Bellman Equation
Yihao Feng
Lihong Li
Qiang Liu
89
70
0
25 May 2019
Continual Reinforcement Learning in 3D Non-stationary Environments
Vincenzo Lomonaco
Karan Desai
Eugenio Culurciello
Davide Maltoni
OffRL
49
42
0
24 May 2019
Flow-based Intrinsic Curiosity Module
Hsuan-Kung Yang
Po-Han Chiang
Min-Fong Hong
Chun-Yi Lee
39
4
0
24 May 2019
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
Qi Cai
Zhuoran Yang
Jason D. Lee
Zhaoran Wang
66
32
0
24 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
76
70
0
23 May 2019
Recurrent Value Functions
Pierre Thodoroff
N. Anand
Lucas Caccia
Doina Precup
Joelle Pineau
36
5
0
23 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
69
83
0
21 May 2019
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
61
15
0
18 May 2019
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation
Daniel Gordon
Abhishek Kadian
Devi Parikh
Judy Hoffman
Dhruv Batra
90
75
0
18 May 2019
Optimizing Sequential Medical Treatments with Auto-Encoding Heuristic Search in POMDPs
Luchen Li
Matthieu Komorowski
Aldo A. Faisal
OffRL
139
13
0
17 May 2019
Leveraging exploration in off-policy algorithms via normalizing flows
Bogdan Mazoure
T. Doan
A. Durand
R. Devon Hjelm
Joelle Pineau
OnRL
70
62
0
16 May 2019
Learning and Exploiting Multiple Subgoals for Fast Exploration in Hierarchical Reinforcement Learning
Libo Xing
18
4
0
13 May 2019
Learning Phase Competition for Traffic Signal Control
Guanjie Zheng
Yuanhao Xiong
Xinshi Zang
J. Feng
Hua Wei
Huichu Zhang
Yong Li
Kai Xu
Z. Li
86
230
0
12 May 2019
Graph Attention Memory for Visual Navigation
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Bin Wang
Wulong Liu
Rasul Tutunov
Jun Wang
65
16
0
11 May 2019
Optimizing Routerless Network-on-Chip Designs: An Innovative Learning-Based Framework
Ting-Ru Lin
Drew Penney
Massoud Pedram
Lizhong Chen
3DV
55
8
0
11 May 2019
Do Autonomous Agents Benefit from Hearing?
Abraham Woubie
Anssi Kanervisto
Janne Karttunen
Ville Hautamaki
50
8
0
10 May 2019
Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning
A. Roibu
63
1
0
10 May 2019
Attention-based Deep Reinforcement Learning for Multi-view Environments
Elaheh Barati
Xuewen Chen
Z. Zhong
83
6
0
10 May 2019
Pretrain Soft Q-Learning with Imperfect Demonstrations
Xiaoqin Zhang
Yunfei Li
Huimin Ma
Xiong Luo
OffRL
OnRL
26
2
0
09 May 2019
Learning to Evolve
Jan Schuchardt
Vladimir Golkov
Zorah Lähner
22
0
0
08 May 2019
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
80
31
0
08 May 2019
Meta-learning of Sequential Strategies
Pedro A. Ortega
Jane X. Wang
Mark Rowland
Tim Genewein
Z. Kurth-Nelson
...
Yee Whye Teh
H. V. Hasselt
Nando de Freitas
M. Botvinick
Shane Legg
OffRL
123
101
0
08 May 2019
Toybox: A Suite of Environments for Experimental Evaluation of Deep Reinforcement Learning
Emma Tosch
Kaleigh Clary
John Foley
David D. Jensen
OffRL
49
9
0
07 May 2019
Continual and Multi-task Reinforcement Learning With Shared Episodic Memory
A. Sorokin
Andrey Kravchenko
KELM
CLL
39
5
0
07 May 2019
A Complementary Learning Systems Approach to Temporal Difference Learning
Sam Blakeman
D. Mareschal
62
42
0
07 May 2019
DeepRMSA: A Deep Reinforcement Learning Framework for Routing, Modulation and Spectrum Assignment in Elastic Optical Networks
Xiaoliang Chen
Baojia Li
R. Proietti
Hongbo Lu
Zuqing Zhu
S. Yoo
64
167
0
06 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
93
56
0
05 May 2019
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Mingzhang Yin
Yuguang Yue
Mingyuan Zhou
66
23
0
04 May 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
67
104
0
03 May 2019
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
66
100
0
02 May 2019
Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach
Marc Brittain
Peng Wei
DRL
52
36
0
02 May 2019
From Video Game to Real Robot: The Transfer between Action Spaces
Janne Karttunen
Anssi Kanervisto
Ville Kyrki
Ville Hautamaki
92
8
0
02 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
149
73
0
29 Apr 2019
Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications Outside Coverage
T. Şahin
R. Khalili
Mate Boban
A. Wolisz
OffRL
28
21
0
29 Apr 2019
Neural Logic Machines
Honghua Dong
Jiayuan Mao
Tian Lin
Chong-Jun Wang
Lihong Li
Denny Zhou
NAI
LRM
AI4CE
141
250
0
26 Apr 2019
Target-Based Temporal Difference Learning
Donghwan Lee
Niao He
OOD
83
32
0
24 Apr 2019
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
103
75
0
24 Apr 2019
Stochastic Lipschitz Q-Learning
Xu Zhu
64
4
0
24 Apr 2019
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
33
2
0
24 Apr 2019
Previous
1
2
3
...
54
55
56
...
70
71
72
Next