Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,592 papers shown
Title
Learning for Visual Navigation by Imagining the Success
M. Moghaddam
Ehsan Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton Van Den Hengel
66
3
0
28 Feb 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Michael Walton
Viliam Lisý
70
5
0
27 Feb 2021
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
Johan Bjorck
Xiangyu Chen
Christopher De Sa
Carla P. Gomes
Kilian Q. Weinberger
137
6
0
26 Feb 2021
Synthetic Returns for Long-Term Credit Assignment
David Raposo
Samuel Ritter
Adam Santoro
Greg Wayne
T. Weber
M. Botvinick
H. V. Hasselt
Francis Song
AI4TS
107
35
0
24 Feb 2021
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
68
5
0
24 Feb 2021
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism
Jenny Yang
Seongmin Hong
Joo-Young Kim
61
18
0
24 Feb 2021
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
OffRL
47
9
0
23 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
114
25
0
23 Feb 2021
Differentiable Logic Machines
Matthieu Zimmer
Xuening Feng
Claire Glanois
Zhaohui Jiang
Jianyi Zhang
Paul Weng
Li Dong
Hao Jianye
Liu Wulong
AI4CE
101
23
0
23 Feb 2021
Communication Efficient Parallel Reinforcement Learning
Mridul Agarwal
Bhargav Ganguly
Vaneet Aggarwal
77
11
0
22 Feb 2021
Delayed Rewards Calibration via Reward Empirical Sufficiency
Yixuan Liu
Hu Wang
Xiaowei Wang
Xiaoyue Sun
Liuyue Jiang
Minhui Xue
88
0
0
21 Feb 2021
How To Train Your HERON
Antoine Richard
Stéphanie Aravecchia
Thomas Schillaci
Matthieu Geist
C´edric Pradalier
41
3
0
20 Feb 2021
Decoupling Value and Policy for Generalization in Reinforcement Learning
Roberta Raileanu
Rob Fergus
DRL
OffRL
121
99
0
20 Feb 2021
On Proximal Policy Optimization's Heavy-tailed Gradients
Saurabh Garg
Joshua Zhanson
Emilio Parisotto
Adarsh Prasad
J. Zico Kolter
Zachary Chase Lipton
Sivaraman Balakrishnan
Ruslan Salakhutdinov
Pradeep Ravikumar
100
13
0
20 Feb 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
73
15
0
18 Feb 2021
State Entropy Maximization with Random Encoders for Efficient Exploration
Younggyo Seo
Lili Chen
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
119
129
0
18 Feb 2021
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse
P. Schramowski
Martin Mundt
Alejandro Molina
Kristian Kersting
141
15
0
18 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
102
74
0
18 Feb 2021
Learning Memory-Dependent Continuous Control from Demonstrations
Siqing Hou
Dongqi Han
Jun Tani
30
0
0
18 Feb 2021
Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in Indoor Graphics Scenes
Xinhan Di
Pengqian Yu
AI4CE
3DV
92
12
0
18 Feb 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
130
69
0
17 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
47
4
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
50
0
0
16 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems
Nan Wu
Yuan Xie
AI4TS
AI4CE
108
153
0
16 Feb 2021
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
Filippos Christianos
Georgios Papoudakis
Arrasy Rahman
Stefano V. Albrecht
94
121
0
15 Feb 2021
Tractable structured natural gradient descent using local parameterizations
Wu Lin
Frank Nielsen
Mohammad Emtiyaz Khan
Mark Schmidt
146
30
0
15 Feb 2021
Reinforcement Learning for IoT Security: A Comprehensive Survey
Aashma Uprety
D. Rawat
AAML
93
124
0
14 Feb 2021
Resilient Machine Learning for Networked Cyber Physical Systems: A Survey for Machine Learning Security to Securing Machine Learning for CPS
Felix O. Olowononi
D. Rawat
Chunmei Liu
95
138
0
14 Feb 2021
Interactive Learning from Activity Description
Khanh Nguyen
Dipendra Kumar Misra
Robert Schapire
Miroslav Dudík
Patrick Shafto
115
35
0
13 Feb 2021
Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Michiel A. Bakker
Richard Everett
Laura Weidinger
Iason Gabriel
William S. Isaac
Joel Z Leibo
Edward Hughes
69
5
0
13 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Piotr Kozakowski
Lukasz Kaiser
Henryk Michalewski
Afroz Mohiuddin
Katarzyna Kañska
OffRL
81
5
0
12 Feb 2021
Deep Reinforcement Agent for Scheduling in HPC
Yuping Fan
Z. Lan
T. Childers
Paul M. Rich
W. Allcock
M. Papka
57
37
0
11 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
144
42
0
10 Feb 2021
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision
Julien Scholz
C. Weber
Muhammad Burhan Hafez
S. Wermter
46
3
0
10 Feb 2021
Learning Equational Theorem Proving
Jelle Piepenbrock
Tom Heskes
Mikolávs Janota
Josef Urban
AIMat
LRM
38
4
0
10 Feb 2021
Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms
A. Mahyari
22
1
0
10 Feb 2021
Adaptive Pairwise Weights for Temporal Credit Assignment
Zeyu Zheng
Risto Vuorio
Richard L. Lewis
Satinder Singh
68
5
0
09 Feb 2021
Learning State Representations from Random Deep Action-conditional Predictions
Zeyu Zheng
Vivek Veeriah
Risto Vuorio
Richard L. Lewis
Satinder Singh
70
5
0
09 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
How to Stay Curious while Avoiding Noisy TVs using Aleatoric Uncertainty Estimation
Augustine N. Mavor-Parker
K. Young
Caswell Barry
Lewis D. Griffin
95
22
0
08 Feb 2021
Adversarially Guided Actor-Critic
Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Philippe Preux
Matthieu Geist
77
73
0
08 Feb 2021
Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks
Luca Marzari
Ameya Pore
Diego DallÁlba
G. Aragon-Camarasa
Alessandro Farinelli
Paolo Fiorini
87
29
0
08 Feb 2021
Concentrated Document Topic Model
Hao Lei
Ying Chen
31
1
0
06 Feb 2021
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Siyang Jiang
Seth Austin Harding
Haibin Wu
Shihua Liao
217
91
0
06 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios
Chengmin Zhou
Bingding Huang
Pasi Fränti
47
1
0
05 Feb 2021
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning
Julian Bernhard
Robert Gieselmann
Klemens Esterle
Alois Knoll
42
17
0
05 Feb 2021
A review of motion planning algorithms for intelligent robotics
Chengmin Zhou
Bingding Huang
Pasi Fränti
77
4
0
04 Feb 2021
Proactive and AoI-aware Failure Recovery for Stateful NFV-enabled Zero-Touch 6G Networks: Model-Free DRL Approach
Amirhossein Shaghaghi
Abolfazl Zakeri
Nader Mokari
M. Javan
M. Behdadfar
Eduard Axel Jorswieck
36
20
0
02 Feb 2021
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic Malware
Mohit Sewak
S. K. Sahay
Hemant Rathore
43
13
0
01 Feb 2021
Deep Reinforcement Learning Aided Monte Carlo Tree Search for MIMO Detection
Tz-Wei Mo
Ronald Y. Chang
Te-Yi Kan
64
1
0
30 Jan 2021
Previous
1
2
3
...
35
36
37
...
70
71
72
Next