1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Adam Stooke
Valentin Dalibard
Siddhant M. Jayakumar
Wojciech M. Czarnecki
Max Jaderberg
61
1
0
26 Jun 2020
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Hang Zhao
Qijin She
Chenyang Zhu
Yifan Yang
Kai Xu
OffRL
71
125
0
26 Jun 2020
Local Stochastic Approximation: A Unified View of Federated Learning and Distributed Multi-Task Reinforcement Learning Algorithms
Thinh T. Doan
FedML
66
10
0
24 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
108
58
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
174
419
0
22 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktäschel
Edward Grefenstette
114
127
0
22 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
113
105
0
21 Jun 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
109
87
0
20 Jun 2020
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration
Shuai Han
Wenbo Zhou
Jing Liu
Shuai Lu
48
28
0
19 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
52
3
0
18 Jun 2020
Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
Arrasy Rahman
Niklas Höpner
Filippos Christianos
Stefano V. Albrecht
93
58
0
18 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
72
29
0
18 Jun 2020
Towards Recurrent Autoregressive Flow Models
John Mern
Peter Morales
Mykel J. Kochenderfer
BDL
30
0
0
17 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
66
4
0
17 Jun 2020
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Thomas Asikis
Lucas Böttcher
Nino Antulov-Fantulin
103
44
0
17 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
78
74
0
16 Jun 2020
Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning
Bram Cals
Yingqian Zhang
R. Dijkman
Claudy van Dorst
OffRL
62
30
0
16 Jun 2020
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
113
66
0
16 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
203
620
0
16 Jun 2020
ForMIC: Foraging via Multiagent RL with Implicit Communication
Samuel Shaw
Emerson Wenzel
Alexis Walker
Guillaume Sartoretti
OffRL
45
7
0
15 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schäfer
Stefano V. Albrecht
OffRL
118
233
0
14 Jun 2020
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
48
12
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
81
19
0
14 Jun 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
530
6,881
0
13 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
41
14
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
121
24
0
12 Jun 2020
Using Reinforcement Learning to Allocate and Manage Service Function Chains in Cellular Networks
Guto Leoni Santos
P. Endo
8
0
0
12 Jun 2020
Learning to Communicate Using Counterfactual Reasoning
Simon Vanneste
Astrid Vanneste
Kevin Mets
Tom De Schepper
Ali Anwar
Siegfried Mercelis
Steven Latré
P. Hellinckx
OffRL
80
11
0
12 Jun 2020
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Filippos Christianos
Lukas Schäfer
Stefano V. Albrecht
150
170
0
12 Jun 2020
Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Michael Wan
Tanmay Gangwani
Jian-wei Peng
61
19
0
12 Jun 2020
Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
NAI
AI4CE
95
29
0
12 Jun 2020
Decorrelated Double Q-learning
Gang Chen
33
2
0
12 Jun 2020
High-Precision Extraction of Emerging Concepts from Scientific Literature
Daniel King
Doug Downey
Daniel S. Weld
54
11
0
11 Jun 2020
Borrowing From the Future: Addressing Double Sampling in Model-free Control
Yuhua Zhu
Zachary Izzo
Lexing Ying
37
4
0
11 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
98
226
0
10 Jun 2020
Continuous Action Reinforcement Learning from a Mixture of Interpretable Experts
R. Akrour
Davide Tateo
Jan Peters
60
22
0
10 Jun 2020
Regret Minimization for Causal Inference on Large Treatment Space
Akira Tanimoto
Tomoya Sakai
Takashi Takenouchi
H. Kashima
CML
71
10
0
10 Jun 2020
Deep Learning for Change Detection in Remote Sensing Images: Comprehensive Review and Meta-Analysis
Lazhar Khelifi
M. Mignotte
91
265
0
10 Jun 2020
Fitted Q-Learning for Relational Domains
Srijita Das
S. Natarajan
Kaushik Roy
Ronald E. Parr
Kristian Kersting
56
15
0
10 Jun 2020
Dialog Policy Learning for Joint Clarification and Active Learning Queries
Aishwarya Padmakumar
Raymond J. Mooney
83
8
0
09 Jun 2020
Distributed Learning on Heterogeneous Resource-Constrained Devices
Martin Rapp
R. Khalili
J. Henkel
FedML
68
7
0
09 Jun 2020
Constrained episodic reinforcement learning in concave-convex and knapsack settings
Kianté Brantley
Miroslav Dudík
Thodoris Lykouris
Sobhan Miryoosefi
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
OffRL
105
52
0
09 Jun 2020
Stealing Deep Reinforcement Learning Models for Fun and Profit
Kangjie Chen
Shangwei Guo
Tianwei Zhang
Xiaofei Xie
Yang Liu
MLAU
MIACV
OffRL
90
45
0
09 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
106
43
0
08 Jun 2020
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
118
7
0
08 Jun 2020
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning
Sihan Zeng
Aqeel Anwar
Thinh T. Doan
A. Raychowdhury
Justin Romberg
88
40
0
08 Jun 2020
Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars
Ran Emuna
A. Borowsky
Armin Biess
89
24
0
07 Jun 2020
Deep active inference agents using Monte-Carlo methods
Zafeirios Fountas
Noor Sajid
P. Mediano
Karl J. Friston
125
106
0
07 Jun 2020
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity
Zihan Zhang
Yuanshuo Zhou
Xiangyang Ji
85
36
0
06 Jun 2020