1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Complementary reinforcement learning towards explainable agents
J. H. Lee
53
12
0
01 Jan 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
203
613
0
01 Jan 2019
Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
Thanh Thi Nguyen
Ngoc Duy Nguyen
S. Nahavandi
142
796
0
31 Dec 2018
Learn to Interpret Atari Agents
Zhao Yang
S. Bai
Li Zhang
Philip Torr
80
29
0
29 Dec 2018
Dynamic Planning Networks
Norman L. Tasfi
Miriam A. M. Capretz
70
5
0
28 Dec 2018
Graph Transformation Policy Network for Chemical Reaction Prediction
Kien Do
T. Tran
Svetha Venkatesh
123
163
0
22 Dec 2018
Introducing Neuromodulation in Deep Neural Networks to Learn Adaptive Behaviours
Nicolas Vecoven
D. Ernst
Antoine Wehenkel
G. Drion
AI4CE
64
43
0
21 Dec 2018
NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning
Sirui Xie
Junning Huang
Lanxin Lei
Chunxiao Liu
Zheng Ma
Wayne Zhang
Liang Lin
57
8
0
21 Dec 2018
Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
51
26
0
21 Dec 2018
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
61
31
0
19 Dec 2018
Domain Adaptation for Reinforcement Learning on the Atari
Thomas Carr
Maria Chli
George Vogiatzis
42
22
0
18 Dec 2018
Sim-to-Real via Sim-to-Sim: Data-efficient Robotic Grasping via Randomized-to-Canonical Adaptation Networks
Stephen James
Paul Wohlhart
Mrinal Kalakrishnan
Dmitry Kalashnikov
A. Irpan
Julian Ibarz
Sergey Levine
R. Hadsell
Konstantinos Bousmalis
119
450
0
18 Dec 2018
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents
F. Such
Vashisht Madhavan
Rosanne Liu
Rui Wang
Pablo Samuel Castro
...
Jiale Zhi
Ludwig Schubert
Marc G. Bellemare
Jeff Clune
Joel Lehman
OffRL
86
54
0
17 Dec 2018
Malthusian Reinforcement Learning
Joel Z Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duéñez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
103
38
0
17 Dec 2018
A Logarithmic Barrier Method For Proximal Policy Optimization
Cheng Zeng
Hongming Zhang
21
2
0
16 Dec 2018
An Empirical Model of Large-Batch Training
Sam McCandlish
Jared Kaplan
Dario Amodei
OpenAI Dota Team
76
280
0
14 Dec 2018
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
86
279
0
14 Dec 2018
Scaling shared model governance via model splitting
Miljan Martic
Jan Leike
Andrew Trask
Matteo Hessel
Shane Legg
Pushmeet Kohli
FedML
64
2
0
14 Dec 2018
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
159
2,461
0
13 Dec 2018
Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning
Linhai Xie
Sen Wang
Stefano Rosa
Andrew Markham
A. Trigoni
OffRL
99
80
0
12 Dec 2018
Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data
Hyunwoo Jung
Moonsu Han
Minki Kang
Sung Ju Hwang
CLL
KELM
RALM
53
5
0
11 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
129
139
0
08 Dec 2018
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning
Tianyi Chen
Kai Zhang
G. Giannakis
Tamer Basar
OffRL
100
41
0
07 Dec 2018
Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control
Zhuo Xu
Chen Tang
Masayoshi Tomizuka
OffRL
54
36
0
07 Dec 2018
Wireless Network Intelligence at the Edge
Jihong Park
S. Samarakoon
M. Bennis
Mérouane Debbah
115
521
0
07 Dec 2018
Online Model Distillation for Efficient Video Inference
Ravi Teja Mullapudi
Steven Chen
Keyi Zhang
Deva Ramanan
Kayvon Fatahalian
VGen
126
115
0
06 Dec 2018
Deep Reinforcement Learning and the Deadly Triad
H. V. Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
OffRL
115
232
0
06 Dec 2018
Top-K Off-Policy Correction for a REINFORCE Recommender System
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
149
485
0
06 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
100
159
0
06 Dec 2018
Towards a Definition of Disentangled Representations
I. Higgins
David Amos
David Pfau
S. Racanière
Loic Matthey
Danilo Jimenez Rezende
Alexander Lerchner
OCL
DRL
148
481
0
05 Dec 2018
Adapting Auxiliary Losses Using Gradient Similarity
Yunshu Du
Wojciech M. Czarnecki
Siddhant M. Jayakumar
Mehrdad Farajtabar
Razvan Pascanu
Balaji Lakshminarayanan
134
159
0
05 Dec 2018
Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Jonathan Uesato
Ananya Kumar
Csaba Szepesvári
Tom Erez
Avraham Ruderman
Keith Anderson
Krishnamurthy Dvijotham
N. Heess
Pushmeet Kohli
AAML
107
82
0
04 Dec 2018
Natural Option Critic
Saket Tiwari
Philip S. Thomas
57
22
0
04 Dec 2018
JANUS: Fast and Flexible Deep Learning via Symbolic Graph Execution of Imperative Programs
Eunji Jeong
Sungwoo Cho
Gyeong-In Yu
Joo Seong Jeong
Dongjin Shin
Byung-Gon Chun
59
25
0
04 Dec 2018
Mitigating Planner Overfitting in Model-Based Reinforcement Learning
Dilip Arumugam
David Abel
Kavosh Asadi
N. Gopalan
Christopher Grimm
Jun Ki Lee
Lucas Lehnert
Michael L. Littman
50
11
0
03 Dec 2018
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning
Mitchell Wortsman
Kiana Ehsani
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
SSL
94
223
0
03 Dec 2018
Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning
Aishwarya Agrawal
Mateusz Malinowski
Felix Hill
S. M. Ali Eslami
Oriol Vinyals
Tejas D. Kulkarni
67
4
0
03 Dec 2018
AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity
Yibo Zeng
Fei Feng
W. Yin
56
3
0
03 Dec 2018
Resource Constrained Deep Reinforcement Learning
Abhinav Bhatia
Pradeep Varakantham
Akshat Kumar
81
47
0
03 Dec 2018
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective
Zhao Song
Ronald E. Parr
Lawrence Carin
55
4
0
02 Dec 2018
Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
OffRL
122
9
0
30 Nov 2018
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
175
1,279
0
30 Nov 2018
How to Organize your Deep Reinforcement Learning Agents: The Importance of Communication Topology
D. Adjodah
D. Calacci
Abhimanyu Dubey
P. Krafft
Esteban Moro Egido
Alex Pentland
GNN
39
0
0
30 Nov 2018
Learning Finite State Representations of Recurrent Policy Networks
Anurag Koul
S. Greydanus
Alan Fern
68
88
0
29 Nov 2018
Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments
Howard Chen
Alane Suhr
Dipendra Kumar Misra
Noah Snavely
Yoav Artzi
140
391
0
29 Nov 2018
Trajectory-based Learning for Ball-in-Maze Games
S. Paul
J. Baar
47
1
0
28 Nov 2018
Deep Reinforcement Learning for Autonomous Driving
Sen Wang
Daoyuan Jia
Xinshuo Weng
78
165
0
28 Nov 2018
Target Driven Visual Navigation with Hybrid Asynchronous Universal Successor Representations
Shamane Siriwardhana
Abdenacer Naouri
Huansheng Ning
29
5
0
27 Nov 2018
Distributed traffic light control at uncoupled intersections with real-world topology by deep reinforcement learning
Mark Schutera
Niklas Goby
S. Smolarek
Markus Reischl
33
7
0
27 Nov 2018
Understanding the impact of entropy on policy optimization
Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
87
238
0
27 Nov 2018