Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Re-understanding Finite-State Representations of Recurrent Policy Networks
Mohamad H. Danesh
Anurag Koul
Alan Fern
Saeed Khorram
69
22
0
06 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
21
4
0
04 Jun 2020
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent
Adrien Bolland
Ioannis Boukas
M. Berger
D. Ernst
20
3
0
02 Jun 2020
Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas
Yen-Ling Kuo
Boris Katz
Andrei Barbu
78
41
0
01 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
Hyperparameter optimization with REINFORCE and Transformers
C. Krishna
Ashish Gupta
Swarnim Narayan
Himanshu Rai
Diksha Manchanda
58
2
0
01 Jun 2020
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning
Parvin Malekzadeh
Mohammad Salimibeni
Arash Mohammadi
A. Assa
Konstantinos N. Plataniotis
OffRL
45
12
0
30 May 2020
Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments
H. Surmann
Christian Jestel
Robin Marchel
Franziska Musberg
Houssem Elhadj
Mahbube Ardani
49
85
0
28 May 2020
Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning
Parth Chadha
17
0
0
28 May 2020
The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems
Eric M. S. P. Veith
Nils Wenninghoff
Emilie Frost
47
5
0
27 May 2020
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
119
776
0
27 May 2020
Towards intervention-centric causal reasoning in learning agents
B. Lansdell
LRM
CML
21
0
0
26 May 2020
Efficient Use of heuristics for accelerating XCS-based Policy Learning in Markov Games
Hao Chen
Chang Wang
Jian Huang
Jianxing Gong
114
5
0
26 May 2020
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce
Jianxiong Wei
Anxiang Zeng
Yueqiu Wu
P. Guo
Q. Hua
Qingpeng Cai
OffRL
74
9
0
25 May 2020
Learning to Simulate Dynamic Environments with GameGAN
Seung Wook Kim
Yuhao Zhou
Jonah Philion
Antonio Torralba
Sanja Fidler
GAN
104
106
0
25 May 2020
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
99
11
0
25 May 2020
Policy Entropy for Out-of-Distribution Classification
Andreas Sedlmeier
Robert Muller
Steffen Illium
Claudia Linnhoff-Popien
OODD
OffRL
59
14
0
25 May 2020
Learning visual servo policies via planner cloning
Ulrich Viereck
Kate Saenko
Robert Platt
OffRL
35
2
0
24 May 2020
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
Jianfeng Liu
Feiyang Pan
Ling Luo
OffRL
67
23
0
24 May 2020
Learning from Naturalistic Driving Data for Human-like Autonomous Highway Driving
Donghao Xu
Zhezhang Ding
Xu He
Huijing Zhao
M. Moze
François Aioun
F. Guillemard
39
55
0
23 May 2020
Evaluating Generalisation in General Video Game Playing
Martin Balla
Simon Lucas
Diego Perez-Liebana
44
2
0
22 May 2020
Learning Combinatorial Optimization on Graphs: A Survey with Applications to Networking
N. Vesselinova
Rebecca Steinert
Daniel F. Perez-Ramirez
Magnus Boman
GNN
AI4CE
90
146
0
22 May 2020
Consensus Driven Learning
K. Crandall
Dustin J. Webb
FedML
19
0
0
20 May 2020
A Metric Learning Approach to Anomaly Detection in Video Games
Benedict Wilkins
C. Watkins
Kostas Stathis
37
1
0
20 May 2020
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise
Yue Wang
Shaofeng Zou
53
21
0
20 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
36
98
0
20 May 2020
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
169
87
0
20 May 2020
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
101
82
0
19 May 2020
Privileged Information Dropout in Reinforcement Learning
Pierre-Alexandre Kamienny
Kai Arulkumaran
Feryal M. P. Behbahani
Wendelin Boehmer
Shimon Whiteson
61
10
0
19 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
98
86
0
18 May 2020
Multi-Objective level generator generation with Marahel
Ahmed Khalifa
Julian Togelius
55
9
0
17 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
94
88
0
16 May 2020
Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning
Jianwen Sun
Tianwei Zhang
Xiaofei Xie
Lei Ma
Yan Zheng
Kangjie Chen
Yang Liu
AAML
72
118
0
14 May 2020
Reinforced Coloring for End-to-End Instance Segmentation
Tuan Tran Anh
Khoa Nguyen-Tuan
Tran Minh Quan
Won-Ki Jeong
SSeg
ISeg
37
2
0
14 May 2020
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
176
294
0
13 May 2020
From Simulation to Real World Maneuver Execution using Deep Reinforcement Learning
Alessandro Paolo Capasso
Giulio Bacchiani
A. Broggi
56
4
0
13 May 2020
Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning
Han Cha
Jihong Park
Hyesung Kim
M. Bennis
Seong-Lyun Kim
71
26
0
13 May 2020
Training spiking neural networks using reinforcement learning
Sneha Aenugu
OffRL
16
2
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
88
58
0
12 May 2020
Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments
Baiming Chen
Mengdi Xu
Zuxin Liu
Liang-Sheng Li
Ding Zhao
70
37
0
11 May 2020
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems
Alexis Asseman
Nicolas Antoine
A. Ozcan
34
4
0
10 May 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSL
OffRL
89
27
0
07 May 2020
Guided Policy Search Model-based Reinforcement Learning for Urban Autonomous Driving
Zhuo Xu
Jianyu Chen
Masayoshi Tomizuka
30
12
0
06 May 2020
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems
Anthony Corso
Robert J. Moss
Mark Koren
Ritchie Lee
Mykel J. Kochenderfer
97
176
0
06 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
87
18
0
06 May 2020
Robotic Arm Control and Task Training through Deep Reinforcement Learning
Andrea Franceschetti
E. Tosello
Nicola Castaman
Stefano Ghidoni
56
32
0
06 May 2020
Generalized Planning With Deep Reinforcement Learning
Or Rivlin
Tamir Hazan
E. Karpas
OffRL
55
48
0
05 May 2020
Discrete-to-Deep Supervised Policy Learning
B. Kurniawan
Peter Vamplew
Michael Papasimeon
Richard Dazeley
Cameron Foale
OffRL
16
3
0
05 May 2020
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning for Smart Grids
Joash Lee
Wenbo Wang
Dusit Niyato
22
9
0
05 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
193
149
0
04 May 2020
Previous
1
2
3
...
43
44
45
...
70
71
72
Next