1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Multi-Agent Meta-Reinforcement Learning for Self-Powered and Sustainable Edge Computing Systems
M. S. Munir
N. H. Tran
Walid Saad
Choong Seon Hong
143
21
0
20 Feb 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
...
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
J. Peng
OffRL
89
12
0
19 Feb 2020
Multi-Issue Bargaining With Deep Reinforcement Learning
Ho-Chun Herbert Chang
42
2
0
18 Feb 2020
MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding
Haolin Zhou
Chaoqi Yang
Xiaofeng Gao
Qiong Chen
Gongshen Liu
Guihai Chen
71
6
0
18 Feb 2020
Symbolic Network: Generalized Neural Policies for Relational MDPs
Sankalp Garg
Aniket Bajpai
Mausam
34
5
0
18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Shirli Di-Castro Shashua
Shie Mannor
OffRL
76
12
0
17 Feb 2020
Adaptive Experience Selection for Policy Gradient
S. Mohamad
Giovanni Montana
106
0
0
17 Feb 2020
Reinforcement learning for the privacy preservation and manipulation of eye tracking data
Wolfgang Fuhl
Efe Bozkir
Enkelejda Kasneci
60
1
0
17 Feb 2020
First Order Constrained Optimization in Policy Space
Yiming Zhang
Q. Vuong
George Andriopoulos
46
4
0
16 Feb 2020
Deep RL Agent for a Real-Time Action Strategy Game
Michal Warchalski
Dimitrije Radojević
M. Milosevic
18
0
0
15 Feb 2020
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
Navid Naderializadeh
J. Sydir
M. Simsek
Hosein Nikopour
79
129
0
14 Feb 2020
Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
M. Kawanaka
Yuma Koizumi
Ryoichi Miyazaki
Kohei Yatabe
AAML
70
23
0
14 Feb 2020
Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems
Siyuan Zhuang
Zhuohan Li
Danyang Zhuo
Stephanie Wang
Eric Liang
Robert Nishihara
Philipp Moritz
Ion Stoica
40
24
0
13 Feb 2020
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
Yangang Ren
Jingliang Duan
Shengbo Eben Li
Yang Guan
Qi Sun
OffRL
60
30
0
13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription
Olivier Francon
Santiago Gonzalez
Babak Hodjat
Elliot Meyerson
Risto Miikkulainen
Xin Qiu
Hormoz Shahrzad
80
17
0
13 Feb 2020
Learning to Generate Levels From Nothing
Philip Bontrager
Julian Togelius
GAN
61
22
0
12 Feb 2020
Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Ge Liu
Rui Wu
Heng-Tze Cheng
Jing Wang
Jayden Ooi
Lihong Li
Ang Li
Wai Lok Sibon Li
Craig Boutilier
Ed H. Chi
OffRL
36
4
0
12 Feb 2020
Intrinsic Motivation for Encouraging Synergistic Behavior
Rohan Chitnis
Shubham Tulsiani
Saurabh Gupta
Abhinav Gupta
50
28
0
12 Feb 2020
Regret Bounds for Discounted MDPs
Shuang Liu
H. Su
OffRL
80
19
0
12 Feb 2020
SparseIDS: Learning Packet Sampling with Reinforcement Learning
Maximilian Bachl
Fares Meghdouri
J. Fabini
Tanja Zseby
46
6
0
10 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic
Yuguang Yue
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
OffRL
78
5
0
10 Feb 2020
Self-Attentive Associative Memory
Hung Le
T. Tran
Svetha Venkatesh
101
56
0
10 Feb 2020
Capsule Network Performance with Autonomous Navigation
Tom Molnar
Eugenio Culurciello
3DPC
25
2
0
08 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids
Lei Lei
Yue Tan
Glenn Dahlenburg
W. Xiang
K. Zheng
76
71
0
07 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement learning
Kevin R. McKee
I. Gemp
Brian McWilliams
Edgar A. Duéñez-Guzmán
Edward Hughes
Joel Z Leibo
97
85
0
06 Feb 2020
Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline Generation
Yun-Zhu Song
Hong-Han Shuai
Sung-Lin Yeh
Yi-Lun Wu
Lun-Wei Ku
Chao-Han Huck Yang
81
21
0
06 Feb 2020
Temporal-adaptive Hierarchical Reinforcement Learning
Wen-Ji Zhou
Yang Yu
55
3
0
06 Feb 2020
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
C. Shi
Runzhe Wan
R. Song
Wenbin Lu
Ling Leng
82
39
0
05 Feb 2020
Compositional Languages Emerge in a Neural Iterated Learning Model
Yi Ren
Shangmin Guo
Matthieu Labeau
Shay B. Cohen
S. Kirby
164
98
0
04 Feb 2020
Learning rewards for robotic ultrasound scanning using probabilistic temporal ranking
Michael G. Burke
Katie Lu
Daniel Angelov
Artūras Straižys
Craig Innes
Kartic Subr
S. Ramamoorthy
58
11
0
04 Feb 2020
Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation
Siqi Yang
Lin Wu
Arnold Wiliem
Brian C. Lovell
ObjD
60
19
0
03 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
367
1,710
0
02 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA
Sami Khairy
Prasanna Balaprakash
L. Cai
Y. Cheng
31
73
0
31 Jan 2020
Locally Private Distributed Reinforcement Learning
Hajime Ono
Tsubasa Takahashi
OffRL
69
23
0
31 Jan 2020
Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Peter Henderson
Jie Hu
Joshua Romoff
Emma Brunskill
Dan Jurafsky
Joelle Pineau
118
459
0
31 Jan 2020
Preventing Imitation Learning with Adversarial Policy Ensembles
Albert Zhan
Stas Tiomkin
Pieter Abbeel
40
3
0
31 Jan 2020
Using Fractal Neural Networks to Play SimCity 1 and Conway's Game of Life at Variable Scales
Sam Earle
AI4CE
76
18
0
29 Jan 2020
MEMO: A Deep Network for Flexible Combination of Episodic Memories
Andrea Banino
Adria Puigdomenech Badia
Raphael Köster
Martin Chadwick
V. Zambaldi
Demis Hassabis
Caswell Barry
M. Botvinick
D. Kumaran
Charles Blundell
KELM
87
35
0
29 Jan 2020
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
Georgios Papoudakis
Stefano V. Albrecht
BDL
DRL
64
29
0
29 Jan 2020
Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning
Shanhui Sun
Jing Hu
Mingqing Yao
Jinrong Hu
Xiaodong Yang
Qi Song
Xi Wu
77
24
0
29 Jan 2020
Towards Learning Multi-agent Negotiations via Self-Play
Yichuan Tang
77
33
0
28 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization
Chang Ye
Ahmed Khalifa
Philip Bontrager
Julian Togelius
104
38
0
27 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
147
137
0
27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning
Ahmed Khalifa
Philip Bontrager
Sam Earle
Julian Togelius
80
146
0
24 Jan 2020
EgoMap: Projective mapping and structured egocentric memory for Deep RL
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
EgoV
89
27
0
24 Jan 2020
Graph Constrained Reinforcement Learning for Natural Language Action Spaces
Prithviraj Ammanabrolu
Matthew J. Hausknecht
AI4CE
LLMAG
111
129
0
23 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
155
246
0
23 Jan 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
78
60
0
22 Jan 2020
On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning
Ameya Pore
G. Aragon-Camarasa
61
11
0
22 Jan 2020
Reinforcement Learning Based Vehicle-cell Association Algorithm for Highly Mobile Millimeter Wave Communication
Hamza Khan
Anis Elgabli
S. Samarakoon
M. Bennis
Choong Seon Hong
45
33
0
22 Jan 2020