Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Multi-Agent Meta-Reinforcement Learning for Self-Powered and Sustainable Edge Computing Systems
M. S. Munir
N. H. Tran
Walid Saad
Choong Seon Hong
143
21
0
20 Feb 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
...
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
J. Peng
OffRL
89
12
0
19 Feb 2020
Multi-Issue Bargaining With Deep Reinforcement Learning
Ho-Chun Herbert Chang
42
2
0
18 Feb 2020
MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding
Haolin Zhou
Chaoqi Yang
Xiaofeng Gao
Qiong Chen
Gongshen Liu
Guihai Chen
71
6
0
18 Feb 2020
Symbolic Network: Generalized Neural Policies for Relational MDPs
Sankalp Garg
Aniket Bajpai
Mausam
34
5
0
18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Shirli Di-Castro Shashua
Shie Mannor
OffRL
76
12
0
17 Feb 2020
Adaptive Experience Selection for Policy Gradient
S. Mohamad
Giovanni Montana
106
0
0
17 Feb 2020
Reinforcement learning for the privacy preservation and manipulation of eye tracking data
Wolfgang Fuhl
Efe Bozkir
Enkelejda Kasneci
60
1
0
17 Feb 2020
First Order Constrained Optimization in Policy Space
Yiming Zhang
Q. Vuong
George Andriopoulos
46
4
0
16 Feb 2020
Deep RL Agent for a Real-Time Action Strategy Game
Michal Warchalski
Dimitrije Radojević
M. Milosevic
18
0
0
15 Feb 2020
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
Navid Naderializadeh
J. Sydir
M. Simsek
Hosein Nikopour
79
129
0
14 Feb 2020
Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
M. Kawanaka
Yuma Koizumi
Ryoichi Miyazaki
Kohei Yatabe
AAML
70
23
0
14 Feb 2020
Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems
Siyuan Zhuang
Zhuohan Li
Danyang Zhuo
Stephanie Wang
Eric Liang
Robert Nishihara
Philipp Moritz
Ion Stoica
40
24
0
13 Feb 2020
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
Yangang Ren
Jingliang Duan
Shengbo Eben Li
Yang Guan
Qi Sun
OffRL
60
30
0
13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription
Olivier Francon
Santiago Gonzalez
Babak Hodjat
Elliot Meyerson
Risto Miikkulainen
Xin Qiu
Hormoz Shahrzad
80
17
0
13 Feb 2020
Learning to Generate Levels From Nothing
Philip Bontrager
Julian Togelius
GAN
61
22
0
12 Feb 2020
Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Ge Liu
Rui Wu
Heng-Tze Cheng
Jing Wang
Jayden Ooi
Lihong Li
Ang Li
Wai Lok Sibon Li
Craig Boutilier
Ed H. Chi
OffRL
36
4
0
12 Feb 2020
Intrinsic Motivation for Encouraging Synergistic Behavior
Rohan Chitnis
Shubham Tulsiani
Saurabh Gupta
Abhinav Gupta
50
28
0
12 Feb 2020
Regret Bounds for Discounted MDPs
Shuang Liu
H. Su
OffRL
80
19
0
12 Feb 2020
SparseIDS: Learning Packet Sampling with Reinforcement Learning
Maximilian Bachl
Fares Meghdouri
J. Fabini
Tanja Zseby
46
6
0
10 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic
Yuguang Yue
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
OffRL
78
5
0
10 Feb 2020
Self-Attentive Associative Memory
Hung Le
T. Tran
Svetha Venkatesh
101
56
0
10 Feb 2020
Capsule Network Performance with Autonomous Navigation
Tom Molnar
Eugenio Culurciello
3DPC
25
2
0
08 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids
Lei Lei
Yue Tan
Glenn Dahlenburg
W. Xiang
K. Zheng
76
71
0
07 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement learning
Kevin R. McKee
I. Gemp
Brian McWilliams
Edgar A. Duéñez-Guzmán
Edward Hughes
Joel Z Leibo
97
85
0
06 Feb 2020
Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline Generation
Yun-Zhu Song
Hong-Han Shuai
Sung-Lin Yeh
Yi-Lun Wu
Lun-Wei Ku
Chao-Han Huck Yang
81
21
0
06 Feb 2020
Temporal-adaptive Hierarchical Reinforcement Learning
Wen-Ji Zhou
Yang Yu
55
3
0
06 Feb 2020
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
C. Shi
Runzhe Wan
R. Song
Wenbin Lu
Ling Leng
82
39
0
05 Feb 2020
Compositional Languages Emerge in a Neural Iterated Learning Model
Yi Ren
Shangmin Guo
Matthieu Labeau
Shay B. Cohen
S. Kirby
164
98
0
04 Feb 2020
Learning rewards for robotic ultrasound scanning using probabilistic temporal ranking
Michael G. Burke
Katie Lu
Daniel Angelov
Artūras Straižys
Craig Innes
Kartic Subr
S. Ramamoorthy
58
11
0
04 Feb 2020
Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation
Siqi Yang
Lin Wu
Arnold Wiliem
Brian C. Lovell
ObjD
60
19
0
03 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
367
1,710
0
02 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA
Sami Khairy
Prasanna Balaprakash
L. Cai
Y. Cheng
31
73
0
31 Jan 2020
Locally Private Distributed Reinforcement Learning
Hajime Ono
Tsubasa Takahashi
OffRL
69
23
0
31 Jan 2020
Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Peter Henderson
Jie Hu
Joshua Romoff
Emma Brunskill
Dan Jurafsky
Joelle Pineau
118
459
0
31 Jan 2020
Preventing Imitation Learning with Adversarial Policy Ensembles
Albert Zhan
Stas Tiomkin
Pieter Abbeel
40
3
0
31 Jan 2020
Using Fractal Neural Networks to Play SimCity 1 and Conway's Game of Life at Variable Scales
Sam Earle
AI4CE
76
18
0
29 Jan 2020
MEMO: A Deep Network for Flexible Combination of Episodic Memories
Andrea Banino
Adria Puigdomenech Badia
Raphael Köster
Martin Chadwick
V. Zambaldi
Demis Hassabis
Caswell Barry
M. Botvinick
D. Kumaran
Charles Blundell
KELM
87
35
0
29 Jan 2020
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
Georgios Papoudakis
Stefano V. Albrecht
BDL DRL
64
29
0
29 Jan 2020
Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning
Shanhui Sun
Jing Hu
Mingqing Yao
Jinrong Hu
Xiaodong Yang
Qi Song
Xi Wu
77
24
0
29 Jan 2020
Towards Learning Multi-agent Negotiations via Self-Play
Yichuan Tang
77
33
0
28 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization
Chang Ye
Ahmed Khalifa
Philip Bontrager
Julian Togelius
104
38
0
27 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
147
137
0
27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning
Ahmed Khalifa
Philip Bontrager
Sam Earle
Julian Togelius
80
146
0
24 Jan 2020
EgoMap: Projective mapping and structured egocentric memory for Deep RL
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
EgoV
89
27
0
24 Jan 2020
Graph Constrained Reinforcement Learning for Natural Language Action Spaces
Prithviraj Ammanabrolu
Matthew J. Hausknecht
AI4CE LLMAG
111
129
0
23 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
155
246
0
23 Jan 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
78
60
0
22 Jan 2020
On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning
Ameya Pore
G. Aragon-Camarasa
61
11
0
22 Jan 2020
Reinforcement Learning Based Vehicle-cell Association Algorithm for Highly Mobile Millimeter Wave Communication
Hamza Khan
Anis Elgabli
S. Samarakoon
M. Bennis
Choong Seon Hong
45
33
0
22 Jan 2020