ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Multi-Agent Deep Reinforcement Learning for Liquidation Strategy
  Analysis
Multi-Agent Deep Reinforcement Learning for Liquidation Strategy Analysis
Wenhang Bao
Xiao-Yang Liu
AIFin
87
43
0
24 Jun 2019
Planning Robot Motion using Deep Visual Prediction
Planning Robot Motion using Deep Visual Prediction
Meenakshi Sarkar
Prabhu Pradhan
Debasish Ghose
67
6
0
24 Jun 2019
Optimal Use of Experience in First Person Shooter Environments
Optimal Use of Experience in First Person Shooter Environments
Matthew Aitchison
37
2
0
24 Jun 2019
Ranking Policy Gradient
Ranking Policy Gradient
Kaixiang Lin
Jiayu Zhou
OffRL
67
7
0
24 Jun 2019
Learning Belief Representations for Imitation Learning in POMDPs
Learning Belief Representations for Imitation Learning in POMDPs
Tanmay Gangwani
Joel Lehman
Qiang Liu
Jian Peng
59
37
0
22 Jun 2019
Reinforcement Learning with Convex Constraints
Reinforcement Learning with Convex Constraints
Sobhan Miryoosefi
Kianté Brantley
Hal Daumé
Miroslav Dudík
Robert Schapire
61
93
0
21 Jun 2019
Continual Reinforcement Learning with Diversity Exploration and
  Adversarial Self-Correction
Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction
Fengda Zhu
Xiaojun Chang
Runhao Zeng
Mingkui Tan
CLL
52
3
0
21 Jun 2019
Variable Impedance Control in End-Effector Space: An Action Space for
  Reinforcement Learning in Contact-Rich Tasks
Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks
Roberto Martín-Martín
Michelle A. Lee
Rachel Gardner
Silvio Savarese
Jeannette Bohg
Animesh Garg
86
198
0
20 Jun 2019
Finding Needles in a Moving Haystack: Prioritizing Alerts with
  Adversarial Reinforcement Learning
Finding Needles in a Moving Haystack: Prioritizing Alerts with Adversarial Reinforcement Learning
Liang Tong
Aron Laszka
Chao Yan
Ning Zhang
Yevgeniy Vorobeychik
AAML
54
18
0
20 Jun 2019
Global Convergence of Policy Gradient Methods to (Almost) Locally
  Optimal Policies
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Kai Zhang
Alec Koppel
Haoqi Zhu
Tamer Basar
114
191
0
19 Jun 2019
Reward Prediction Error as an Exploration Objective in Deep RL
Reward Prediction Error as an Exploration Objective in Deep RL
Riley Simmons-Edler
Ben Eisner
Daniel Yang
Anthony Bisulco
E. Mitchell
Sebastian Seung
Daniel D. Lee
67
5
0
19 Jun 2019
Directed Exploration for Reinforcement Learning
Directed Exploration for Reinforcement Learning
Z. Guo
Emma Brunskill
71
12
0
18 Jun 2019
Towards White-box Benchmarks for Algorithm Control
Towards White-box Benchmarks for Algorithm Control
André Biedenkapp
H. Bozkurt
Frank Hutter
Marius Lindauer
179
7
0
18 Jun 2019
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant
  Reinforcement Learning
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning
Tadashi Kozuno
Dongqi Han
Kenji Doya
OffRL
50
2
0
18 Jun 2019
NeoNav: Improving the Generalization of Visual Navigation via Generating
  Next Expected Observations
NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations
Qiaoyun Wu
Tianyi Zhou
Jun Wang
Kai Xu
162
15
0
17 Jun 2019
Is the Policy Gradient a Gradient?
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
94
58
0
17 Jun 2019
A Survey of Optimization Methods from a Machine Learning Perspective
A Survey of Optimization Methods from a Machine Learning Perspective
Shiliang Sun
Zehui Cao
Han Zhu
Jing Zhao
90
566
0
17 Jun 2019
Injecting Prior Knowledge for Transfer Learning into Reinforcement
  Learning Algorithms using Logic Tensor Networks
Injecting Prior Knowledge for Transfer Learning into Reinforcement Learning Algorithms using Logic Tensor Networks
Samy Badreddine
Michael Spranger
OffRL
51
15
0
15 Jun 2019
Epistemic Risk-Sensitive Reinforcement Learning
Epistemic Risk-Sensitive Reinforcement Learning
Hannes Eriksson
Christos Dimitrakakis
127
29
0
14 Jun 2019
Multigrid Neural Memory
Multigrid Neural Memory
T. Huynh
Michael Maire
Matthew R. Walter
64
10
0
13 Jun 2019
Deep Reinforcement Learning for Cyber Security
Deep Reinforcement Learning for Cyber Security
Thanh Thi Nguyen
Vijay Janapa Reddi
OffRLAI4CE
117
335
0
13 Jun 2019
Imitation Learning of Neural Spatio-Temporal Point Processes
Imitation Learning of Neural Spatio-Temporal Point Processes
Shixiang Zhu
Shuang Li
Zhigang Peng
Yao Xie
3DPCAI4TS
46
6
0
13 Jun 2019
Conditioning of Reinforcement Learning Agents and its Policy
  Regularization Application
Conditioning of Reinforcement Learning Agents and its Policy Regularization Application
Arip Asadulaev
Igor Kuznetsov
Gideon Stein
Andrey Filchenkov
16
0
0
13 Jun 2019
Neural Graph Evolution: Towards Efficient Automatic Robot Design
Neural Graph Evolution: Towards Efficient Automatic Robot Design
Tingwu Wang
Yuhao Zhou
Sanja Fidler
Jimmy Ba
65
63
0
12 Jun 2019
Active Learning of Dynamics for Data-Driven Control Using Koopman
  Operators
Active Learning of Dynamics for Data-Driven Control Using Koopman Operators
Ian Abraham
Todd Murphey
71
166
0
12 Jun 2019
Reinforcement Learning for Integer Programming: Learning to Cut
Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang
Shipra Agrawal
Yuri Faenza
AI4CE
111
174
0
11 Jun 2019
A Hybrid Approach Between Adversarial Generative Networks and
  Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression
A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression
N. Savioli
GAN
23
2
0
11 Jun 2019
Causal Discovery with Reinforcement Learning
Causal Discovery with Reinforcement Learning
Shengyu Zhu
Ignavier Ng
Zhitang Chen
CML
114
242
0
11 Jun 2019
Learning Powerful Policies by Using Consistent Dynamics Model
Learning Powerful Policies by Using Consistent Dynamics Model
Shagun Sodhani
Anirudh Goyal
T. Deleu
Yoshua Bengio
Sergey Levine
Jian Tang
OffRL
45
5
0
11 Jun 2019
Write, Execute, Assess: Program Synthesis with a REPL
Write, Execute, Assess: Program Synthesis with a REPL
Kevin Ellis
Maxwell Nye
Yewen Pu
Felix Sosa
J. Tenenbaum
Armando Solar-Lezama
109
169
0
09 Jun 2019
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran
Joshua Romoff
Nicolas Ballas
Joelle Pineau
Michael G. Rabbat
64
33
0
09 Jun 2019
Transfer Learning by Modeling a Distribution over Policies
Transfer Learning by Modeling a Distribution over Policies
Disha Shrivastava
Eeshan Gunesh Dhekane
Riashat Islam
OODOffRL
27
0
0
09 Jun 2019
Planning With Uncertain Specifications (PUnS)
Planning With Uncertain Specifications (PUnS)
Ankit J. Shah
Shen Li
J. Shah
87
25
0
07 Jun 2019
Clustered Reinforcement Learning
Clustered Reinforcement Learning
Xiao Ma
Shen-Yi Zhao
Wu-Jun Li
OffRL
64
6
0
06 Jun 2019
Ease-of-Teaching and Language Structure from Emergent Communication
Ease-of-Teaching and Language Structure from Emergent Communication
Fushan Li
Michael Bowling
193
102
0
06 Jun 2019
Deep Reinforcement Learning for Multi-objective Optimization
Deep Reinforcement Learning for Multi-objective Optimization
Kaiwen Li
Tao Zhang
Rui Wang
AI4CE
88
273
0
06 Jun 2019
How to Initialize your Network? Robust Initialization for WeightNorm &
  ResNets
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Devansh Arpit
Victor Campos
Yoshua Bengio
83
59
0
05 Jun 2019
Global Optimality Guarantees For Policy Gradient Methods
Global Optimality Guarantees For Policy Gradient Methods
Jalaj Bhandari
Daniel Russo
115
193
0
05 Jun 2019
Reinforcement Learning with Low-Complexity Liquid State Machines
Reinforcement Learning with Low-Complexity Liquid State Machines
Wachirawit Ponghiran
G. Srinivasan
Kaushik Roy
52
14
0
04 Jun 2019
Load Balancing for Ultra-Dense Networks: A Deep Reinforcement Learning
  Based Approach
Load Balancing for Ultra-Dense Networks: A Deep Reinforcement Learning Based Approach
Yue Xu
Wenjun Xu
Zhi Wang
Jiaru Lin
Shuguang Cui
41
70
0
03 Jun 2019
Neural Replicator Dynamics
Neural Replicator Dynamics
Daniel Hennes
Dustin Morrill
Shayegan Omidshafiei
Rémi Munos
Julien Perolat
...
A. Gruslys
Jean-Baptiste Lespiau
Paavo Parmas
Edgar A. Duénez-Guzmán
K. Tuyls
74
16
0
01 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum
  Linear Quadratic Games
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kai Zhang
Zhuoran Yang
Tamer Basar
111
128
0
31 May 2019
Entropy Minimization In Emergent Languages
Entropy Minimization In Emergent Languages
Eugene Kharitonov
Rahma Chaabouni
Diane Bouchacourt
Marco Baroni
101
3
0
31 May 2019
Interval timing in deep reinforcement learning agents
Interval timing in deep reinforcement learning agents
B. Deverett
Ryan Faulkner
Meire Fortunato
Greg Wayne
Joel Z Leibo
47
14
0
31 May 2019
Sequence Modeling of Temporal Credit Assignment for Episodic
  Reinforcement Learning
Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning
Yang Liu
Yunan Luo
Yuanyi Zhong
Xi Chen
Qiang Liu
Jian-wei Peng
74
36
0
31 May 2019
Combating the Compounding-Error Problem with a Multi-step Model
Combating the Compounding-Error Problem with a Multi-step Model
Kavosh Asadi
Dipendra Kumar Misra
Seungchan Kim
Michel L. Littman
LRM
83
55
0
30 May 2019
Effective Medical Test Suggestions Using Deep Reinforcement Learning
Effective Medical Test Suggestions Using Deep Reinforcement Learning
Yang Chen
Kai-Fu Tang
Yu-Shao Peng
Edward Y. Chang
OffRLOOD
13
6
0
30 May 2019
Don't Forget Your Teacher: A Corrective Reinforcement Learning Framework
Don't Forget Your Teacher: A Corrective Reinforcement Learning Framework
M. Nazari
Majid Jahani
L. Snyder
Martin Takáč
OffRLOnRL
25
1
0
30 May 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN
  Compilation
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation
Byung Hoon Ahn
Prannoy Pilligundla
H. Esmaeilzadeh
71
20
0
30 May 2019
On the Generalization Gap in Reparameterizable Reinforcement Learning
On the Generalization Gap in Reparameterizable Reinforcement Learning
Huan Wang
Stephan Zheng
Caiming Xiong
R. Socher
117
41
0
29 May 2019
Previous
123...535455...707172
Next