arXiv: 1602.01783
Asynchronous Methods for Deep Reinforcement Learning
v1, v2 (latest)


4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
87
713
0
26 Feb 2018
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
P. Chrabaszcz
I. Loshchilov
Frank Hutter
83
100
0
24 Feb 2018
The AdobeIndoorNav Dataset: Towards Deep Reinforcement Learning based Real-world Indoor Robot Visual Navigation
Kaichun Mo
Haoxiang Li
Zhe Lin
Joon-Young Lee
71
29
0
24 Feb 2018
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Evan Zheran Liu
Kelvin Guu
Panupong Pasupat
Tianlin Shi
Percy Liang
OnRL
79
223
0
24 Feb 2018
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
Kai Zhang
Zhuoran Yang
Han Liu
Tong Zhang
Tamer Basar
174
593
0
23 Feb 2018
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
Yan Zheng
Jianye Hao
Zongzhang Zhang
OffRL
63
38
0
23 Feb 2018
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
81
43
0
22 Feb 2018
Unicorn: Continual Learning with a Universal, Off-policy Agent
D. Mankowitz
Augustin Žídek
André Barreto
Dan Horgan
Matteo Hessel
John Quan
Junhyuk Oh
H. V. Hasselt
David Silver
Tom Schaul
CLL, OffRL
70
48
0
22 Feb 2018
Asynchronous stochastic approximations with asymptotically biased errors and deep multi-agent learning
Arunselvan Ramaswamy
S. Bhatnagar
Daniel E. Quevedo
17
2
0
22 Feb 2018
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
53
37
0
21 Feb 2018
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Nick Haber
Damian Mrowca
Li Fei-Fei
Daniel L. K. Yamins
LRM
96
120
0
21 Feb 2018
Global Pose Estimation with an Attention-based Recurrent Network
Emilio Parisotto
Devendra Singh Chaplot
Jian Zhang
Ruslan Salakhutdinov
58
70
0
19 Feb 2018
Accelerated Primal-Dual Policy Optimization for Safe Reinforcement Learning
Qingkai Liang
Fanyu Que
E. Modiano
85
102
0
19 Feb 2018
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management
Kaixiang Lin
Renyu Zhao
Zhe Xu
Jiayu Zhou
52
8
0
18 Feb 2018
Sim-to-Real Optimization of Complex Real World Mobile Network with Imperfect Information via Deep Reinforcement Learning from Self-play
Yongxi Tan
Jin Yang
Xin Chen
Qitao Song
Yunjun Chen
Zhangxiang Ye
Zhenqiang Su
39
2
0
18 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
103
124
0
13 Feb 2018
Efficient Exploration through Bayesian Deep Q-Networks
Kamyar Azizzadenesheli
Anima Anandkumar
OffRL, BDL
112
163
0
13 Feb 2018
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Dane S. Corneil
W. Gerstner
Johanni Brea
OffRL
86
62
0
12 Feb 2018
Reinforcement Learning for Solving the Vehicle Routing Problem
M. Nazari
Afshin Oroojlooy
L. Snyder
Martin Takáč
133
911
0
12 Feb 2018
Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control
Moritz August
José Miguel Hernández-Lobato
73
41
0
12 Feb 2018
Beyond the One Step Greedy Approach in Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
117
51
0
10 Feb 2018
Path Consistency Learning in Tsallis Entropy Regularized MDPs
Ofir Nachum
Yinlam Chow
Mohammad Ghavamzadeh
72
47
0
10 Feb 2018
Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis
Brenden K. Petersen
Jiachen Yang
Will Grathwohl
Chase Cockrell
Claudio Santiago
G. An
Daniel Faissol
AI4CE
53
26
0
08 Feb 2018
Learning and Querying Fast Generative Models for Reinforcement Learning
Lars Buesing
T. Weber
S. Racanière
S. M. Ali Eslami
Danilo Jimenez Rezende
...
Fabio Viola
F. Besse
Karol Gregor
Demis Hassabis
Daan Wierstra
OffRL
82
135
0
08 Feb 2018
A Critical Investigation of Deep Reinforcement Learning for Navigation
Vikas Dhiman
Shurjo Banerjee
Brent A. Griffin
J. Siskind
Jason J. Corso
84
36
0
07 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
276
1,609
0
05 Feb 2018
Multi-task Learning for Continuous Control
Himani Arora
Rajath Kumar
Jason Krone
Chong Li
76
13
0
03 Feb 2018
Learning Parametric Closed-Loop Policies for Markov Potential Games
Sergio Valcarcel Macua
Javier Zazo
S. Zazo
82
46
0
03 Feb 2018
Virtual-to-Real: Learning to Control in Visual Semantic Segmentation
Zhang-Wei Hong
Yu-Ming Chen
Shih-Yang Su
Tzu-Yun Shann
Yi-Hsiang Chang
...
Yueh-Chuan Chang
Tsu-Ching Hsiao
Hsin-Wei Hsiao
Sih-Pin Lai
Chun-Yi Lee
121
81
0
01 Feb 2018
VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control
Jingwei Zhang
L. Tai
Peng Yun
Yufeng Xiong
Ming-Yuan Liu
Joschka Boedecker
Wolfram Burgard
104
123
0
01 Feb 2018
Deep Reinforcement Learning for Programming Language Correction
Rahul Gupta
Aditya Kanade
S. Shevade
HAI
96
34
0
31 Jan 2018
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations
Xiaoqin Zhang
Huimin Ma
OffRL
117
38
0
31 Jan 2018
Deep Reinforcement Learning using Capsules in Advanced Game Environments
Per-Arne Andersen
68
16
0
29 Jan 2018
Image2GIF: Generating Cinemagraphs using Recurrent Deep Q-Networks
Yipin Zhou
Yale Song
Tamara L. Berg
GAN
56
8
0
27 Jan 2018
FlashRL: A Reinforcement Learning Platform for Flash Games
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
VLM
43
2
0
26 Jan 2018
Active Neural Localization
Devendra Singh Chaplot
Emilio Parisotto
Ruslan Salakhutdinov
83
86
0
24 Jan 2018
Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Joel Z Leibo
Cyprien de Masson d'Autume
Daniel Zoran
David Amos
Charlie Beattie
...
Simon Green
A. Gruslys
Shane Legg
Demis Hassabis
M. Botvinick
105
78
0
24 Jan 2018
Logically-Constrained Reinforcement Learning
Mohammadhosein Hasanbeig
Alessandro Abate
Daniel Kroening
97
83
0
24 Jan 2018
Learning Symmetric and Low-energy Locomotion
Wenhao Yu
Greg Turk
C. Karen Liu
126
186
0
24 Jan 2018
Experience-driven Networking: A Deep Reinforcement Learning based Approach
Zhiyuan Xu
Jian Tang
Jingsong Meng
Weiyi Zhang
Yanzhi Wang
C. Liu
Dejun Yang
OffRL
92
364
0
17 Jan 2018
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients
Jiaming Song
Yuhuai Wu
39
2
0
17 Jan 2018
Deep Reinforcement Learning of Cell Movement in the Early Stage of C. elegans Embryogenesis
Zehao Wang
Dali Wang
Chengcheng Li
Yichi Xu
Husheng Li
Z. Bao
57
35
0
14 Jan 2018
Autonomous Driving in Reality with Reinforcement Learning and Image Translation
N. Xu
Bowen Tan
Bingyu Kong
76
36
0
13 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
57
0
0
11 Jan 2018
Neural Program Synthesis with Priority Queue Training
Daniel A. Abolafia
Mohammad Norouzi
Jonathan Shen
Rui Zhao
Quoc V. Le
107
69
0
10 Jan 2018
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
125
53
0
10 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
121
37
0
09 Jan 2018
DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation
Lex Fridman
Jack Terwilliger
Benedikt Jenik
87
24
0
09 Jan 2018
Building Generalizable Agents with a Realistic and Rich 3D Environment
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
3DV
144
339
0
07 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
361
8,474
0
04 Jan 2018