Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform
David Cotton
Z. Chaczko
22
2
0
27 Jan 2021
Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges
Xin Chen
Guannan Qu
Yujie Tang
S. Low
Na Li
88
241
0
27 Jan 2021
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
William H. Guss
Cayden R. Codel
Katja Hofmann
Brandon Houghton
Noburu Kuno
...
John Schulman
Manuela Veloso
Nicholay Topin
Avinash Ummadisingu
Phillip Wang
OffRL
90
65
0
26 Jan 2021
Finite Sample Analysis of Two-Time-Scale Natural Actor-Critic Algorithm
S. Khodadadian
Thinh T. Doan
Justin Romberg
S. T. Maguluri
101
43
0
26 Jan 2021
Accumulating Risk Capital Through Investing in Cooperation
Charlotte Roman
Michael Dennis
Andrew Critch
Stuart J. Russell
34
3
0
25 Jan 2021
A Survey on Active Deep Learning: From Model-driven to Data-driven
Peng Liu
Lizhe Wang
Guojin He
Lei Zhao
85
14
0
25 Jan 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
Juhyoung Lee
Sangyeob Kim
Sangjin Kim
Wooyoung Jo
H. Yoo
OffRL
63
10
0
24 Jan 2021
Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task
Xiaohan Zhang
Lu Liu
Guodong Long
Jing Jiang
Shenquan Liu
90
18
0
24 Jan 2021
Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline
Hu Wang
Hao Chen
Qi Wu
Congbo Ma
Yidong Li
Chunhua Shen
77
13
0
24 Jan 2021
Theory of Mind for Deep Reinforcement Learning in Hanabi
Andrew Fuchs
Michael Walton
Theresa Chadwick
Doug Lange
80
11
0
22 Jan 2021
Prior Preference Learning from Experts: Designing a Reward with Active Inference
Jinyoung Shin
Cheolhyeong Kim
H. Hwang
102
9
0
22 Jan 2021
Shielding Atari Games with Bounded Prescience
Mirco Giacobbe
Mohammadhosein Hasanbeig
Daniel Kroening
H. Wijk
74
23
0
20 Jan 2021
Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments
Daochen Zha
Wenye Ma
Lei Yuan
Helen Zhou
Ji Liu
140
44
0
20 Jan 2021
Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes
Xinhan Di
Pengqian Yu
3DV
AI4CE
43
4
0
19 Jan 2021
Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask
Kanata Suzuki
T. Ogata
96
2
0
18 Jan 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan D'Orazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
97
22
0
11 Jan 2021
Closing the Planning-Learning Loop with Application to Autonomous Driving
Panpan Cai
David Hsu
81
14
0
11 Jan 2021
Deep Reinforcement Learning with Function Properties in Mean Reversion Strategies
Sophia Gu
AIFin
13
3
0
09 Jan 2021
Coding for Distributed Multi-Agent Reinforcement Learning
Baoqian Wang
Junfei Xie
Nikolay Atanasov
78
4
0
07 Jan 2021
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
103
32
0
06 Jan 2021
A Survey of Deep RL and IL for Autonomous Driving Policy Learning
Zeyu Zhu
Huijing Zhao
149
160
0
06 Jan 2021
Reinforcement Learning with Latent Flow
Wenling Shang
Xiaofei Wang
A. Srinivas
Aravind Rajeswaran
Yang Gao
Pieter Abbeel
Michael Laskin
OffRL
80
23
0
06 Jan 2021
Reinforcement Learning based Collective Entity Alignment with Adaptive Features
Weixin Zeng
Xiang Zhao
Jiuyang Tang
Xuemin Lin
Paul Groth
88
55
0
05 Jan 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
131
75
0
01 Jan 2021
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach
Amin Nikanjam
Mohammad Mehdi Morovati
Foutse Khomh
Houssem Ben Braiek
113
33
0
01 Jan 2021
Autonomous Maintenance in IoT Networks via AoI-driven Deep Reinforcement Learning
G. Stamatakis
Nikolaos Pappas
Alexandros G. Fragkiadakis
A. Traganitis
31
12
0
31 Dec 2020
Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup
Han Shen
Jianchao Tan
Min-Fong Hong
Tianyi Chen
78
30
0
31 Dec 2020
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization
Yoav Alon
Huiyu Zhou
107
10
0
31 Dec 2020
Understanding Decoupled and Early Weight Decay
Johan Bjorck
Kilian Q. Weinberger
Carla P. Gomes
66
25
0
27 Dec 2020
Generation of Traffic Flows in Multi-Agent Traffic Simulation with Agent Behavior Model based on Deep Reinforcement Learning
Junjie Zhong
Hiromitsu Hattori
AI4CE
26
0
0
26 Dec 2020
Learning Vehicle Routing Problems using Policy Optimisation
N. Sultana
Jeffrey Chan
A. K. Qin
Tabinda Sarwar
52
5
0
24 Dec 2020
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
85
4
0
24 Dec 2020
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
Zelin Zhao
Chuang Gan
Jiajun Wu
Xiaoxiao Guo
J. Tenenbaum
OffRL
104
5
0
23 Dec 2020
Off-Policy Optimization of Portfolio Allocation Policies under Constraints
Nymisha Bandi
Theja Tulabandhula
OffRL
30
0
0
21 Dec 2020
Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a Swarm of Buoys
M. Kouzehgar
Malika Meghjani
Roland Bouffanais
45
24
0
21 Dec 2020
Difference Rewards Policy Gradients
Jacopo Castellini
Sam Devlin
F. Oliehoek
Rahul Savani
51
13
0
21 Dec 2020
High-Throughput Synchronous Deep RL
Iou-Jen Liu
Raymond A. Yeh
Alex Schwing
OffRL
71
12
0
17 Dec 2020
Planning from Pixels in Atari with Learned Symbolic Representations
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
94
11
0
16 Dec 2020
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching
Chang-rui Liu
Zetian Jiang
Runzhong Wang
Junchi Yan
Lingxiao Huang
Pinyan Lu
141
10
0
16 Dec 2020
Online Service Migration in Mobile Edge with Incomplete System Information: A Deep Recurrent Actor-Critic Learning Approach
Jin Wang
Jia Hu
Geyong Min
Qiang Ni
Tarek A. El-Ghazawi
88
31
0
16 Dec 2020
Sim-to-real reinforcement learning applied to end-to-end vehicle control
András Kalapos
Csaba Gór
Róbert Moni
I. Harmati
73
14
0
14 Dec 2020
Tutoring Reinforcement Learning via Feedback Control
F. D. Lellis
G. Russo
M. D. Bernardo
52
6
0
12 Dec 2020
Imitating Interactive Intelligence
Josh Abramson
Arun Ahuja
Iain Barr
Arthur Brussee
Federico Carnevale
...
Greg Wayne
Duncan Williams
Nathaniel Wong
Chen Yan
Rui Zhu
LM&Ro
111
71
0
10 Dec 2020
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search
Kyunghyun Lee
Byeong-uk Lee
Ukcheol Shin
In So Kweon
168
23
0
10 Dec 2020
Topological Planning with Transformers for Vision-and-Language Navigation
Kevin Chen
Junshen K. Chen
Jo Chuang
Nathan Tsoi
Silvio Savarese
LM&Ro
108
101
0
09 Dec 2020
Emergence of Different Modes of Tool Use in a Reaching and Dragging Task
K. Nguyen
Yoonsuck Choe
31
0
0
08 Dec 2020
Multi-agent navigation based on deep reinforcement learning and traditional pathfinding algorithm
Hong Qiu
AI4CE
46
6
0
05 Dec 2020
A Review of Designs and Applications of Echo State Networks
Chenxi Sun
Moxian Song
Linda Qiao
Hongyan Li
AAML
48
34
0
05 Dec 2020
Are Gradient-based Saliency Maps Useful in Deep Reinforcement Learning?
Matthias Rosynski
Frank Kirchner
Matias Valdenegro-Toro
FAtt
52
13
0
02 Dec 2020
General Characterization of Agents by States they Visit
Anssi Kanervisto
Tomi Kinnunen
Ville Hautamaki
62
3
0
02 Dec 2020