ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
On The Transferability of Deep-Q Networks
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
83
2
0
06 Oct 2021
Approximate Newton policy gradient algorithms
Approximate Newton policy gradient algorithms
Haoya Li
Samarth Gupta
Hsiangfu Yu
Lexing Ying
Inderjit Dhillon
70
3
0
05 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery
  phantom
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
65
34
0
05 Oct 2021
Mapless Navigation: Learning UAVs Motion forExploration of Unknown
  Environments
Mapless Navigation: Learning UAVs Motion forExploration of Unknown Environments
Sunggoo Jung
David Hyunchul Shim
40
0
0
04 Oct 2021
Automating Privilege Escalation with Deep Reinforcement Learning
Automating Privilege Escalation with Deep Reinforcement Learning
Kalle Kujanpää
Willie Victor
Alexander Ilin
AAML
41
16
0
04 Oct 2021
Collective eXplainable AI: Explaining Cooperative Strategies and Agent
  Contribution in Multiagent Reinforcement Learning with Shapley Values
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
87
65
0
04 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL
  Implementations
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
34
8
0
03 Oct 2021
Batch size-invariance for policy optimization
Batch size-invariance for policy optimization
Jacob Hilton
K. Cobbe
John Schulman
120
14
0
01 Oct 2021
Learning the Markov Decision Process in the Sparse Gaussian Elimination
Learning the Markov Decision Process in the Sparse Gaussian Elimination
Yingshi Chen
36
1
0
30 Sep 2021
Reinforcement Learning for Classical Planning: Viewing Heuristics as
  Dense Reward Generators
Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators
Clement Gehring
Masataro Asai
Rohan Chitnis
Tom Silver
L. Kaelbling
Shirin Sohrabi
Michael Katz
OffRL
84
38
0
30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
91
8
0
29 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
135
60
0
28 Sep 2021
Exploring More When It Needs in Deep Reinforcement Learning
Exploring More When It Needs in Deep Reinforcement Learning
Youtian Guo
Qitong Gao
31
0
0
28 Sep 2021
Deep Reinforcement Learning with Adjustments
Deep Reinforcement Learning with Adjustments
H. Khorasgani
Haiyan Wang
Chetan Gupta
Susumu Serita
25
2
0
28 Sep 2021
Runtime Safety Assurance for Learning-enabled Control of Autonomous
  Driving Vehicles
Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles
Shengduo Chen
Yao Sun
Dachuan Li
Qiang Wang
Qi Hao
J. Sifakis
82
18
0
28 Sep 2021
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement
  Learning with Linear Value Function Approximation
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation
Anna Winnicki
Joseph Lubars
Michael Livesay
R. Srikant
74
3
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
319
91
0
27 Sep 2021
Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep
  Reinforcement Learning with Demonstration-like Sampled Exploration
Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration
Zhaorun Chen
Binhao Chen
S. Xie
Liang Gong
Chengliang Liu
Zhengfeng Zhang
Junping Zhang
OffRL
25
2
0
27 Sep 2021
Applying supervised and reinforcement learning methods to create
  neural-network-based agents for playing StarCraft II
Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II
Michal Opanowicz
30
0
0
26 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning
  Algorithms
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
99
42
0
25 Sep 2021
NICE: Robust Scheduling through Reinforcement Learning-Guided Integer
  Programming
NICE: Robust Scheduling through Reinforcement Learning-Guided Integer Programming
Luke Kenworthy
Siddharth Nayak
Christopher R. Chin
H. Balakrishnan
116
8
0
24 Sep 2021
A Graph Policy Network Approach for Volt-Var Control in Power
  Distribution Systems
A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems
Xian Yeow Lee
Soumik Sarkar
Yubo Wang
35
31
0
24 Sep 2021
The $f$-Divergence Reinforcement Learning Framework
The fff-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
68
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with
  On-Policy Experience
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
56
34
0
24 Sep 2021
ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic
  Malware SwarmGenerator
ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic Malware SwarmGenerator
Mohit Sewak
S. K. Sahay
Hemant Rathore
AAML
58
8
0
23 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
57
61
0
22 Sep 2021
Real Robot Challenge: A Robotics Competition in the Cloud
Real Robot Challenge: A Robotics Competition in the Cloud
Stefan Bauer
Felix Widmaier
M. Wuthrich
Annika Buchholz
Sebastian Stark
...
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
S. Redmond
Bernhard Schölkopf
61
12
0
22 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for
  Deterministic Actor-Critic Methods
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
73
12
0
22 Sep 2021
Benchmarking Lane-changing Decision-making for Deep Reinforcement
  Learning
Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
OffRL
26
1
0
22 Sep 2021
Context-Specific Representation Abstraction for Deep Option Learning
Context-Specific Representation Abstraction for Deep Option Learning
Marwa Abdulhai
Dong-Ki Kim
Matthew D Riemer
Miao Liu
Gerald Tesauro
Jonathan P. How
OffRL
92
10
0
20 Sep 2021
Multi-Agent Embodied Visual Semantic Navigation with Scene Prior
  Knowledge
Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge
Xinzhu Liu
Di Guo
Huaping Liu
F. Sun
EgoV
77
25
0
20 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural
  Language
A Survey of Text Games for Reinforcement Learning informed by Natural Language
P. Osborne
Heido Nomm
André Freitas
AI4CE
101
24
0
20 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for
  AI Research
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
78
76
0
17 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual
  Patterns
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns
Prasanth Buddareddygari
Travis Zhang
Yezhou Yang
Yi Ren
AAML
61
15
0
16 Sep 2021
DROMO: Distributionally Robust Offline Model-based Policy Optimization
DROMO: Distributionally Robust Offline Model-based Policy Optimization
Ruizhen Liu
Dazhi Zhong
Zhi-Cong Chen
OffRL
60
3
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
DSDF: An approach to handle stochastic agents in collaborative
  multi-agent reinforcement learning
DSDF: An approach to handle stochastic agents in collaborative multi-agent reinforcement learning
S. K. Perepu
Kaushik Dey
29
0
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic
  Reinforcement Learning and Global Convergence of Policy Gradient Methods
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
86
6
0
13 Sep 2021
Direct Advantage Estimation
Direct Advantage Estimation
Hsiao-Ru Pan
Nico Gürtler
Alexander Neitz
Bernhard Schölkopf
OffRLCML
62
13
0
13 Sep 2021
Computation Rate Maximum for Mobile Terminals in UAV-assisted Wireless
  Powered MEC Networks with Fairness Constraint
Computation Rate Maximum for Mobile Terminals in UAV-assisted Wireless Powered MEC Networks with Fairness Constraint
Xiaoyi Zhou
Liang Huang
Tong Ye
Weiqiang Sun
29
1
0
13 Sep 2021
Robust Stability of Neural Network-controlled Nonlinear Systems with
  Parametric Variability
Robust Stability of Neural Network-controlled Nonlinear Systems with Parametric Variability
Soumyabrata Talukder
Ratnesh Kumar
46
8
0
13 Sep 2021
Reinforcement Learning for Load-balanced Parallel Particle Tracing
Reinforcement Learning for Load-balanced Parallel Particle Tracing
Jiayi Xu
Hanqi Guo
Han-Wei Shen
Mukund Raj
Skylar W. Wurster
Tom Peterka
32
6
0
13 Sep 2021
Direct Random Search for Fine Tuning of Deep Reinforcement Learning
  Policies
Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies
Sean Gillen
Asutay Ozmen
Katie Byl
32
0
0
12 Sep 2021
Learning Selective Communication for Multi-Agent Path Finding
Learning Selective Communication for Multi-Agent Path Finding
Ziyuan Ma
Yudong Luo
Jia Pan
AI4CE
83
52
0
12 Sep 2021
Incentivizing an Unknown Crowd
Incentivizing an Unknown Crowd
Jing Dong
Shuai Li
Baoxiang Wang
OffRL
30
0
0
09 Sep 2021
On the Approximation of Cooperative Heterogeneous Multi-Agent
  Reinforcement Learning (MARL) using Mean Field Control (MFC)
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
Washim Uddin Mondal
Mridul Agarwal
Vaneet Aggarwal
S. Ukkusuri
134
44
0
09 Sep 2021
PowerGym: A Reinforcement Learning Environment for Volt-Var Control in
  Power Distribution Systems
PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems
Ting-Han Fan
Xian Yeow Lee
Yubo Wang
178
24
0
08 Sep 2021
On the impact of MDP design for Reinforcement Learning agents in
  Resource Management
On the impact of MDP design for Reinforcement Learning agents in Resource Management
Renato Luiz de Freitas Cunha
Luiz Chaimowicz
22
3
0
07 Sep 2021
Guiding Global Placement With Reinforcement Learning
Guiding Global Placement With Reinforcement Learning
Robert M. Kirby
Kolby Nottingham
Rajarshi Roy
Saad Godil
Bryan Catanzaro
28
2
0
06 Sep 2021
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning
Ning Wei
Jiahua Liang
Di Xie
Shiliang Pu
50
0
0
06 Sep 2021
Previous
123...293031...707172
Next