ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,592 papers shown
Title
Learning for Visual Navigation by Imagining the Success
Learning for Visual Navigation by Imagining the Success
M. Moghaddam
Ehsan Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton Van Den Hengel
66
3
0
28 Feb 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Michael Walton
Viliam Lisý
70
5
0
27 Feb 2021
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half
  Precision
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
Johan Bjorck
Xiangyu Chen
Christopher De Sa
Carla P. Gomes
Kilian Q. Weinberger
137
6
0
26 Feb 2021
Synthetic Returns for Long-Term Credit Assignment
Synthetic Returns for Long-Term Credit Assignment
David Raposo
Samuel Ritter
Adam Santoro
Greg Wayne
T. Weber
M. Botvinick
H. V. Hasselt
Francis Song
AI4TS
107
35
0
24 Feb 2021
Balancing Rational and Other-Regarding Preferences in
  Cooperative-Competitive Environments
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
68
5
0
24 Feb 2021
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with
  Quantization-Aware Training and Adaptive Parallelism
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism
Jenny Yang
Seongmin Hong
Joo-Young Kim
61
18
0
24 Feb 2021
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with
  Smaller Actors in Actor-Critic RL
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
OffRL
47
9
0
23 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
114
25
0
23 Feb 2021
Differentiable Logic Machines
Differentiable Logic Machines
Matthieu Zimmer
Xuening Feng
Claire Glanois
Zhaohui Jiang
Jianyi Zhang
Paul Weng
Li Dong
Hao Jianye
Liu Wulong
AI4CE
101
23
0
23 Feb 2021
Communication Efficient Parallel Reinforcement Learning
Communication Efficient Parallel Reinforcement Learning
Mridul Agarwal
Bhargav Ganguly
Vaneet Aggarwal
77
11
0
22 Feb 2021
Delayed Rewards Calibration via Reward Empirical Sufficiency
Delayed Rewards Calibration via Reward Empirical Sufficiency
Yixuan Liu
Hu Wang
Xiaowei Wang
Xiaoyue Sun
Liuyue Jiang
Minhui Xue
88
0
0
21 Feb 2021
How To Train Your HERON
How To Train Your HERON
Antoine Richard
Stéphanie Aravecchia
Thomas Schillaci
Matthieu Geist
C´edric Pradalier
41
3
0
20 Feb 2021
Decoupling Value and Policy for Generalization in Reinforcement Learning
Decoupling Value and Policy for Generalization in Reinforcement Learning
Roberta Raileanu
Rob Fergus
DRLOffRL
121
99
0
20 Feb 2021
On Proximal Policy Optimization's Heavy-tailed Gradients
On Proximal Policy Optimization's Heavy-tailed Gradients
Saurabh Garg
Joshua Zhanson
Emilio Parisotto
Adarsh Prasad
J. Zico Kolter
Zachary Chase Lipton
Sivaraman Balakrishnan
Ruslan Salakhutdinov
Pradeep Ravikumar
100
13
0
20 Feb 2021
Training a Resilient Q-Network against Observational Interference
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
73
15
0
18 Feb 2021
State Entropy Maximization with Random Encoders for Efficient
  Exploration
State Entropy Maximization with Random Encoders for Efficient Exploration
Younggyo Seo
Lili Chen
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
119
129
0
18 Feb 2021
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse
P. Schramowski
Martin Mundt
Alejandro Molina
Kristian Kersting
141
15
0
18 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for
  Urban Autonomous Driving
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
102
74
0
18 Feb 2021
Learning Memory-Dependent Continuous Control from Demonstrations
Learning Memory-Dependent Continuous Control from Demonstrations
Siqing Hou
Dongqi Han
Jun Tani
30
0
0
18 Feb 2021
Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in
  Indoor Graphics Scenes
Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in Indoor Graphics Scenes
Xinhan Di
Pengqian Yu
AI4CE3DV
92
12
0
18 Feb 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy
  Gradient Method
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
130
69
0
17 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade
  Execution
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
47
4
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
50
0
0
16 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems
A Survey of Machine Learning for Computer Architecture and Systems
Nan Wu
Yuan Xie
AI4TSAI4CE
108
153
0
16 Feb 2021
Scaling Multi-Agent Reinforcement Learning with Selective Parameter
  Sharing
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
Filippos Christianos
Georgios Papoudakis
Arrasy Rahman
Stefano V. Albrecht
94
121
0
15 Feb 2021
Tractable structured natural gradient descent using local
  parameterizations
Tractable structured natural gradient descent using local parameterizations
Wu Lin
Frank Nielsen
Mohammad Emtiyaz Khan
Mark Schmidt
146
30
0
15 Feb 2021
Reinforcement Learning for IoT Security: A Comprehensive Survey
Reinforcement Learning for IoT Security: A Comprehensive Survey
Aashma Uprety
D. Rawat
AAML
93
124
0
14 Feb 2021
Resilient Machine Learning for Networked Cyber Physical Systems: A
  Survey for Machine Learning Security to Securing Machine Learning for CPS
Resilient Machine Learning for Networked Cyber Physical Systems: A Survey for Machine Learning Security to Securing Machine Learning for CPS
Felix O. Olowononi
D. Rawat
Chunmei Liu
95
138
0
14 Feb 2021
Interactive Learning from Activity Description
Interactive Learning from Activity Description
Khanh Nguyen
Dipendra Kumar Misra
Robert Schapire
Miroslav Dudík
Patrick Shafto
115
35
0
13 Feb 2021
Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Michiel A. Bakker
Richard Everett
Laura Weidinger
Iason Gabriel
William S. Isaac
Joel Z Leibo
Edward Hughes
69
5
0
13 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Piotr Kozakowski
Lukasz Kaiser
Henryk Michalewski
Afroz Mohiuddin
Katarzyna Kañska
OffRL
81
5
0
12 Feb 2021
Deep Reinforcement Agent for Scheduling in HPC
Deep Reinforcement Agent for Scheduling in HPC
Yuping Fan
Z. Lan
T. Childers
Paul M. Rich
W. Allcock
M. Papka
57
37
0
11 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
144
42
0
10 Feb 2021
Improving Model-Based Reinforcement Learning with Internal State
  Representations through Self-Supervision
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision
Julien Scholz
C. Weber
Muhammad Burhan Hafez
S. Wermter
46
3
0
10 Feb 2021
Learning Equational Theorem Proving
Learning Equational Theorem Proving
Jelle Piepenbrock
Tom Heskes
Mikolávs Janota
Josef Urban
AIMatLRM
38
4
0
10 Feb 2021
Policy Augmentation: An Exploration Strategy for Faster Convergence of
  Deep Reinforcement Learning Algorithms
Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms
A. Mahyari
22
1
0
10 Feb 2021
Adaptive Pairwise Weights for Temporal Credit Assignment
Adaptive Pairwise Weights for Temporal Credit Assignment
Zeyu Zheng
Risto Vuorio
Richard L. Lewis
Satinder Singh
68
5
0
09 Feb 2021
Learning State Representations from Random Deep Action-conditional
  Predictions
Learning State Representations from Random Deep Action-conditional Predictions
Zeyu Zheng
Vivek Veeriah
Risto Vuorio
Richard L. Lewis
Satinder Singh
70
5
0
09 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
How to Stay Curious while Avoiding Noisy TVs using Aleatoric Uncertainty
  Estimation
How to Stay Curious while Avoiding Noisy TVs using Aleatoric Uncertainty Estimation
Augustine N. Mavor-Parker
K. Young
Caswell Barry
Lewis D. Griffin
95
22
0
08 Feb 2021
Adversarially Guided Actor-Critic
Adversarially Guided Actor-Critic
Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Philippe Preux
Matthieu Geist
77
73
0
08 Feb 2021
Towards Hierarchical Task Decomposition using Deep Reinforcement
  Learning for Pick and Place Subtasks
Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks
Luca Marzari
Ameya Pore
Diego DallÁlba
G. Aragon-Camarasa
Alessandro Farinelli
Paolo Fiorini
87
29
0
08 Feb 2021
Concentrated Document Topic Model
Concentrated Document Topic Model
Hao Lei
Ying Chen
31
1
0
06 Feb 2021
Rethinking the Implementation Tricks and Monotonicity Constraint in
  Cooperative Multi-Agent Reinforcement Learning
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Siyang Jiang
Seth Austin Harding
Haibin Wu
Shihua Liao
217
91
0
06 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense
  and dynamic scenarios
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios
Chengmin Zhou
Bingding Huang
Pasi Fränti
47
1
0
05 Feb 2021
Experience-Based Heuristic Search: Robust Motion Planning with Deep
  Q-Learning
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning
Julian Bernhard
Robert Gieselmann
Klemens Esterle
Alois Knoll
42
17
0
05 Feb 2021
A review of motion planning algorithms for intelligent robotics
A review of motion planning algorithms for intelligent robotics
Chengmin Zhou
Bingding Huang
Pasi Fränti
77
4
0
04 Feb 2021
Proactive and AoI-aware Failure Recovery for Stateful NFV-enabled
  Zero-Touch 6G Networks: Model-Free DRL Approach
Proactive and AoI-aware Failure Recovery for Stateful NFV-enabled Zero-Touch 6G Networks: Model-Free DRL Approach
Amirhossein Shaghaghi
Abolfazl Zakeri
Nader Mokari
M. Javan
M. Behdadfar
Eduard Axel Jorswieck
36
20
0
02 Feb 2021
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against
  Metamorphic Malware
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic Malware
Mohit Sewak
S. K. Sahay
Hemant Rathore
43
13
0
01 Feb 2021
Deep Reinforcement Learning Aided Monte Carlo Tree Search for MIMO
  Detection
Deep Reinforcement Learning Aided Monte Carlo Tree Search for MIMO Detection
Tz-Wei Mo
Ronald Y. Chang
Te-Yi Kan
64
1
0
30 Jan 2021
Previous
123...353637...707172
Next