ResearchTrend.AI
Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,552 papers shown
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning
Youssef Diouane
Aurelien Lucchi
Vihang Patil
29
3
0
21 Feb 2022
Cyber-Physical Defense in the Quantum Era
Michel Barbeau
Joaquín García
30
10
0
21 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Romain Laroche
Rémi Tachet des Combes
51
2
0
15 Feb 2022
QuadSim: A Quadcopter Rotational Dynamics Simulation Framework For Reinforcement Learning Algorithms
Burak Han Demirbilek
18
0
0
14 Feb 2022
On the Convergence of SARSA with Linear Function Approximation
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
26
10
0
14 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
18
0
0
14 Feb 2022
Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles
Suleman Qamar
Dr. Saddam Hussain Khan
Muhammad Arif Arshad
Maryam Qamar
Asifullah Khan
29
16
0
13 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
36
29
0
10 Feb 2022
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang
Xuezhou Zhang
Chengzhuo Ni
Mengdi Wang
OffRL
40
16
0
10 Feb 2022
Red Teaming Language Models with Language Models
Ethan Perez
Saffron Huang
Francis Song
Trevor Cai
Roman Ring
John Aslanides
Amelia Glaese
Nat McAleese
G. Irving
AAML
13
613
0
07 Feb 2022
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
27
2
0
04 Feb 2022
A Survey on Safety-Critical Driving Scenario Generation -- A Methodological Perspective
Wenhao Ding
Chejian Xu
Mansur Arief
Hao-ming Lin
Bo Li
Ding Zhao
37
146
0
04 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
50
250
0
03 Feb 2022
Reinforcement learning of optimal active particle navigation
Mahdi Nasiri
B. Liebchen
26
24
0
01 Feb 2022
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani
Andrea Zanelli
Andrea Martinelli
Tyler H. Summers
John Lygeros
41
14
0
01 Feb 2022
Warmth and competence in human-agent cooperation
Kevin R. McKee
Xuechunzi Bai
S. Fiske
39
26
0
31 Jan 2022
Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs
Tyler Cody
Abdul Rahman
Christopher Redino
Lanxiao Huang
Ryan Clark
Akshay Kakkar
Deepak Kushwaha
Paul Park
Peter A. Beling
Edward Bowen
32
14
0
28 Jan 2022
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
39
7
0
27 Jan 2022
Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence
Pengjin Wei
Kun Guo
Ye Li
Jue Wang
W. Feng
Shi Jin
Ning Ge
Ying-Chang Liang
33
45
0
27 Jan 2022
DNNFuser: Generative Pre-Trained Transformer as a Generalized Mapper for Layer Fusion in DNN Accelerators
Sheng-Chun Kao
Xiaoyu Huang
T. Krishna
AI4CE
40
9
0
26 Jan 2022
Learning Invariable Semantical Representation from Language for Extensible Policy Generalization
Yihan Li
Jinsheng Ren
Tianrun Xu
Tianren Zhang
Haichuan Gao
Feng Chen
26
1
0
26 Jan 2022
Online Attentive Kernel-Based Temporal Difference Learning
Guang Yang
Xingguo Chen
Shangdong Yang
Huihui Wang
Shaokang Dong
Yang Gao
OffRL
21
3
0
22 Jan 2022
Environment Generation for Zero-Shot Compositional Reinforcement Learning
Izzeddin Gur
Natasha Jaques
Yingjie Miao
Jongwook Choi
Manoj Kumar Tiwari
Honglak Lee
Aleksandra Faust
41
43
0
21 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
49
2
0
21 Jan 2022
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
K. Khamaru
Eric Xia
Martin J. Wainwright
Michael I. Jordan
37
5
0
21 Jan 2022
Profitable Strategy Design by Using Deep Reinforcement Learning for Trades on Cryptocurrency Markets
Mohsen Asgari
S. H. Khasteh
19
5
0
15 Jan 2022
Demystifying Reinforcement Learning in Time-Varying Systems
Pouya Hamadanian
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
50
1
0
14 Jan 2022
Bayesian sense of time in biological and artificial brains
Zafeirios Fountas
Alexey Zakharov
35
0
0
14 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
38
100
0
11 Jan 2022
Glance and Focus Networks for Dynamic Visual Recognition
Gao Huang
Yulin Wang
Kangchen Lv
Haojun Jiang
Wenhui Huang
Pengfei Qi
S. Song
3DH
79
49
0
09 Jan 2022
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
32
4
0
07 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
29
24
0
07 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
29
15
0
01 Jan 2022
Learning Based Task Offloading in Digital Twin Empowered Internet of Vehicles
Jinkai Zheng
Tom H. Luan
Longxiang Gao
Yao Zhang
Yuan Wu
28
14
0
28 Dec 2021
On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Anton Dereventsov
R. Vatsavai
Clayton Webster
33
5
0
24 Dec 2021
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
32
63
0
22 Dec 2021
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Xiu Li
OffRL
AI4CE
39
5
0
21 Dec 2021
Curriculum Based Reinforcement Learning of Grid Topology Controllers to Prevent Thermal Cascading
A. R. Ramapuram Matavalam
K. Guddanti
Yang Weng
V. Ajjarapu
AI4CE
27
14
0
18 Dec 2021
Symmetry-aware Neural Architecture for Embodied Visual Navigation
Shuang Liu
Takayuki Okatani
34
1
0
17 Dec 2021
DISTREAL: Distributed Resource-Aware Learning in Heterogeneous Systems
Martin Rapp
R. Khalili
Kilian Pfeiffer
J. Henkel
24
18
0
16 Dec 2021
Learning to track environment state via predictive autoencoding
Marian Andrecki
N. K. Taylor
13
0
0
14 Dec 2021
How to Learn and Represent Abstractions: An Investigation using Symbolic Alchemy
Badr AlKhamissi
Akshay Srinivasan
Zeb Kurth-Nelson
Samuel Ritter
36
1
0
14 Dec 2021
Towards Interactive Language Modeling
Maartje ter Hoeve
Evgeny Kharitonov
Dieuwke Hupkes
Emmanuel Dupoux
26
4
0
14 Dec 2021
Reinforced Abstractive Summarization with Adaptive Length Controlling
M. Song
Yi Feng
L. Jing
36
1
0
14 Dec 2021
Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module
Wei-Cheng Tseng
Wei Wei
Da-Cheng Juan
Min Sun
36
2
0
14 Dec 2021
VMAgent: Scheduling Simulator for Reinforcement Learning
Junjie Sheng
Shengliang Cai
Haochuan Cui
Wenhao Li
Yun Hua
...
Yiqiu Hu
Lei Zhu
Qian Peng
Hong Zha
Xiangfeng Wang
VLM
38
3
0
09 Dec 2021
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
31
168
0
08 Dec 2021
Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks
A. Mazayev
F. Al-Tam
N. Correia
41
5
0
07 Dec 2021
Godot Reinforcement Learning Agents
E. Beeching
Jilles Dibangoye
Olivier Simonin
Christian Wolf
GP
OnRL
24
5
0
07 Dec 2021