ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Mastering Autonomous Assembly in Fusion Application with
  Learning-by-doing: a Peg-in-hole Study
Mastering Autonomous Assembly in Fusion Application with Learning-by-doing: a Peg-in-hole Study
Ruochen Yin
Huapeng Wu
Ming Li
Yong Cheng
Yu-jia Song
H. Handroos
34
0
0
24 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph
  Learning for Continuous Action Space
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
54
3
0
23 Aug 2022
Entropy Augmented Reinforcement Learning
Entropy Augmented Reinforcement Learning
Jianfei Ma
69
0
0
19 Aug 2022
Improving Post-Processing of Audio Event Detectors Using Reinforcement
  Learning
Improving Post-Processing of Audio Event Detectors Using Reinforcement Learning
Petros Giannakopoulos
A. Pikrakis
Y. Cotronis
59
3
0
19 Aug 2022
Global Convergence of Two-timescale Actor-Critic for Solving Linear
  Quadratic Regulator
Global Convergence of Two-timescale Actor-Critic for Solving Linear Quadratic Regulator
Xu-yang Chen
Jingliang Duan
Yingbin Liang
Lin Zhao
65
8
0
18 Aug 2022
Visual Explanation of Deep Q-Network for Robot Navigation by Fine-tuning
  Attention Branch
Visual Explanation of Deep Q-Network for Robot Navigation by Fine-tuning Attention Branch
Yuya Maruyama
Hiroshi Fukui
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
K. Sugiura
63
1
0
18 Aug 2022
A Deep Reinforcement Learning-based Adaptive Charging Policy for WRSNs
A Deep Reinforcement Learning-based Adaptive Charging Policy for WRSNs
Ngoc H. Bui
Phi Le Nguyen
Viet Anh Nguyen
Phan-Thuan Do
30
6
0
16 Aug 2022
A Policy Resonance Approach to Solve the Problem of Responsibility
  Diffusion in Multiagent Reinforcement Learning
A Policy Resonance Approach to Solve the Problem of Responsibility Diffusion in Multiagent Reinforcement Learning
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Xiaolin Ai
Wanmai Yuan
128
1
0
16 Aug 2022
On the Importance of Critical Period in Multi-stage Reinforcement
  Learning
On the Importance of Critical Period in Multi-stage Reinforcement Learning
Junseok Park
Inwoo Hwang
Min Whoo Lee
Hyunseok Oh
Minsu Lee
Youngki Lee
Byoung-Tak Zhang
OffRL
52
0
0
09 Aug 2022
Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics
  Driven Reinforcement Learning
Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning
Xin Jin
Shu Zhao
Le Zhang
Xin Zhao
Qiang Deng
Chaoen Xiao
EGVMCVBM
46
2
0
09 Aug 2022
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
Chang Yang
Ruiyu Wang
Xinrun Wang
Zhen Wang
OffRL
58
3
0
07 Aug 2022
Recurrent networks, hidden states and beliefs in partially observable
  environments
Recurrent networks, hidden states and beliefs in partially observable environments
Gaspard Lambrechts
Adrien Bolland
D. Ernst
79
14
0
06 Aug 2022
Human Decision Makings on Curriculum Reinforcement Learning with
  Difficulty Adjustment
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment
Yilei Zeng
Jiali Duan
Yongqian Li
Emilio Ferrara
Lerrel Pinto
Chloe Kuo
Stefanos Nikolaidis
92
3
0
04 Aug 2022
Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability
  Management Framework
Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework
Soumyadeep Hore
Ankit Shah
Nathaniel D. Bastian
36
17
0
03 Aug 2022
Bayesian regularization of empirical MDPs
Bayesian regularization of empirical MDPs
Samarth Gupta
Daniel N. Hill
Lexing Ying
Inderjit Dhillon
OffRL
42
0
0
03 Aug 2022
Digital Twin-Assisted Efficient Reinforcement Learning for Edge Task
  Scheduling
Digital Twin-Assisted Efficient Reinforcement Learning for Edge Task Scheduling
Xiucheng Wang
Longfei Ma
Haocheng Li
Zhisheng Yin
Tom H. Luan
Nan Cheng
OffRL
38
14
0
02 Aug 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to
  Cooperative MARL
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
77
21
0
02 Aug 2022
DashBot: Insight-Driven Dashboard Generation Based on Deep Reinforcement
  Learning
DashBot: Insight-Driven Dashboard Generation Based on Deep Reinforcement Learning
Dazhen Deng
Aoyu Wu
Huamin Qu
Yingcai Wu
105
37
0
02 Aug 2022
Search for or Navigate to? Dual Adaptive Thinking for Object Navigation
Search for or Navigate to? Dual Adaptive Thinking for Object Navigation
Ronghao Dang
Liuyi Wang
Zongtao He
Shuai Su
Chengju Liu
Qi Chen
70
18
0
01 Aug 2022
Reinforcement learning with experience replay and adaptation of action
  dispersion
Reinforcement learning with experience replay and adaptation of action dispersion
Pawel Wawrzyñski
Wojciech Masarczyk
M. Ostaszewski
22
1
0
30 Jul 2022
Improved Policy Optimization for Online Imitation Learning
Improved Policy Optimization for Online Imitation Learning
J. Lavington
Sharan Vaswani
Mark Schmidt
OffRL
91
6
0
29 Jul 2022
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement
  Learning with Domain Randomization
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization
Y. Kadokawa
Lingwei Zhu
Yoshihisa Tsurumine
Takamitsu Matsubara
60
8
0
29 Jul 2022
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via
  Best-Response Diversity
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity
Arrasy Rahman
Elliot Fosong
Ignacio Carlucho
Stefano V. Albrecht
103
10
0
28 Jul 2022
Unsupervised Frequent Pattern Mining for CEP
Unsupervised Frequent Pattern Mining for CEP
G. Shapira
Assaf Schuster
35
0
0
28 Jul 2022
SAC-AP: Soft Actor Critic based Deep Reinforcement Learning for Alert
  Prioritization
SAC-AP: Soft Actor Critic based Deep Reinforcement Learning for Alert Prioritization
Lalitha Chavali
Tanay Gupta
Paresh Saxena
AAML
11
7
0
27 Jul 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting
  Uncertain Outcomes
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
167
8
0
27 Jul 2022
Safe and Robust Experience Sharing for Deterministic Policy Gradient
  Algorithms
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
63
3
0
27 Jul 2022
Semi-analytical Industrial Cooling System Model for Reinforcement
  Learning
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Yuri Chervonyi
Praneet Dutta
Piotr Trochim
Octavian Voicu
Cosmin Paduraru
...
Jared Quincy Davis
R. Chippendale
Gautam Bajaj
Sims Witherspoon
Jerry Luo
AI4CE
84
12
0
26 Jul 2022
Optimizing Empty Container Repositioning and Fleet Deployment via
  Configurable Semi-POMDPs
Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs
Riccardo Poiani
Ciprian Stirbu
Alberto Maria Metelli
Marcello Restelli
25
1
0
25 Jul 2022
Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
Deborah Cohen
Moonkyung Ryu
Yinlam Chow
Orgad Keller
Ido Greenberg
...
Michael Fink
Yossi Matias
Idan Szpektor
Craig Boutilier
G. Elidan
OffRL
69
11
0
25 Jul 2022
Flowsheet synthesis through hierarchical reinforcement learning and
  graph neural networks
Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks
Laura Stops
Roel Leenhouts
Qitong Gao
Artur M. Schweidtmann
AI4CE
59
30
0
25 Jul 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
85
14
0
22 Jul 2022
Knowledge-enhanced Black-box Attacks for Recommendations
Knowledge-enhanced Black-box Attacks for Recommendations
Jingfan Chen
Wenqi Fan
Guanghui Zhu
Xiangyu Zhao
Chun Yuan
Qing Li
Jiaming Ji
MLAUAAML
74
52
0
21 Jul 2022
Minimum Description Length Control
Minimum Description Length Control
Theodore H. Moskovitz
Ta-Chu Kao
M. Sahani
M. Botvinick
80
1
0
17 Jul 2022
Robust AI Driving Strategy for Autonomous Vehicles
Robust AI Driving Strategy for Autonomous Vehicles
S. Nageshrao
Yousaf Rahman
V. Ivanovic
M. Janković
E. Tseng
M. Hafner
Dimitar Filev
81
5
0
16 Jul 2022
BCRLSP: An Offline Reinforcement Learning Framework for Sequential
  Targeted Promotion
BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion
Fanglin Chen
Xiao Liu
Bo Tang
Feiyu Xiong
Serim Hwang
Guomian Zhuang
OffRL
52
1
0
16 Jul 2022
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Ricard Durall
47
6
0
14 Jul 2022
Scheduling Out-of-Coverage Vehicular Communications Using Reinforcement
  Learning
Scheduling Out-of-Coverage Vehicular Communications Using Reinforcement Learning
T. Şahin
R. Khalili
Mate Boban
A. Wolisz
61
4
0
13 Jul 2022
Unsupervised Learning for Combinatorial Optimization with Principled
  Objective Relaxation
Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation
Haoyu Wang
Nan Wu
Hang Yang
Cong Hao
Pan Li
108
32
0
13 Jul 2022
Ablation Study of How Run Time Assurance Impacts the Training and
  Performance of Reinforcement Learning Agents
Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents
Nathaniel P. Hamilton
Kyle Dunlap
Taylor T. Johnson
Kerianne L. Hobbs
OffRL
78
8
0
08 Jul 2022
Storehouse: a Reinforcement Learning Environment for Optimizing
  Warehouse Management
Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management
Julen Cestero
M. Quartulli
Alberto Maria Metelli
Marcello Restelli
OffRL
52
7
0
08 Jul 2022
Stochastic optimal well control in subsurface reservoirs using
  reinforcement learning
Stochastic optimal well control in subsurface reservoirs using reinforcement learning
A. Dixit
A. Elsheikh
OOD
36
16
0
07 Jul 2022
Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement
  Learning for Tilting-Rotor Drones
Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement Learning for Tilting-Rotor Drones
Aswin Nahrendra
Christian Tirtawardhana
Byeong-Uk Yu
E. Lee
Hyunsam Myung
OffRL
61
9
0
07 Jul 2022
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent
  Reinforcement Learning
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
Lukas Schafer
Filippos Christianos
Amos Storkey
Stefano V. Albrecht
51
7
0
05 Jul 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual,
  Environment-Agnostic Representations
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
Jialu Li
Hao Tan
Joey Tianyi Zhou
LM&Ro
122
12
0
05 Jul 2022
Tackling Real-World Autonomous Driving using Deep Reinforcement Learning
Tackling Real-World Autonomous Driving using Deep Reinforcement Learning
Paolo Maramotti
Alessandro Paolo Capasso
Giulio Bacchiani
A. Broggi
60
11
0
05 Jul 2022
Resource Allocation in Multicore Elastic Optical Networks: A Deep
  Reinforcement Learning Approach
Resource Allocation in Multicore Elastic Optical Networks: A Deep Reinforcement Learning Approach
Juan Pinto-Ríos
F. Calderón
A. Leiva
Gabriel Hermosilla
A. Beghelli
Danilo Bórquez-Paredes
A. Lozada
N. Jara
R. Olivares
G. Saavedra
59
19
0
05 Jul 2022
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
Shunyu Liu
Kaixuan Chen
Na Yu
Mingli Song
Zunlei Feng
Mingli Song
71
1
0
05 Jul 2022
General Policy Evaluation and Improvement by Learning to Identify Few
  But Crucial States
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
Francesco Faccio
Aditya A. Ramesh
Vincent Herrmann
J. Harb
Jürgen Schmidhuber
OffRL
117
11
0
04 Jul 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded
  Language Agents
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAGLM&Ro
195
522
0
04 Jul 2022
Previous
123...212223...707172
Next