ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,547 papers shown
Title
FedHQL: Federated Heterogeneous Q-Learning
FedHQL: Federated Heterogeneous Q-Learning
Flint Xiaofeng Fan
Yining Ma
Zhongxiang Dai
Cheston Tan
Bryan Kian Hsiang Low
Roger Wattenhofer
FedML
24
7
0
26 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement
  Learning
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
32
8
0
26 Jan 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
42
124
0
19 Jan 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning
  Systems
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
35
16
0
18 Jan 2023
DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using
  Velocity Obstacles
DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using Velocity Obstacles
Zhanteng Xie
P. Dames
46
61
0
16 Jan 2023
Mean-Field Control based Approximation of Multi-Agent Reinforcement
  Learning in Presence of a Non-decomposable Shared Global State
Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
33
8
0
13 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
35
2
0
10 Jan 2023
Symbolic Visual Reinforcement Learning: A Scalable Framework with
  Object-Level Abstraction and Differentiable Expression Search
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
60
9
0
30 Dec 2022
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability
  Optimization in the Metaverse over Wireless Communications
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse over Wireless Communications
Wen-li Yu
Terence Jie Chua
Jun Zhao
OffRL
19
20
0
30 Dec 2022
Towards automating Codenames spymasters with deep reinforcement learning
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
28
2
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
21
0
0
27 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with
  Robotic and Human Co-Workers
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
21
17
0
22 Dec 2022
Reinforcement Learning for Agile Active Target Sensing with a UAV
Reinforcement Learning for Agile Active Target Sensing with a UAV
Harshi Goel
Laura Jarin-Lipschitz
S. Agarwal
Sandeep Manjanna
Vijay Kumar
27
1
0
16 Dec 2022
Multi-Agent Reinforcement Learning with Shared Resources for Inventory
  Management
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Yuandong Ding
Ming Feng
Guozi Liu
Wei Jiang
Chuheng Zhang
Li Zhao
Lei Song
Houqiang Li
Yan Jin
Jiang Bian
35
16
0
15 Dec 2022
Learning Robotic Navigation from Experience: Principles, Methods, and
  Recent Results
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
54
21
0
13 Dec 2022
Proximal Policy Optimization Based Reinforcement Learning for Joint
  Bidding in Energy and Frequency Regulation Markets
Proximal Policy Optimization Based Reinforcement Learning for Joint Bidding in Energy and Frequency Regulation Markets
M. Anwar
Changlong Wang
F. D. Nijs
Hao Wang
21
12
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
25
0
0
13 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
26
0
0
10 Dec 2022
A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain
  Information
A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain Information
Jing Fang
Yinbo Yu
Zhongyuan Wang
Xin Ding
R. Hu
32
1
0
08 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
33
13
0
01 Dec 2022
Global Convergence of Localized Policy Iteration in Networked
  Multi-Agent Reinforcement Learning
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Yizhou Zhang
Guannan Qu
Pan Xu
Yiheng Lin
Zaiwei Chen
Adam Wierman
44
26
0
30 Nov 2022
General policy mapping: online continual reinforcement learning inspired
  on the insect brain
General policy mapping: online continual reinforcement learning inspired on the insect brain
A. Yanguas-Gil
Sandeep Madireddy
CLL
OnRL
21
0
0
30 Nov 2022
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Mengjuan Liu
Zhengning Hu
Zhi Lai
Daiwei Zheng
Xuyun Nie
24
2
0
30 Nov 2022
Beyond CAGE: Investigating Generalization of Learned Autonomous Network
  Defense Policies
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
M. Wolk
A. Applebaum
Camron Dennler
P. Dwyer
M. Moskowitz
...
N. Nichols
Nicole Park
Paul Rachwalski
Frank Rau
A. Webster
OffRL
AAML
26
17
0
28 Nov 2022
Navigation as Attackers Wish? Towards Building Robust Embodied Agents
  under Federated Learning
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
Yunchao Zhang
Zonglin Di
KAI-QING Zhou
Cihang Xie
Xin Eric Wang
FedML
AAML
36
2
0
27 Nov 2022
A Critical Review of Traffic Signal Control and A Novel Unified View of
  Reinforcement Learning and Model Predictive Control Approaches for Adaptive
  Traffic Signal Control
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang
Scott Sanner
Baher Abdulhai
22
5
0
26 Nov 2022
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
32
0
24 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Predicting Topological Maps for Visual Navigation in Unexplored
  Environments
Predicting Topological Maps for Visual Navigation in Unexplored Environments
Huangying Zhan
Hamid Rezatofighi
Ian Reid
49
0
0
23 Nov 2022
Examining Policy Entropy of Reinforcement Learning Agents for
  Personalization Tasks
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
26
4
0
21 Nov 2022
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler
  for Neural Networks
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks
Zining Zhang
Bingsheng He
Zhenjie Zhang
14
5
0
21 Nov 2022
SafeLight: A Reinforcement Learning Method toward Collision-free Traffic
  Signal Control
SafeLight: A Reinforcement Learning Method toward Collision-free Traffic Signal Control
Wenlu Du
J. Ye
Jingyi Gu
Jing Li
Hua Wei
Gui-Liu Wang
33
29
0
20 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer
  Value Function
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
29
1
0
19 Nov 2022
Universal Distributional Decision-based Black-box Adversarial Attack
  with Reinforcement Learning
Universal Distributional Decision-based Black-box Adversarial Attack with Reinforcement Learning
Yiran Huang
Yexu Zhou
Michael Hefenbrock
T. Riedel
Likun Fang
Michael Beigl
AAML
24
3
0
15 Nov 2022
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural
  Policy Gradient Methods
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Yanli Liu
Kai Zhang
Tamer Basar
W. Yin
48
102
0
15 Nov 2022
Legged Locomotion in Challenging Terrains using Egocentric Vision
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
34
207
0
14 Nov 2022
Parallel Automatic History Matching Algorithm Using Reinforcement
  Learning
Parallel Automatic History Matching Algorithm Using Reinforcement Learning
Omar S. Alolayan
Abdullah O. Alomar
John R. Williams
33
6
0
14 Nov 2022
PMR: Prototypical Modal Rebalance for Multimodal Learning
PMR: Prototypical Modal Rebalance for Multimodal Learning
Yunfeng Fan
Wenchao Xu
Yining Qi
Junxiao Wang
Song Guo
32
62
0
14 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
24
14
0
10 Nov 2022
Coordinating CAV Swarms at Intersections with a Deep Learning Model
Coordinating CAV Swarms at Intersections with a Deep Learning Model
Jiawei Zhang
Sheng Li
Li Li
42
26
0
10 Nov 2022
Vision-based navigation and obstacle avoidance via deep reinforcement
  learning
Vision-based navigation and obstacle avoidance via deep reinforcement learning
P. Blum
Peter Crowley
G. Lykotrafitis
24
2
0
09 Nov 2022
Simulation-Based Parallel Training
Simulation-Based Parallel Training
Lucas Meyer
Alejandro Ribés
Bruno Raffin
AI4CE
41
2
0
08 Nov 2022
Policy-Based Reinforcement Learning for Assortative Matching in Human
  Behavior Modeling
Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling
Ou Deng
Qun Jin
22
1
0
08 Nov 2022
Developing Decentralised Resilience to Malicious Influence in Collective
  Perception Problem
Developing Decentralised Resilience to Malicious Influence in Collective Perception Problem
Christopher Wise
Aya Hussein
Heba El-Fiqi
11
0
0
06 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making
  in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
54
8
0
06 Nov 2022
A Survey on Reinforcement Learning in Aviation Applications
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
23
52
0
03 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial
  Observability
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert W. Platt
OffRL
32
19
0
03 Nov 2022
Reinforcement Learning Applied to Trading Systems: A Survey
Reinforcement Learning Applied to Trading Systems: A Survey
L. Felizardo
Francisco Caio Lima Paiva
Anna Helena Reali Costa
E. Del-Moral-Hernandez
AIFin
21
1
0
01 Nov 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
19
6
0
31 Oct 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight
  Grouping for Multi-Agent Reinforcement Learning
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
31
2
0
29 Oct 2022
Previous
123...789...293031
Next