arXiv:1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,547 papers shown
Title
FedHQL: Federated Heterogeneous Q-Learning
Flint Xiaofeng Fan
Yining Ma
Zhongxiang Dai
Cheston Tan
Bryan Kian Hsiang Low
Roger Wattenhofer
FedML
24
7
0
26 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
32
8
0
26 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
42
124
0
19 Jan 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
35
16
0
18 Jan 2023
DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using Velocity Obstacles
Zhanteng Xie
P. Dames
46
61
0
16 Jan 2023
Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
33
8
0
13 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
35
2
0
10 Jan 2023
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
60
9
0
30 Dec 2022
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse over Wireless Communications
Wen-li Yu
Terence Jie Chua
Jun Zhao
OffRL
19
20
0
30 Dec 2022
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
28
2
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
21
0
0
27 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
21
17
0
22 Dec 2022
Reinforcement Learning for Agile Active Target Sensing with a UAV
Harshi Goel
Laura Jarin-Lipschitz
S. Agarwal
Sandeep Manjanna
Vijay Kumar
27
1
0
16 Dec 2022
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Yuandong Ding
Ming Feng
Guozi Liu
Wei Jiang
Chuheng Zhang
Li Zhao
Lei Song
Houqiang Li
Yan Jin
Jiang Bian
35
16
0
15 Dec 2022
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
54
21
0
13 Dec 2022
Proximal Policy Optimization Based Reinforcement Learning for Joint Bidding in Energy and Frequency Regulation Markets
M. Anwar
Changlong Wang
F. D. Nijs
Hao Wang
21
12
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
25
0
0
13 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
26
0
0
10 Dec 2022
A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain Information
Jing Fang
Yinbo Yu
Zhongyuan Wang
Xin Ding
R. Hu
32
1
0
08 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
33
13
0
01 Dec 2022
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Yizhou Zhang
Guannan Qu
Pan Xu
Yiheng Lin
Zaiwei Chen
Adam Wierman
44
26
0
30 Nov 2022
General policy mapping: online continual reinforcement learning inspired on the insect brain
A. Yanguas-Gil
Sandeep Madireddy
CLL
OnRL
21
0
0
30 Nov 2022
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Mengjuan Liu
Zhengning Hu
Zhi Lai
Daiwei Zheng
Xuyun Nie
24
2
0
30 Nov 2022
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
M. Wolk
A. Applebaum
Camron Dennler
P. Dwyer
M. Moskowitz
...
N. Nichols
Nicole Park
Paul Rachwalski
Frank Rau
A. Webster
OffRL
AAML
26
17
0
28 Nov 2022
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
Yunchao Zhang
Zonglin Di
KAI-QING Zhou
Cihang Xie
Xin Eric Wang
FedML
AAML
36
2
0
27 Nov 2022
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang
Scott Sanner
Baher Abdulhai
22
5
0
26 Nov 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
32
0
24 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Predicting Topological Maps for Visual Navigation in Unexplored Environments
Huangying Zhan
Hamid Rezatofighi
Ian Reid
49
0
0
23 Nov 2022
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
26
4
0
21 Nov 2022
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks
Zining Zhang
Bingsheng He
Zhenjie Zhang
14
5
0
21 Nov 2022
SafeLight: A Reinforcement Learning Method toward Collision-free Traffic Signal Control
Wenlu Du
J. Ye
Jingyi Gu
Jing Li
Hua Wei
Gui-Liu Wang
33
29
0
20 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
29
1
0
19 Nov 2022
Universal Distributional Decision-based Black-box Adversarial Attack with Reinforcement Learning
Yiran Huang
Yexu Zhou
Michael Hefenbrock
T. Riedel
Likun Fang
Michael Beigl
AAML
24
3
0
15 Nov 2022
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Yanli Liu
Kai Zhang
Tamer Basar
W. Yin
48
102
0
15 Nov 2022
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
34
207
0
14 Nov 2022
Parallel Automatic History Matching Algorithm Using Reinforcement Learning
Omar S. Alolayan
Abdullah O. Alomar
John R. Williams
33
6
0
14 Nov 2022
PMR: Prototypical Modal Rebalance for Multimodal Learning
Yunfeng Fan
Wenchao Xu
Yining Qi
Junxiao Wang
Song Guo
32
62
0
14 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
24
14
0
10 Nov 2022
Coordinating CAV Swarms at Intersections with a Deep Learning Model
Jiawei Zhang
Sheng Li
Li Li
42
26
0
10 Nov 2022
Vision-based navigation and obstacle avoidance via deep reinforcement learning
P. Blum
Peter Crowley
G. Lykotrafitis
24
2
0
09 Nov 2022
Simulation-Based Parallel Training
Lucas Meyer
Alejandro Ribés
Bruno Raffin
AI4CE
41
2
0
08 Nov 2022
Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling
Ou Deng
Qun Jin
22
1
0
08 Nov 2022
Developing Decentralised Resilience to Malicious Influence in Collective Perception Problem
Christopher Wise
Aya Hussein
Heba El-Fiqi
11
0
0
06 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
54
8
0
06 Nov 2022
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
23
52
0
03 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert W. Platt
OffRL
32
19
0
03 Nov 2022
Reinforcement Learning Applied to Trading Systems: A Survey
L. Felizardo
Francisco Caio Lima Paiva
Anna Helena Reali Costa
E. Del-Moral-Hernandez
AIFin
21
1
0
01 Nov 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
19
6
0
31 Oct 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
31
2
0
29 Oct 2022