Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Investigation of reinforcement learning for shape optimization of profile extrusion dies
C. Fricke
D. Wolff
Marco Kemmerling
S. Elgeti
OffRL
16
5
0
23 Dec 2022
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Aleksandar Krnjaic
Raul D. Steleac
Jonathan D. Thomas
Georgios Papoudakis
Lukas Schafer
...
Kuan-Ho Lao
Murat Cubuktepe
Matthew Haley
Peter Borsting
Stefano V. Albrecht
OffRL
86
18
0
22 Dec 2022
Lifelong Reinforcement Learning with Modulating Masks
Eseoghene Ben-Iwhiwhu
Saptarshi Nath
Praveen K. Pilly
Soheil Kolouri
Andrea Soltoggio
CLL
OffRL
100
23
0
21 Dec 2022
Reinforcement Learning for Agile Active Target Sensing with a UAV
Harshi Goel
Laura Jarin-Lipschitz
S. Agarwal
Sandeep Manjanna
Vijay Kumar
54
1
0
16 Dec 2022
Exploring Tradeoffs in Spiking Neural Networks
Florian Bacho
Dominique F. Chu
59
1
0
15 Dec 2022
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Yuandong Ding
Ming Feng
Guozi Liu
Wei Jiang
Wei Shen
Li Zhao
Lei Song
Houqiang Li
Yan Jin
Jiang Bian
71
16
0
15 Dec 2022
Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
51
12
0
14 Dec 2022
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
95
23
0
13 Dec 2022
Proximal Policy Optimization Based Reinforcement Learning for Joint Bidding in Energy and Frequency Regulation Markets
M. Anwar
Changlong Wang
F. D. Nijs
Hao Wang
23
14
0
13 Dec 2022
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration
Qisheng Zhang
Zhen Guo
A. Jøsang
Lance M. Kaplan
F. Chen
Dong-Ho Jeong
Jin-Hee Cho
44
0
0
13 Dec 2022
Reinforced Approximate Exploratory Data Analysis
Shaddy Garg
Subrata Mitra
Tong Yu
Yash Gadhia
A. Kashettiwar
43
6
0
12 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
70
0
0
10 Dec 2022
A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain Information
Jing Fang
Yinbo Yu
Zhongyuan Wang
Xin Ding
R. Hu
96
1
0
08 Dec 2022
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Cesare Caputo
Michel-Alexandre Cardin
Pudong Ge
Fei Teng
A. Korre
Ehecatl Antonio del Rio Chanona
39
18
0
08 Dec 2022
L2SR: Learning to Sample and Reconstruct for Accelerated MRI via Reinforcement Learning
Pu Yang
Bin Dong
OffRL
AI4TS
70
0
0
05 Dec 2022
Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost
Koshi Oishi
Yota Hashizume
Tomohiko Jimbo
Hirotaka Kaji
Kenji Kashima
48
2
0
05 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
76
13
0
01 Dec 2022
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Yizhou Zhang
Guannan Qu
Pan Xu
Yiheng Lin
Zaiwei Chen
Adam Wierman
96
26
0
30 Nov 2022
General policy mapping: online continual reinforcement learning inspired on the insect brain
A. Yanguas-Gil
Sandeep Madireddy
CLL
OnRL
133
0
0
30 Nov 2022
Real-time Bidding Strategy in Display Advertising: An Empirical Analysis
Mengjuan Liu
Zhengning Hu
Zhi Lai
Daiwei Zheng
Xuyun Nie
36
2
0
30 Nov 2022
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
M. Wolk
A. Applebaum
Camron Dennler
P. Dwyer
M. Moskowitz
...
N. Nichols
Nicole Park
Paul Rachwalski
Frank Rau
A. Webster
OffRL
AAML
93
18
0
28 Nov 2022
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
Yunchao Zhang
Zonglin Di
KAI-QING Zhou
Cihang Xie
Xin Eric Wang
FedML
AAML
88
2
0
27 Nov 2022
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang
Scott Sanner
Baher Abdulhai
69
5
0
26 Nov 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
124
34
0
24 Nov 2022
Explainable and Safe Reinforcement Learning for Autonomous Air Mobility
Lei Wang
Hongyu Yang
Yi Lin
S. Yin
Yuankai Wu
28
5
0
24 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
68
1
0
23 Nov 2022
Reinforcement learning for traffic signal control in hybrid action space
Haoqing Luo
Sheng Jin
80
7
0
23 Nov 2022
Predicting Topological Maps for Visual Navigation in Unexplored Environments
Huangying Zhan
Hamid Rezatofighi
Ian Reid
113
0
0
23 Nov 2022
Decision-making with Speculative Opponent Models
Jing-rong Sun
Shuo Chen
Cong Zhang
Yining Ma
Jie Zhang
74
1
0
22 Nov 2022
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
85
4
0
21 Nov 2022
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks
Zining Zhang
Bingsheng He
Zhenjie Zhang
52
5
0
21 Nov 2022
SafeLight: A Reinforcement Learning Method toward Collision-free Traffic Signal Control
Wenlu Du
J. Ye
Jingyi Gu
Jing Li
Hua Wei
Gui-Liu Wang
73
32
0
20 Nov 2022
Let Graph be the Go Board: Gradient-free Node Injection Attack for Graph Neural Networks via Reinforcement Learning
Mingxuan Ju
Yujie Fan
Chuxu Zhang
Yanfang Ye
AAML
120
38
0
19 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
111
0
0
19 Nov 2022
ProtSi: Prototypical Siamese Network with Data Augmentation for Few-Shot Subjective Answer Evaluation
Yining Lu
Jing Qiu
Gaurav Gupta
AAML
57
1
0
17 Nov 2022
Universal Distributional Decision-based Black-box Adversarial Attack with Reinforcement Learning
Yiran Huang
Yexu Zhou
Michael Hefenbrock
T. Riedel
Likun Fang
Michael Beigl
AAML
28
3
0
15 Nov 2022
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Yanli Liu
Kai Zhang
Tamer Basar
W. Yin
117
110
0
15 Nov 2022
Legged Locomotion in Challenging Terrains using Egocentric Vision
Ananye Agarwal
Ashish Kumar
Jitendra Malik
Deepak Pathak
96
216
0
14 Nov 2022
Towards Abstractive Timeline Summarisation using Preference-based Reinforcement Learning
Yuxuan Ye
Edwin Simpson
36
0
0
14 Nov 2022
Parallel Automatic History Matching Algorithm Using Reinforcement Learning
Omar S. Alolayan
Abdullah O. Alomar
John R. Williams
51
6
0
14 Nov 2022
PMR: Prototypical Modal Rebalance for Multimodal Learning
Yunfeng Fan
Wenchao Xu
Yining Qi
Junxiao Wang
Song Guo
74
73
0
14 Nov 2022
The Expertise Problem: Learning from Specialized Feedback
Oliver Daniels-Koch
Rachel Freedman
OffRL
70
18
0
12 Nov 2022
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
Yunpeng Qing
Shunyu Liu
Mingli Song
Huiqiong Wang
Mingli Song
XAI
93
1
0
12 Nov 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
65
15
0
10 Nov 2022
Coordinating CAV Swarms at Intersections with a Deep Learning Model
Jiawei Zhang
Sheng Li
Li Li
68
26
0
10 Nov 2022
Vision-based navigation and obstacle avoidance via deep reinforcement learning
P. Blum
Peter Crowley
G. Lykotrafitis
31
2
0
09 Nov 2022
Simulation-Based Parallel Training
Lucas Meyer
Alejandro Ribés
Bruno Raffin
AI4CE
74
2
0
08 Nov 2022
Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling
Ou Deng
Qun Jin
79
1
0
08 Nov 2022
Developing Decentralised Resilience to Malicious Influence in Collective Perception Problem
Christopher Wise
Aya Hussein
Heba El-Fiqi
24
0
0
06 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
77
8
0
06 Nov 2022
Previous
1
2
3
...
18
19
20
...
70
71
72
Next