Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Area-wide traffic signal control based on a deep graph Q-Network (DGQN) trained in an asynchronous manner
Gyeongjun Kim
Keemin Sohn
GNN
14
9
0
05 Aug 2020
Learning Transition Models with Time-delayed Causal Relations
Junchi Liang
Abdeslam Boularias
OffRL
47
3
0
04 Aug 2020
On The Plurality of Graphs
N. Fitzgerald
Jacopo Tagliabue
28
1
0
03 Aug 2020
Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version
Timo P. Gros
Daniel Holler
Jörg Hoffmann
V. Wolf
17
12
0
03 Aug 2020
Proximal Deterministic Policy Gradient
Marco Maggipinto
Gian Antonio Susto
Pratik Chaudhari
OffRL
41
5
0
03 Aug 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
91
43
0
02 Aug 2020
MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments
Zuxin Liu
Baiming Chen
Hongyi Zhou
G. Koushik
M. Hebert
Ding Zhao
AI4CE
107
89
0
30 Jul 2020
Understanding the Stability of Deep Control Policies for Biped Locomotion
Hwangpil Park
R. Yu
Yoonsang Lee
Kyungho Lee
Jehee Lee
52
9
0
30 Jul 2020
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
89
1
0
29 Jul 2020
Adaptive Bitrate Video Streaming for Wireless nodes: A Survey
Kamran Nishat
O. Gnawali
A. Abdelhadi
35
1
0
27 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
95
4
0
24 Jul 2020
Value-Decomposition Multi-Agent Actor-Critics
Jianyu Su
Stephen C. Adams
Peter A. Beling
135
106
0
24 Jul 2020
Bridging the Imitation Gap by Adaptive Insubordination
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
93
36
0
23 Jul 2020
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Rahul Jain
94
43
0
23 Jul 2020
Time Perception: A Review on Psychological, Computational and Robotic Models
Hamit Basgol
I. Ayhan
Emre Ugur
32
14
0
23 Jul 2020
Attend and Segment: Attention Guided Active Semantic Segmentation
Soroush Seifi
Tinne Tuytelaars
71
13
0
22 Jul 2020
Learning Object Relation Graph and Tentative Policy for Visual Navigation
Heming Du
Xin Yu
Liang Zheng
84
131
0
21 Jul 2020
Soft Expert Reward Learning for Vision-and-Language Navigation
Hu Wang
Qi Wu
Chunhua Shen
57
51
0
21 Jul 2020
Lagrangian Duality in Reinforcement Learning
Pranay Pasula
OffRL
30
0
0
20 Jul 2020
Quick Question: Interrupting Users for Microtasks with Reinforcement Learning
Bo-Jhang Ho
Bharathan Balaji
Mehmet Köseoğlu
S. Sandha
Siyou Pei
Mani B. Srivastava
37
6
0
18 Jul 2020
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
94
129
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
92
78
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
78
72
0
16 Jul 2020
Mixture of Step Returns in Bootstrapped DQN
Po-Han Chiang
Hsuan-Kung Yang
Zhang-Wei Hong
Chun-Yi Lee
45
4
0
16 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation
Hanqing Wang
Wenguan Wang
Tianmin Shu
Wei Liang
Jianbing Shen
145
73
0
15 Jul 2020
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Xianfu Chen
Celimuge Wu
Tao Chen
Zhi Liu
Honggang Zhang
M. Bennis
Hang Liu
Yusheng Ji
85
72
0
15 Jul 2020
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning
Sabrina Hoppe
Marc Toussaint
OffRL
56
7
0
15 Jul 2020
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
51
25
0
15 Jul 2020
Machine Learning for Offensive Security: Sandbox Classification Using Decision Trees and Artificial Neural Networks
William W. Pearce
Nick Landers
Nancy Fulda
11
4
0
14 Jul 2020
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
Faruk Küçüksubasi
Elif Surer
34
2
0
12 Jul 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization
Yimin Huang
Yujun Li
Hanrong Ye
Zhenguo Li
Zhihua Zhang
70
7
0
11 Jul 2020
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
Unnat Jain
Luca Weihs
Eric Kolve
Ali Farhadi
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
86
58
0
09 Jul 2020
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
110
163
0
08 Jul 2020
Tracking-by-Trackers with a Distilled and Reinforced Model
Matteo Dunnhofer
N. Martinel
C. Micheloni
VOT
OffRL
66
4
0
08 Jul 2020
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
66
3
0
06 Jul 2020
Integrating Distributed Architectures in Highly Modular RL Libraries
Albert Bou
Sebastian Dittert
Gianni De Fabritiis
76
0
0
06 Jul 2020
Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
Meng Zhou
Ziyu Liu
Pengwei Sui
Yixuan Li
Yuk Ying Chung
72
27
0
06 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
97
72
0
04 Jul 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
77
21
0
03 Jul 2020
Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity
Gonçalo M. Correia
Vlad Niculae
Wilker Aziz
André F. T. Martins
BDL
162
23
0
03 Jul 2020
Expected Eligibility Traces
H. V. Hasselt
Sephora Madjiheurem
Matteo Hessel
David Silver
André Barreto
Diana Borsa
64
38
0
03 Jul 2020
Towards Generalization and Data Efficient Learning of Deep Robotic Grasping
Zhixin Chen
Mengxiang Lin
Zhixin Jia
Shibo Jian
48
6
0
02 Jul 2020
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control
Jinqiu Guo
27
1
0
02 Jul 2020
Adaptive Discretization for Model-Based Reinforcement Learning
Sean R. Sinclair
Tianyu Wang
Gauri Jain
Siddhartha Banerjee
Chao Yu
OffRL
89
21
0
01 Jul 2020
Gradient Temporal-Difference Learning with Regularized Corrections
Sina Ghiassian
Andrew Patterson
Shivam Garg
Dhawal Gupta
Adam White
Martha White
177
42
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo DÉramo
Jan Peters
Joni Pajarinen
OffRL
81
11
0
01 Jul 2020
Robustifying the Deployment of tinyML Models for Autonomous mini-vehicles
Miguel de Prado
Manuele Rusci
Romain Donze
Alessandro Capotondi
Serge Monnerat
Luca Benini and
Nuria Pazos
97
40
0
01 Jul 2020
A Novel RL-assisted Deep Learning Framework for Task-informative Signals Selection and Classification for Spontaneous BCIs
Wonjun Ko
Eunjin Jeon
Heung-Il Suk
40
15
0
01 Jul 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL
AI4CE
121
164
0
30 Jun 2020
Lachesis: Automatic Partitioning for UDF-Centric Analytics
Jia Zou
A. Das
Pratik Barhate
Arun Iyengar
Binhang Yuan
Dimitrije Jankov
Chis Jermaine
24
4
0
30 Jun 2020
Previous
1
2
3
...
41
42
43
...
70
71
72
Next