Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
Zhiyuan Xu
Kun Wu
Zhengping Che
Jian Tang
Jieping Ye
CLL
OffRL
109
49
0
15 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
110
60
0
15 Oct 2020
Average Cost Optimal Control of Stochastic Systems Using Reinforcement Learning
J. Lai
J. Xiong
18
0
0
13 Oct 2020
Deep Reinforcement Learning and Transportation Research: A Comprehensive Review
Nahid Parvez Farazi
T. Ahamed
Limon Barua
Bo Zou
AI4TS
69
18
0
13 Oct 2020
FedAT: A High-Performance and Communication-Efficient Federated Learning System with Asynchronous Tiers
Zheng Chai
Yujing Chen
Ali Anwar
Liang Zhao
Yue Cheng
Huzefa Rangwala
FedML
82
124
0
12 Oct 2020
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC Placement Based on Availability and Energy Consumption
Guto Leoni Santos
Theo Lynn
J. Kelner
P. Endo
30
0
0
12 Oct 2020
Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification
Yulin Wang
Kangchen Lv
Rui Huang
Shiji Song
Le Yang
Gao Huang
3DH
65
151
0
11 Oct 2020
Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments
Zhi Wang
Chunlin Chen
D. Dong
51
12
0
09 Oct 2020
Learning Not to Learn: Nature versus Nurture in Silico
R. T. Lange
Henning Sprekeler
80
10
0
09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Philippe Preux
OffRL
72
1
0
09 Oct 2020
Q-learning with Language Model for Edit-based Unsupervised Summarization
Ryosuke Kohita
Akifumi Wachi
Yang Zhao
Ryuki Tachibana
KELM
52
4
0
09 Oct 2020
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
K. Murugesan
Mattia Atzeni
Pavan Kapanipathi
Pushkar Shukla
Yara Rizk
Gerald Tesauro
Kartik Talamadupula
Mrinmaya Sachan
Murray Campbell
LM&Ro
LLMAG
OffRL
77
56
0
08 Oct 2020
Maximum Reward Formulation In Reinforcement Learning
S. Gottipati
Yashaswi Pathak
Rohan Nuttall
Sahir
Raviteja Chunduru
Ahmed Touati
Sriram Ganapathi Subramanian
Matthew E. Taylor
Sarath Chandar
118
14
0
08 Oct 2020
Regularized Inverse Reinforcement Learning
Wonseok Jeon
Chen-Yang Su
Paul Barde
T. Doan
Derek Nowrouzezahrai
Joelle Pineau
76
12
0
07 Oct 2020
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
96
5
0
07 Oct 2020
From Language Games to Drawing Games
Chrisantha Fernando
D. Zenkova
Stanislav Nikolov
Simon Osindero
73
4
0
06 Oct 2020
Learning Diverse Options via InfoMax Termination Critic
Yuji Kanagawa
Tomoyuki Kaneko
66
1
0
06 Oct 2020
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Minki Kang
Moonsu Han
Sung Ju Hwang
OOD
81
18
0
06 Oct 2020
Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping
Ceyer Wakilpoor
Patrick J. Martin
Carrie Rebhuhn
Amanda Vu
67
22
0
06 Oct 2020
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Jonathan Gray
Adam Lerer
A. Bakhtin
Noam Brown
123
51
0
06 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
236
877
0
05 Oct 2020
Offline Learning for Planning: A Summary
Giorgio Angelotti
Nicolas Drougard
Caroline Ponzoni Carvalho Chanel
OffRL
51
4
0
05 Oct 2020
The act of remembering: a study in partially observable reinforcement learning
Rodrigo Toro Icarte
Richard Valenzano
Toryn Q. Klassen
Phillip J. K. Christoffersen
Amir-massoud Farahmand
Sheila A. McIlraith
OffRL
40
11
0
05 Oct 2020
Mean-Variance Efficient Reinforcement Learning by Expected Quadratic Utility Maximization
Masahiro Kato
Kei Nakagawa
Kenshi Abe
Tetsuro Morimura
424
0
0
03 Oct 2020
Reinforcement Learning of Sequential Price Mechanisms
Gianluca Brero
Alon Eden
M. Gerstgrasser
David C. Parkes
Duncan Rheingans-Yoo
OffRL
60
18
0
02 Oct 2020
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
124
15
0
02 Oct 2020
Self-Play Reinforcement Learning for Fast Image Retargeting
Nobukatsu Kajiura
Satoshi Kosugi
Xueting Wang
T. Yamasaki
136
20
0
02 Oct 2020
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds
Prithviraj Ammanabrolu
Jack Urbanek
Margaret Li
Arthur Szlam
Tim Rocktaschel
Jason Weston
LM&Ro
121
44
0
01 Oct 2020
Toolpath design for additive manufacturing using deep reinforcement learning
M. Mozaffar
Ablodghani Ebrahimi
Jian Cao
AI4CE
40
7
0
30 Sep 2020
Cross Learning in Deep Q-Networks
Xing Wang
A. Vinel
25
2
0
29 Sep 2020
Trust-Region Method with Deep Reinforcement Learning in Analog Design Space Exploration
Kai-En Yang
Chia-Yu Tsai
Hung-Hao Shen
Chen-Feng Chiang
Feng-Ming Tsai
Chunguang Wang
Yiju Ting
Chia-Shun Yeh
C. Lai
53
14
0
29 Sep 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
58
11
0
29 Sep 2020
Enhancing Continuous Control of Mobile Robots for End-to-End Visual Active Tracking
Alessandro Devo
Alberto Dionigi
G. Costante
29
27
0
28 Sep 2020
Normalization Techniques in Training DNNs: Methodology, Analysis and Application
Lei Huang
Jie Qin
Yi Zhou
Fan Zhu
Li Liu
Ling Shao
AI4CE
176
278
0
27 Sep 2020
Lineage Evolution Reinforcement Learning
Zeyu Zhang
Guisheng Yin
29
0
0
26 Sep 2020
Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy Decomposition
Jaromír Janisch
Tomávs Pevný
Viliam Lisý
AI4CE
93
3
0
25 Sep 2020
Deep Reinforcement Learning with a Stage Incentive Mechanism of Dense Reward for Robotic Trajectory Planning
G. Peng
Jin Yang
Xinde Li
M. O. Khyam
47
11
0
25 Sep 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
265
743
0
24 Sep 2020
Neurocoder: Learning General-Purpose Computation Using Stored Neural Programs
Hung Le
Svetha Venkatesh
NAI
45
5
0
24 Sep 2020
Is Q-Learning Provably Efficient? An Extended Analysis
Kushagra Rastogi
Jonathan Lee
Fabrice Harel-Canada
Aditya Sunil Joglekar
OffRL
28
1
0
22 Sep 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
OffRL
55
5
0
21 Sep 2020
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization
Feng Tao
Yongcan Cao
101
2
0
21 Sep 2020
Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems
Yashesh D. Dhebar
Kalyanmoy Deb
S. Nageshrao
Ling Zhu
Dimitar Filev
68
16
0
20 Sep 2020
AI and Wargaming
J. Goodman
S. Risi
Simon Lucas
VLM
125
14
0
18 Sep 2020
Efficient Reinforcement Learning Development with RLzoo
Zihan Ding
Tianyang Yu
Yanhua Huang
Hongming Zhang
Guo Li
Quancheng Guo
Kai Zou
Hao Dong
OffRL
OnRL
44
6
0
18 Sep 2020
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos
Jie Wu
Guanbin Li
Xiaoguang Han
Liang Lin
OffRL
AI4TS
84
56
0
18 Sep 2020
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulations
Szymon Brych
Antoine Cully
67
4
0
17 Sep 2020
Energy-based Surprise Minimization for Multi-Agent Value Factorization
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
61
1
0
16 Sep 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL
LRM
160
606
0
16 Sep 2020
Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation
Wenhao Ding
Baiming Chen
Yue Liu
Kim Ji Eun
Ding Zhao
AAML
102
106
0
16 Sep 2020
Previous
1
2
3
...
39
40
41
...
70
71
72
Next