Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Soft policy optimization using dual-track advantage estimator
Yubo Huang
Xuechun Wang
Luobao Zou
Zhiwei Zhuang
Weidong Zhang
31
3
0
15 Sep 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL
DRL
403
346
0
14 Sep 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
102
24
0
14 Sep 2020
Multi-Agent Reinforcement Learning in Cournot Games
Yuanyuan Shi
Baosen Zhang
65
7
0
14 Sep 2020
Efficient Competitive Self-Play Policy Optimization
Yuanyi Zhong
Yuanshuo Zhou
Jian Peng
20
2
0
13 Sep 2020
Pow-Wow: A Dataset and Study on Collaborative Communication in Pommerman
Takuma Yoneda
Matthew R. Walter
Jason Naradowsky
LLMAG
18
1
0
13 Sep 2020
Guided Policy Search Based Control of a High Dimensional Advanced Manufacturing Process
A. Surana
Kishore K. Reddy
M. Siopis
AI4CE
27
2
0
12 Sep 2020
Phasic Policy Gradient
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
100
160
0
09 Sep 2020
Graph neural networks-based Scheduler for Production planning problems using Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Andreas Schwung
28
25
0
08 Sep 2020
Evolutionary Reinforcement Learning via Cooperative Coevolutionary Negatively Correlated Search
Hu Zhang
Peng Yang
Yang Yu
Mingjiang Li
K. Tang
126
21
0
08 Sep 2020
Detecting and adapting to crisis pattern with context based Deep Reinforcement Learning
Eric Benhamou
David Saltiel
Jean-Jacques Ohana
Jamal Atif
67
19
0
07 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
116
96
0
04 Sep 2020
Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling
Tsendsuren Munkhdalai
CLL
OffRL
54
4
0
03 Sep 2020
Action and Perception as Divergence Minimization
Danijar Hafner
Pedro A. Ortega
Jimmy Ba
Thomas Parr
Karl J. Friston
N. Heess
91
53
0
03 Sep 2020
Grounded Language Learning Fast and Slow
Felix Hill
O. Tieleman
Tamara von Glehn
Nathaniel Wong
Hamza Merzic
S. Clark
LM&Ro
172
81
0
03 Sep 2020
TAP-Net: Transport-and-Pack using Reinforcement Learning
Huang Ruizhen
XU Juzhan
Bin Chen
Minglun Gong
Hao Zhang
Hui Huang
71
26
0
03 Sep 2020
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun
Da Huo
Furong Huang
AAML
OffRL
OnRL
114
52
0
02 Sep 2020
Latency and Throughput Optimization in Modern Networks: A Comprehensive Survey
A. Mirzaeinia
Mehdi Mirzaeinia
A. Rezgui
53
6
0
01 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
69
179
0
01 Sep 2020
PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning
Tan Tang
Renzhong Li
Xinke Wu
Shuhan Liu
Johannes Knittel
Steffen Koch
Thomas Ertl
Lingyun Yu
Peiran Ren
Yingcai Wu
95
54
0
01 Sep 2020
Deep Reinforcement Learning for Contact-Rich Skills Using Compliant Movement Primitives
Oren Spector
M. Zacksenhouse
64
12
0
30 Aug 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
117
11
0
30 Aug 2020
Reinforcement Learning with Feedback-modulated TD-STDP
Stephen Chung
R. Kozma
33
3
0
29 Aug 2020
Real-world Video Adaptation with Reinforcement Learning
Hongzi Mao
Shannon Chen
Drew Dimmery
Shaun Singh
Drew Blaisdell
Yuandong Tian
Mohammad Alizadeh
E. Bakshy
OffRL
126
77
0
28 Aug 2020
AllenAct: A Framework for Embodied AI Research
Luca Weihs
Jordi Salvador
Klemen Kotar
Unnat Jain
Kuo-Hao Zeng
Roozbeh Mottaghi
Aniruddha Kembhavi
LM&Ro
AI4CE
80
75
0
28 Aug 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
58
2
0
27 Aug 2020
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
107
52
0
26 Aug 2020
Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning
Sam Blakeman
D. Mareschal
68
1
0
26 Aug 2020
Model-Free Episodic Control with State Aggregation
R. Pinto
OffRL
32
3
0
21 Aug 2020
NANCY: Neural Adaptive Network Coding methodologY for video distribution over wireless networks
Paresh Saxena
Mandan Naresh
Manik Gupta
Anirudh Achanta
S. Kota
Smrati Gupta
8
8
0
21 Aug 2020
Exploiting Scene-specific Features for Object Goal Navigation
Tommaso Campari
Paolo Eccher
Luciano Serafini
Lamberto Ballan
95
29
0
21 Aug 2020
Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads
Deepak Narayanan
Keshav Santhanam
Fiodar Kazhamiaka
Amar Phanishayee
Matei A. Zaharia
83
216
0
20 Aug 2020
Reinforcement Learning for Low-Thrust Trajectory Design of Interplanetary Missions
Alessandro Zavoli
Lorenzo Federici
35
7
0
19 Aug 2020
Towards Closing the Sim-to-Real Gap in Collaborative Multi-Robot Deep Reinforcement Learning
Wenshuai Zhao
Jorge Peña Queralta
Qingqing Li
Tomi Westerlund
64
28
0
18 Aug 2020
Ubiquitous Distributed Deep Reinforcement Learning at the Edge: Analyzing Byzantine Agents in Discrete Action Spaces
Wenshuai Zhao
Jorge Peña Queralta
Qingqing Li
Tomi Westerlund
80
6
0
18 Aug 2020
Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards
Umer Siddique
Paul Weng
Matthieu Zimmer
FaML
OffRL
65
88
0
18 Aug 2020
Learning Complex Multi-Agent Policies in Presence of an Adversary
Siddharth Ghiya
Katia Sycara
25
3
0
18 Aug 2020
A Survey of Deep Learning for Data Caching in Edge Network
Yantong Wang
V. Friderikos
90
28
0
17 Aug 2020
Generative Design by Reinforcement Learning: Enhancing the Diversity of Topology Optimization Designs
Seowoo Jang
Soyoung Yoo
Namwoo Kang
AI4CE
124
74
0
17 Aug 2020
Playing Catan with Cross-dimensional Neural Network
Quentin Gendre
Tomoyuki Kaneko
BDL
34
4
0
17 Aug 2020
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
71
12
0
15 Aug 2020
Explainability in Deep Reinforcement Learning
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
XAI
255
284
0
15 Aug 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
103
19
0
14 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
127
17
0
11 Aug 2020
Woodpecker-DL: Accelerating Deep Neural Networks via Hardware-Aware Multifaceted Optimizations
Yongchao Liu
Yue Jin
Yongqi Chen
Teng Teng
Hang Ou
Rui Zhao
Yao Zhang
110
1
0
11 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
82
72
0
08 Aug 2020
Convex Q-Learning, Part 1: Deterministic Optimal Control
P. Mehta
Sean P. Meyn
36
4
0
08 Aug 2020
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
49
5
0
07 Aug 2020
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals
Ozsel Kilinc
Giovanni Montana
66
5
0
05 Aug 2020
Robust Deep Reinforcement Learning through Adversarial Loss
Tuomas P. Oikarinen
Wang Zhang
Alexandre Megretski
Luca Daniel
Tsui-Wei Weng
AAML
90
97
0
05 Aug 2020
Previous
1
2
3
...
40
41
42
...
70
71
72
Next