1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing "Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,592 papers shown
Title
Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems
Xijun Li
Weilin Luo
Mingxuan Yuan
Jun Wang
Jiawen Lu
Jie Wang
Jinhu Lu
Jia Zeng
76
42
0
27 May 2021
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
118
134
0
25 May 2021
Transfer Learning and Curriculum Learning in Sokoban
Zhao Yang
Mike Preuss
Aske Plaat
OffRL
57
3
0
25 May 2021
Unbiased Asymmetric Reinforcement Learning under Partial Observability
Andrea Baisero
Chris Amato
OffRL
61
21
0
25 May 2021
Certification of Iterative Predictions in Bayesian Neural Networks
Matthew Wicker
Luca Laurenti
A. Patané
Nicola Paoletti
Alessandro Abate
Marta Z. Kwiatkowska
152
11
0
21 May 2021
Cross-domain Imitation from Observations
Dripta S. Raychaudhuri
S. Paul
J. Baar
Amit K. Roy-Chowdhury
OOD
93
45
0
20 May 2021
Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
44
9
0
20 May 2021
VTNet: Visual Transformer Network for Object Goal Navigation
Heming Du
Xin Yu
Liang Zheng
ViT
91
93
0
20 May 2021
Training Heterogeneous Features in Sequence to Sequence Tasks: Latent Enhanced Multi-filter Seq2Seq Model
Yunhao Yang
Zhaokun Xue
141
3
0
18 May 2021
Online Multimodal Transportation Planning using Deep Reinforcement Learning
A. Farahani
Laura Genga
R. Dijkman
OffRL
46
6
0
18 May 2021
RL-GRIT: Reinforcement Learning for Grammar Inference
Walt Woods
39
4
0
17 May 2021
Behavior-based Neuroevolutionary Training in Reinforcement Learning
Jörg Stork
Martin Zaefferer
Nils Eisler
Patrick Tichelmann
Thomas Bartz-Beielstein
A. E. Eiben
37
5
0
17 May 2021
Using Distributed Reinforcement Learning for Resource Orchestration in a Network Slicing Scenario
Federico Mason
G. Nencioni
Andrea Zanella
24
23
0
17 May 2021
DRAS-CQSim: A Reinforcement Learning based Framework for HPC Cluster Scheduling
Yuping Fan
Z. Lan
16
14
0
16 May 2021
A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement
José Jurandir Alves Esteves
Amina Boubendir
Fabrice Michel Guillemin
Pierre Sens
43
32
0
14 May 2021
Bootstrapping User and Item Representations for One-Class Collaborative Filtering
Dongha Lee
SeongKu Kang
Hyunjun Ju
Chanyoung Park
Hwanjo Yu
65
114
0
13 May 2021
A Survey on Reinforcement Learning-Aided Caching in Mobile Edge Networks
Nikolaos Nomikos
Spyros Zoupanos
Themistoklis Charalambous
I. Krikidis
Athina P. Petropulu
89
1
0
12 May 2021
Hierarchical RNNs-Based Transformers MADDPG for Mixed Cooperative-Competitive Environments
Xiaolong Wei
Lifang Yang
Xianglin Huang
Gang Cao
Zhulin Tao
Zhengyang Du
Jing An
59
6
0
11 May 2021
A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment
Petros Giannakopoulos
A. Pikrakis
Y. Cotronis
72
7
0
10 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Min Zhang
225
280
0
10 May 2021
PEARL: Parallelized Expert-Assisted Reinforcement Learning for Scene Rearrangement Planning
Hanqing Wang
Zan Wang
Wei Liang
L. Yu
35
1
0
10 May 2021
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
Haiyan Yin
96
0
0
09 May 2021
Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
81
35
0
09 May 2021
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies
Austin Nguyen
42
1
0
08 May 2021
Using reinforcement learning to design an AI assistant for a satisfying co-op experience
Ajay R Krishnan
N. Jyothish
Xun Jia
14
0
0
07 May 2021
UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
Denis Belomestny
I. Levin
Eric Moulines
A. Naumov
S. Samsonov
V. Zorina
OffRL
55
0
0
05 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
98
94
0
04 May 2021
Semantic Extractor-Paraphraser based Abstractive Summarization
Anubhav Jangra
Raghav Jain
Vaibhav Mavi
S. Saha
P. Bhattacharyya
56
6
0
04 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
169
88
0
03 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
56
8
0
03 May 2021
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores
Sheng-Chun Kao
T. Krishna
123
52
0
28 Apr 2021
End-to-End Intersection Handling using Multi-Agent Deep Reinforcement Learning
Alessandro Paolo Capasso
Paolo Maramotti
Anthony Dell'Eva
A. Broggi
120
18
0
28 Apr 2021
Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients
Bozhidar Vasilev
Tarun Gupta
Bei Peng
Shimon Whiteson
46
2
0
27 Apr 2021
ANT: Learning Accurate Network Throughput for Better Adaptive Video Streaming
Jiaoyang Yin
Yiling Xu
Hao Chen
Yunfei Zhang
S. Appleby
Zhan Ma
25
11
0
26 Apr 2021
Efficient Hyperparameter Optimization for Physics-based Character Animation
Zeshi Yang
Zhiqi Yin
AI4CE
91
9
0
26 Apr 2021
Learning Latent Graph Dynamics for Visual Manipulation of Deformable Objects
Xiao Ma
David Hsu
W. Lee
AI4CE
82
31
0
25 Apr 2021
Constraint-Guided Reinforcement Learning: Augmenting the Agent-Environment-Interaction
Helge Spieker
34
3
0
24 Apr 2021
Safe Chance Constrained Reinforcement Learning for Batch Process Control
M. Mowbray
Panagiotis Petsagkourakis
Ehecatl Antonio del Rio Chanona
Dongda Zhang
OffRL
73
37
0
23 Apr 2021
Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand Systems
Daniele Gammelli
Kaidi Yang
James Harrison
Filipe Rodrigues
Francisco Câmara Pereira
Marco Pavone
GNN
98
48
0
23 Apr 2021
Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data
Adrian Remonda
Sarah Krebs
Eduardo E. Veas
Granit Luzhnica
Roman Kern
OffRL
74
23
0
22 Apr 2021
A learning gap between neuroscience and reinforcement learning
Samuel T. Wauthier
Pietro Mazzaglia
Ozan Çatal
Cedric De Boom
Tim Verbelen
Bart Dhoedt
63
3
0
22 Apr 2021
CVLight: Decentralized Learning for Adaptive Traffic Signal Control with Connected Vehicles
Zhaobin Mo
Wangzhi Li
Yongjie Fu
Kangrui Ruan
Xuan Di
90
42
0
21 Apr 2021
Network Defense is Not a Game
Andres Molina-Markham
Ransom K. Winder
Ahmad Ridley
AAML
48
14
0
20 Apr 2021
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information
Jialu Li
Hao Hao Tan
Joey Tianyi Zhou
68
34
0
19 Apr 2021
Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior
Md Sultan al Nahian
Spencer Frazier
Brent Harrison
Mark O. Riedl
97
19
0
19 Apr 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDL
MoE
57
26
0
19 Apr 2021
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings
Eltayeb Ahmed
L. Zintgraf
Christian Schroeder de Witt
Nicolas Usunier
SSL
54
0
0
17 Apr 2021
Generalising Discrete Action Spaces with Conditional Action Trees
Christopher Bamford
Alvaro Ovalle
72
7
0
15 Apr 2021
GridToPix: Training Embodied Agents with Minimal Supervision
Unnat Jain
Iou-Jen Liu
Svetlana Lazebnik
Aniruddha Kembhavi
Luca Weihs
Alex Schwing
119
23
0
14 Apr 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
58
21
0
13 Apr 2021