Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Matthew Aitchison
Penny Sweetser
Marcus Hutter
93
22
0
05 Oct 2022
Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning
Mehdi Dadvar
Rashmeet Kaur Nayyar
Siddharth Srivastava
47
0
0
04 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
107
250
0
03 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRL
MoE
76
7
0
03 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman S. Kozat
61
4
0
01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
60
3
0
01 Oct 2022
Emergent Communication: Generalization and Overfitting in Lewis Games
Mathieu Rita
Corentin Tallec
Paul Michel
Jean-Bastien Grill
Olivier Pietquin
Emmanuel Dupoux
Florian Strub
AI4CE
119
25
0
30 Sep 2022
Reward Shaping for User Satisfaction in a REINFORCE Recommender
Konstantina Christakopoulou
Can Xu
Sai Zhang
Sriraj Badam
Trevor Potter
...
Ya Le
Chris Berg
E. B. Dixon
Ed H. Chi
Minmin Chen
OffRL
32
9
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
39
45
0
29 Sep 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM
LRM
213
300
0
29 Sep 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training
Gang Chen
Victoria Huang
OffRL
111
1
0
29 Sep 2022
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations
Marie Siew
Shikhar Sharma
Zekai Li
Kun Guo
Chao Xu
Tania Lorido-Botran
Tony Q.S. Quek
Carlee Joe-Wong
61
1
0
28 Sep 2022
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trkebacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
324
538
0
28 Sep 2022
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Filippos Christianos
Georgios Papoudakis
Stefano V. Albrecht
93
4
0
28 Sep 2022
Resource Allocation for Mobile Metaverse with the Internet of Vehicles over 6G Wireless Communications: A Deep Reinforcement Learning Approach
Terence Jie Chua
Wen-li Yu
Jun Zhao
75
16
0
27 Sep 2022
Reinforcement Learning for Cognitive Delay/Disruption Tolerant Network Node Management in an LEO-based Satellite Constellation
Xue Sun
Chang‐Jiang Li
Lei Yan
Suzhi Cao
22
1
0
27 Sep 2022
Learning GFlowNets from partial episodes for improved convergence and stability
Kanika Madan
Jarrid Rector-Brooks
Maksym Korablyov
Emmanuel Bengio
Moksh Jain
A. Nica
Tom Bosc
Yoshua Bengio
Nikolay Malkin
100
103
0
26 Sep 2022
Deep Reinforcement Learning for Adaptive Mesh Refinement
C. Foucart
A. Charous
Pierre FJ Lermusiaux
AI4CE
81
23
0
25 Sep 2022
Reward Learning using Structural Motifs in Inverse Reinforcement Learning
Raeid Saqur
91
2
0
25 Sep 2022
On Efficient Reinforcement Learning for Full-length Game of StarCraft II
Ruo-Ze Liu
Zhen-Jia Pang
Zhou-Yu Meng
Wenhai Wang
Yang Yu
Tong Lu
OffRL
63
19
0
23 Sep 2022
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation
Jack D. Saunders
Sajad Saeedi
Wenbin Li
49
3
0
22 Sep 2022
Inverted Landing in a Small Aerial Robot via Deep Reinforcement Learning for Triggering and Control of Rotational Maneuvers
Bryan Habas
J. Langelaan
Bo Cheng
72
4
0
22 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
86
50
0
21 Sep 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
112
13
0
21 Sep 2022
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies
Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
73
0
0
21 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
85
5
0
21 Sep 2022
Soft Action Priors: Towards Robust Policy Transfer
M. Centa
Philippe Preux
OffRL
OnRL
22
1
0
20 Sep 2022
A Deep Reinforcement Learning-Based Charging Scheduling Approach with Augmented Lagrangian for Electric Vehicle
Guibin Chen
Xiaoying Shi
52
3
0
20 Sep 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
83
3
0
19 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
96
37
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
89
12
0
19 Sep 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
90
9
0
18 Sep 2022
Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
William Wong
Praneet Dutta
Octavian Voicu
Yuri Chervonyi
Cosmin Paduraru
Jerry Luo
OffRL
AI4CE
74
5
0
16 Sep 2022
Causal Coupled Mechanisms: A Control Method with Cooperation and Competition for Complex System
Xuehui Yu
Jingchi Jiang
Xinmiao Yu
Yi Guan
Xue Li
37
0
0
15 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
85
3
0
15 Sep 2022
Sparsity Inducing Representations for Policy Decompositions
Ashwin Khadke
H. Geyer
32
1
0
15 Sep 2022
Feature-Rich Long-term Bitcoin Trading Assistant
Jatin Nainani
Nirman Taterh
Md Ausaf Rashid
Ankit Khivasara
26
0
0
14 Sep 2022
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Augustine N. Mavor-Parker
Matthew J. Sargent
Christian Pehle
Andrea Banino
Lewis D. Griffin
Caswell Barry
73
1
0
14 Sep 2022
Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters
Soumyendu Sarkar
Vineet Gundecha
Sahand Ghorbanpour
Alexander Shmakov
Ashwin Ramesh Babu
Alexandre Frederic Julien Pichard
Mathieu Cocho
53
16
0
13 Sep 2022
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN Parameters
Vegard Edvardsen
Gard Spreemann
J. V. D. Abeele
OffRL
55
0
0
08 Sep 2022
On the Near-Optimality of Local Policies in Large Cooperative Multi-Agent Reinforcement Learning
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
81
5
0
07 Sep 2022
Obtaining Robust Control and Navigation Policies for Multi-Robot Navigation via Deep Reinforcement Learning
Christian Jestel
H. Surmann
Jonas Stenzel
Oliver Urbann
Marius Brehler
51
9
0
07 Sep 2022
Model-Free Deep Reinforcement Learning in Software-Defined Networks
Luke Borchjes
Clement N. Nyirenda
L. Leenen
46
1
0
03 Sep 2022
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLM
OffRL
195
189
0
01 Sep 2022
Deep Anomaly Detection and Search via Reinforcement Learning
Chao Chen
Dawei Wang
Feng Mao
Zongzhang Zhang
Yang Yu
50
0
0
31 Aug 2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Ju-Seung Byun
Andrew Perrault
OffRL
107
0
0
28 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSL
OffRL
105
13
0
27 Aug 2022
Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training
Zihui Wu
Haichang Gao
Bingqian Zhou
Xiaoyan Guo
Shudong Zhang
AAML
71
0
0
26 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
63
76
0
25 Aug 2022
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks
Paulina Stevia Nouwou Mindom
Amin Nikanjam
Foutse Khomh
OffRL
69
11
0
25 Aug 2022
Previous
1
2
3
...
20
21
22
...
70
71
72
Next