arXiv:1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,552 papers shown
Title
Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly
Yuhang Gai
Jiwen Zhang
Dan Wu
Ken Chen
OffRL
32
1
0
24 Oct 2022
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
19
0
0
24 Oct 2022
Climate Change Policy Exploration using Reinforcement Learning
Theodore Wolf
29
0
0
23 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
39
6
0
22 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
27
8
0
21 Oct 2022
Self-Supervised Learning via Maximum Entropy Coding
Xin Liu
Zhongdao Wang
Yali Li
Shengjin Wang
SSL
39
41
0
20 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
29
5
0
20 Oct 2022
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Huiyun Cao
Wenlong Zou
Yinkun Wang
Ting Song
Mengjun Liu
AAML
56
5
0
19 Oct 2022
Commonsense Knowledge from Scene Graphs for Textual Environments
Tsunehiko Tanaka
Daiki Kimura
Michiaki Tatsubori
20
2
0
19 Oct 2022
ULN: Towards Underspecified Vision-and-Language Navigation
Weixi Feng
Tsu-Jui Fu
Yujie Lu
William Yang Wang
51
5
0
18 Oct 2022
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
34
8
0
15 Oct 2022
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang
Qifan Wang
Yi-Chia Wang
Maziar Sanjabi
Jingzhou Liu
Hamed Firooz
Hongning Wang
Shaoliang Nie
33
6
0
14 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
25
16
0
13 Oct 2022
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Anton Dereventsov
A. Bibin
24
1
0
12 Oct 2022
DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System
M. Modirrousta
M. A. Shoorehdeli
M. Yari
A. Ghahremani
24
2
0
12 Oct 2022
Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image
Zhaoxuan Zhang
Xiaoguang Han
B. Dong
Tong Li
Baocai Yin
Xin Yang
3DPC
3DV
33
8
0
12 Oct 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
Peng Mi
Li Shen
Tianhe Ren
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
Dacheng Tao
AAML
43
69
0
11 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
30
0
0
10 Oct 2022
Experiential Explanations for Reinforcement Learning
Amal Alabdulkarim
Madhuri Singh
Gennie Mansi
Kaely Hall
Mark O. Riedl
OffRL
43
3
0
10 Oct 2022
How to Enable Uncertainty Estimation in Proximal Policy Optimization
Eugene Bykovets
Yannick Metz
Mennatallah El-Assady
Daniel A. Keim
J. M. Buhmann
UQCV
16
1
0
07 Oct 2022
Self-Adaptive Driving in Nonstationary Environments through Conjectural Online Lookahead Adaptation
Tao Li
Haozhe Lei
Quanyan Zhu
31
11
0
06 Oct 2022
Deep Inventory Management
Dhruv Madeka
Kari Torkkola
Carson Eisenach
Anna Luo
Dean Phillips Foster
Sham M. Kakade
BDL
45
15
0
06 Oct 2022
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
24
45
0
06 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
50
3
0
06 Oct 2022
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning
Carlos M. Casas
B. Carro
Antonio J. Sánchez-Esguevillas
22
1
0
06 Oct 2022
Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios
Zhili Zhang
Songyang Han
Jiangwei Wang
Fei Miao
43
19
0
05 Oct 2022
Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning
Mehdi Dadvar
Rashmeet Kaur Nayyar
Siddharth Srivastava
24
0
0
04 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
31
240
0
03 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRL
MoE
22
6
0
03 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman Serdar Kozat
26
4
0
01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
31
3
0
01 Oct 2022
Reward Shaping for User Satisfaction in a REINFORCE Recommender
Konstantina Christakopoulou
Can Xu
Sai Zhang
Sriraj Badam
Trevor Potter
...
Ya Le
Chris Berg
E. B. Dixon
Ed H. Chi
Minmin Chen
OffRL
25
8
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
21
40
0
29 Sep 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM
LRM
61
270
0
29 Sep 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training
Gang Chen
Victoria Huang
OffRL
45
0
0
29 Sep 2022
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations
Marie Siew
Shikhar Sharma
Zekai Li
Kun Guo
Chao Xu
Tania Lorido-Botran
Tony Q.S. Quek
Carlee Joe-Wong
30
1
0
28 Sep 2022
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trębacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
239
507
0
28 Sep 2022
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Filippos Christianos
Georgios Papoudakis
Stefano V. Albrecht
35
4
0
28 Sep 2022
Resource Allocation for Mobile Metaverse with the Internet of Vehicles over 6G Wireless Communications: A Deep Reinforcement Learning Approach
Terence Jie Chua
Wen-li Yu
Jun Zhao
37
16
0
27 Sep 2022
Inverted Landing in a Small Aerial Robot via Deep Reinforcement Learning for Triggering and Control of Rotational Maneuvers
Bryan Habas
J. Langelaan
Bo Cheng
23
4
0
22 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
44
50
0
21 Sep 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
51
12
0
21 Sep 2022
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies
Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
36
0
0
21 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
36
5
0
21 Sep 2022
A Deep Reinforcement Learning-Based Charging Scheduling Approach with Augmented Lagrangian for Electric Vehicle
Guibin Chen
Xiaoying Shi
27
3
0
20 Sep 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
31
3
0
19 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
42
35
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
36
12
0
19 Sep 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
37
9
0
18 Sep 2022
Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
William Wong
Praneet Dutta
Octavian Voicu
Yuri Chervonyi
Cosmin Paduraru
Jerry Luo
OffRL
AI4CE
34
5
0
16 Sep 2022