ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Adversarial Reinforced Instruction Attacker for Robust Vision-Language
  Navigation
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
89
16
0
23 Jul 2021
Structured second-order methods via natural gradient descent
Structured second-order methods via natural gradient descent
Wu Lin
Frank Nielsen
Mohammad Emtiyaz Khan
Mark Schmidt
ODL
60
10
0
22 Jul 2021
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
Wentao Bao
Qi Yu
Yu Kong
FAtt
75
41
0
21 Jul 2021
MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement
  Learning and Procedurally Generated Environments
MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement Learning and Procedurally Generated Environments
Dimitrios I. Koutras
Athanasios Ch. Kapoutsis
A. Amanatiadis
Elias B. Kosmatopoulos
59
10
0
21 Jul 2021
Proximal Policy Optimization for Tracking Control Exploiting Future
  Reference Information
Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information
Jana Mayer
Johannes Westermann
Juan Pedro Gutiérrez H. Muriedas
Uwe Mettin
A. Lampe
OffRL
23
0
0
20 Jul 2021
Mastering Visual Continuous Control: Improved Data-Augmented
  Reinforcement Learning
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
133
353
0
20 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated
  Exploration
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schafer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
92
23
0
19 Jul 2021
Structured World Belief for Reinforcement Learning in POMDP
Structured World Belief for Reinforcement Learning in POMDP
Gautam Singh
Skand Peri
Junghyun Kim
Hyunseok Kim
Sungjin Ahn
OCL
81
28
0
19 Jul 2021
Greedification Operators for Policy Optimization: Investigating Forward
  and Reverse KL Divergences
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
90
31
0
17 Jul 2021
High-Accuracy Model-Based Reinforcement Learning, a Survey
High-Accuracy Model-Based Reinforcement Learning, a Survey
Aske Plaat
W. Kosters
Mike Preuss
OffRL
75
37
0
17 Jul 2021
Neighbor-view Enhanced Model for Vision and Language Navigation
Neighbor-view Enhanced Model for Vision and Language Navigation
Dongyan An
Yuankai Qi
Yan Huang
Qi Wu
Liang Wang
Tieniu Tan
LM&Ro
82
71
0
15 Jul 2021
NeuSaver: Neural Adaptive Power Consumption Optimization for Mobile
  Video Streaming
NeuSaver: Neural Adaptive Power Consumption Optimization for Mobile Video Streaming
Kyoungjun Park
Myungchul Kim
Laihyuk Park
21
3
0
15 Jul 2021
The Benchmark Lottery
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
117
92
0
14 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting
  Pot
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
93
111
0
14 Jul 2021
ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image
  Enhancement
ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image Enhancement
Rongkai Zhang
Lanqing Guo
Siyu Huang
Bihan Wen
OffRL
74
51
0
13 Jul 2021
Cautious Policy Programming: Exploiting KL Regularization in Monotonic
  Policy Improvement for Reinforcement Learning
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
OffRL
46
1
0
13 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
106
85
0
12 Jul 2021
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks
  for Image Denoising via Residual Recovery
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery
Rongkai Zhang
Jiang Zhu
Zhiyuan Zha
Justin Dauwels
Bihan Wen
75
6
0
12 Jul 2021
Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics
Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics
Ya-Chien Chang
Sicun Gao
94
59
0
11 Jul 2021
Coordinate-wise Control Variates for Deep Policy Gradients
Coordinate-wise Control Variates for Deep Policy Gradients
Yuanyi Zhong
Yuanshuo Zhou
Jian-wei Peng
BDL
88
1
0
11 Jul 2021
ARC: Adversarially Robust Control Policies for Autonomous Vehicles
ARC: Adversarially Robust Control Policies for Autonomous Vehicles
Sampo Kuutti
Saber Fallah
Richard Bowden
AAML
65
5
0
09 Jul 2021
Intelligent Link Adaptation for Grant-Free Access Cellular Networks: A
  Distributed Deep Reinforcement Learning Approach
Intelligent Link Adaptation for Grant-Free Access Cellular Networks: A Distributed Deep Reinforcement Learning Approach
Joao V. C. Evangelista
Zeeshan Sattar
Georges Kaddoum
Bassant Selim
Aydin Sarraf
27
2
0
08 Jul 2021
RMA: Rapid Motor Adaptation for Legged Robots
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
186
584
0
08 Jul 2021
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy
  Learning
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Yuexiang Zhai
Christina Baek
Zhengyuan Zhou
Jiantao Jiao
Yi-An Ma
85
23
0
08 Jul 2021
Analytically Tractable Hidden-States Inference in Bayesian Neural
  Networks
Analytically Tractable Hidden-States Inference in Bayesian Neural Networks
L. Nguyen
J. Goulet
BDL
35
6
0
08 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoVLM&Ro
101
0
0
07 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
106
69
0
06 Jul 2021
Effects of Smart Traffic Signal Control on Air Quality
Effects of Smart Traffic Signal Control on Air Quality
P. Fazzini
M. Torre
V. Rizza
F. Petracchini
18
4
0
06 Jul 2021
Collaborative Visual Navigation
Collaborative Visual Navigation
Haiyang Wang
Wenguan Wang
Xizhou Zhu
Jifeng Dai
Liwei Wang
EgoV
100
20
0
02 Jul 2021
User Role Discovery and Optimization Method based on K-means +
  Reinforcement learning in Mobile Applications
User Role Discovery and Optimization Method based on K-means + Reinforcement learning in Mobile Applications
Yuanbang Li
OffRL
19
2
0
02 Jul 2021
Reinforcement Learning for Feedback-Enabled Cyber Resilience
Reinforcement Learning for Feedback-Enabled Cyber Resilience
Yunhan Huang
Linan Huang
Quanyan Zhu
99
71
0
02 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
87
144
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and
  Neuroscience
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
117
8
0
30 Jun 2021
Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in
  Edge Industrial IoT
Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT
Wanlu Lei
Yu Ye
Ming Xiao
Mikael Skoglund
Zhu Han
45
1
0
30 Jun 2021
Understanding Adversarial Attacks on Observations in Deep Reinforcement
  Learning
Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning
You Qiaoben
Chengyang Ying
Xinning Zhou
Hang Su
Jun Zhu
Bo Zhang
AAML
109
17
0
30 Jun 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Deep Multiagent Reinforcement Learning: Challenges and Directions
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
116
97
0
29 Jun 2021
Globally Optimal Hierarchical Reinforcement Learning for
  Linearly-Solvable Markov Decision Processes
Globally Optimal Hierarchical Reinforcement Learning for Linearly-Solvable Markov Decision Processes
Guillermo Infante
Anders Jonsson
Vicencc Gómez
23
7
0
29 Jun 2021
Policy Regularization via Noisy Advantage Values for Cooperative
  Multi-agent Actor-Critic methods
Policy Regularization via Noisy Advantage Values for Cooperative Multi-agent Actor-Critic methods
Jian Hu
Siyue Hu
Shih-Wei Liao
118
15
0
27 Jun 2021
Graph Convolutional Memory using Topological Priors
Graph Convolutional Memory using Topological Priors
Steven D. Morad
Stephan Liwicki
Ryan Kortvelesy
R. Mecca
Amanda Prorok
27
0
0
27 Jun 2021
Core Challenges in Embodied Vision-Language Planning
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
144
48
0
26 Jun 2021
A nonlinear hidden layer enables actor-critic agents to learn multiple
  paired association navigation
A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation
M Ganesh Kumar
Cheston Tan
C. Libedinsky
S. Yen
A. Tan
44
5
0
25 Jun 2021
Mix and Mask Actor-Critic Methods
Mix and Mask Actor-Critic Methods
Dom Huh
29
1
0
24 Jun 2021
Policy Smoothing for Provably Robust Reinforcement Learning
Policy Smoothing for Provably Robust Reinforcement Learning
Aounon Kumar
Alexander Levine
Soheil Feizi
AAML
127
59
0
21 Jun 2021
Distributed Heuristic Multi-Agent Path Finding with Communication
Distributed Heuristic Multi-Agent Path Finding with Communication
Ziyuan Ma
Yudong Luo
Hang Ma
79
72
0
21 Jun 2021
Analytically Tractable Bayesian Deep Q-Learning
Analytically Tractable Bayesian Deep Q-Learning
Luong Ha
L. Nguyen
J. Goulet
BDLOffRL
35
2
0
21 Jun 2021
Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control
  with Scarce Data and Side Information
Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information
Franck Djeumou
Ufuk Topcu
43
4
0
19 Jun 2021
A Condense-then-Select Strategy for Text Summarization
A Condense-then-Select Strategy for Text Summarization
Hou Pong Chan
Irwin King
45
13
0
19 Jun 2021
Prediction-Free, Real-Time Flexible Control of Tidal Lagoons through
  Proximal Policy Optimisation: A Case Study for the Swansea Lagoon
Prediction-Free, Real-Time Flexible Control of Tidal Lagoons through Proximal Policy Optimisation: A Case Study for the Swansea Lagoon
Túlio Marcondes Moreira
Jackson Geraldo de Faria
Pedro O. S. Vaz de Melo
Luiz Chaimowicz
G. Medeiros-Ribeiro
35
10
0
18 Jun 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
MADE: Exploration via Maximizing Deviation from Explored Regions
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
96
44
0
18 Jun 2021
Towards Distraction-Robust Active Visual Tracking
Towards Distraction-Robust Active Visual Tracking
Fangwei Zhong
Peng Sun
Wenhan Luo
Tingyun Yan
Yizhou Wang
AAML
55
38
0
18 Jun 2021
Previous
123...313233...707172
Next