ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Extragradient with player sampling for faster Nash equilibrium finding
Extragradient with player sampling for faster Nash equilibrium finding
Carles Domingo Enrich
Samy Jelassi
Carles Domingo-Enrich
Damien Scieur
A. Mensch
Joan Bruna
18
1
0
29 May 2019
CopyCAT: Taking Control of Neural Policies with Constant Attacks
CopyCAT: Taking Control of Neural Policies with Constant Attacks
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
AAML
54
31
0
29 May 2019
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Vihan Jain
Gabriel Ilharco
Alexander Ku
Ashish Vaswani
Eugene Ie
Jason Baldridge
LM&Ro
92
182
0
29 May 2019
Recurrent Existence Determination Through Policy Optimization
Recurrent Existence Determination Through Policy Optimization
Baoxiang Wang
45
1
0
29 May 2019
Snooping Attacks on Deep Reinforcement Learning
Snooping Attacks on Deep Reinforcement Learning
Matthew J. Inkawhich
Yiran Chen
Hai Helen Li
AAML
68
25
0
28 May 2019
Beyond Exponentially Discounted Sum: Automatic Learning of Return
  Function
Beyond Exponentially Discounted Sum: Automatic Learning of Return Function
Yufei Wang
Qiwei Ye
Tie-Yan Liu
OffRL
64
16
0
28 May 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep
  Reinforcement Learning
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning
Lu Chen
Zhi Chen
Bowen Tan
Sishan Long
Milica Gasic
Kai Yu
79
35
0
27 May 2019
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement
  Learning
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning
Caleb Chuck
Supawit Chockchowwat
S. Niekum
67
14
0
27 May 2019
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation
  from Video
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video
Miguel Jaques
Michael G. Burke
Timothy M. Hospedales
VGenPINN
133
45
0
27 May 2019
Disentangling Dynamics and Returns: Value Function Decomposition with
  Future Prediction
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
35
2
0
27 May 2019
Policy Search by Target Distribution Learning for Continuous Control
Policy Search by Target Distribution Learning for Continuous Control
Wei Shen
Yuanqi Li
Jian Li
60
6
0
27 May 2019
A Kernel Loss for Solving the Bellman Equation
A Kernel Loss for Solving the Bellman Equation
Yihao Feng
Lihong Li
Qiang Liu
89
70
0
25 May 2019
Continual Reinforcement Learning in 3D Non-stationary Environments
Continual Reinforcement Learning in 3D Non-stationary Environments
Vincenzo Lomonaco
Karan Desai
Eugenio Culurciello
Davide Maltoni
OffRL
49
42
0
24 May 2019
Flow-based Intrinsic Curiosity Module
Flow-based Intrinsic Curiosity Module
Hsuan-Kung Yang
Po-Han Chiang
Min-Fong Hong
Chun-Yi Lee
39
4
0
24 May 2019
Neural Temporal-Difference and Q-Learning Provably Converge to Global
  Optima
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
Qi Cai
Zhuoran Yang
Jason D. Lee
Zhaoran Wang
66
32
0
24 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire
  Evacuation Environment
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
76
70
0
23 May 2019
Recurrent Value Functions
Recurrent Value Functions
Pierre Thodoroff
N. Anand
Lucas Caccia
Doina Precup
Joelle Pineau
36
5
0
23 May 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
69
83
0
21 May 2019
Combining Experience Replay with Exploration by Random Network
  Distillation
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
61
15
0
18 May 2019
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation
Daniel Gordon
Abhishek Kadian
Devi Parikh
Judy Hoffman
Dhruv Batra
90
75
0
18 May 2019
Optimizing Sequential Medical Treatments with Auto-Encoding Heuristic
  Search in POMDPs
Optimizing Sequential Medical Treatments with Auto-Encoding Heuristic Search in POMDPs
Luchen Li
Matthieu Komorowski
Aldo A. Faisal
OffRL
139
13
0
17 May 2019
Leveraging exploration in off-policy algorithms via normalizing flows
Leveraging exploration in off-policy algorithms via normalizing flows
Bogdan Mazoure
T. Doan
A. Durand
R. Devon Hjelm
Joelle Pineau
OnRL
70
62
0
16 May 2019
Learning and Exploiting Multiple Subgoals for Fast Exploration in
  Hierarchical Reinforcement Learning
Learning and Exploiting Multiple Subgoals for Fast Exploration in Hierarchical Reinforcement Learning
Libo Xing
18
4
0
13 May 2019
Learning Phase Competition for Traffic Signal Control
Learning Phase Competition for Traffic Signal Control
Guanjie Zheng
Yuanhao Xiong
Xinshi Zang
J. Feng
Hua Wei
Huichu Zhang
Yong Li
Kai Xu
Z. Li
86
230
0
12 May 2019
Graph Attention Memory for Visual Navigation
Graph Attention Memory for Visual Navigation
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Bin Wang
Wulong Liu
Rasul Tutunov
Jun Wang
65
16
0
11 May 2019
Optimizing Routerless Network-on-Chip Designs: An Innovative
  Learning-Based Framework
Optimizing Routerless Network-on-Chip Designs: An Innovative Learning-Based Framework
Ting-Ru Lin
Drew Penney
Massoud Pedram
Lizhong Chen
3DV
55
8
0
11 May 2019
Do Autonomous Agents Benefit from Hearing?
Do Autonomous Agents Benefit from Hearing?
Abraham Woubie
Anssi Kanervisto
Janne Karttunen
Ville Hautamaki
50
8
0
10 May 2019
Design of Artificial Intelligence Agents for Games using Deep
  Reinforcement Learning
Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning
A. Roibu
63
1
0
10 May 2019
Attention-based Deep Reinforcement Learning for Multi-view Environments
Attention-based Deep Reinforcement Learning for Multi-view Environments
Elaheh Barati
Xuewen Chen
Z. Zhong
83
6
0
10 May 2019
Pretrain Soft Q-Learning with Imperfect Demonstrations
Pretrain Soft Q-Learning with Imperfect Demonstrations
Xiaoqin Zhang
Yunfei Li
Huimin Ma
Xiong Luo
OffRLOnRL
26
2
0
09 May 2019
Learning to Evolve
Learning to Evolve
Jan Schuchardt
Vladimir Golkov
Zorah Lähner
22
0
0
08 May 2019
Smoothing Policies and Safe Policy Gradients
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
80
31
0
08 May 2019
Meta-learning of Sequential Strategies
Meta-learning of Sequential Strategies
Pedro A. Ortega
Jane X. Wang
Mark Rowland
Tim Genewein
Z. Kurth-Nelson
...
Yee Whye Teh
H. V. Hasselt
Nando de Freitas
M. Botvinick
Shane Legg
OffRL
123
101
0
08 May 2019
Toybox: A Suite of Environments for Experimental Evaluation of Deep
  Reinforcement Learning
Toybox: A Suite of Environments for Experimental Evaluation of Deep Reinforcement Learning
Emma Tosch
Kaleigh Clary
John Foley
David D. Jensen
OffRL
49
9
0
07 May 2019
Continual and Multi-task Reinforcement Learning With Shared Episodic
  Memory
Continual and Multi-task Reinforcement Learning With Shared Episodic Memory
A. Sorokin
Andrey Kravchenko
KELMCLL
39
5
0
07 May 2019
A Complementary Learning Systems Approach to Temporal Difference
  Learning
A Complementary Learning Systems Approach to Temporal Difference Learning
Sam Blakeman
D. Mareschal
62
42
0
07 May 2019
DeepRMSA: A Deep Reinforcement Learning Framework for Routing,
  Modulation and Spectrum Assignment in Elastic Optical Networks
DeepRMSA: A Deep Reinforcement Learning Framework for Routing, Modulation and Spectrum Assignment in Elastic Optical Networks
Xiaoliang Chen
Baojia Li
R. Proietti
Hongbo Lu
Zuqing Zhu
S. Yoo
64
167
0
06 May 2019
P3O: Policy-on Policy-off Policy Optimization
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
93
56
0
05 May 2019
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient
  Backpropagation Through Categorical Variables
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Mingzhang Yin
Yuguang Yue
Mingyuan Zhou
66
23
0
04 May 2019
Information asymmetry in KL-regularized RL
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
67
104
0
03 May 2019
Collaborative Evolutionary Reinforcement Learning
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
66
100
0
02 May 2019
Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement
  Learning Approach
Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach
Marc Brittain
Peng Wei
DRL
52
36
0
02 May 2019
From Video Game to Real Robot: The Transfer between Action Spaces
From Video Game to Real Robot: The Transfer between Action Spaces
Janne Karttunen
Anssi Kanervisto
Ville Kyrki
Ville Hautamaki
92
8
0
02 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
149
73
0
29 Apr 2019
Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications
  Outside Coverage
Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications Outside Coverage
T. Şahin
R. Khalili
Mate Boban
A. Wolisz
OffRL
28
21
0
29 Apr 2019
Neural Logic Machines
Neural Logic Machines
Honghua Dong
Jiayuan Mao
Tian Lin
Chong-Jun Wang
Lihong Li
Denny Zhou
NAILRMAI4CE
141
250
0
26 Apr 2019
Target-Based Temporal Difference Learning
Target-Based Temporal Difference Learning
Donghwan Lee
Niao He
OOD
83
32
0
24 Apr 2019
Neural Logic Reinforcement Learning
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
103
75
0
24 Apr 2019
Stochastic Lipschitz Q-Learning
Xu Zhu
64
4
0
24 Apr 2019
Towards Combining On-Off-Policy Methods for Real-World Applications
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
33
2
0
24 Apr 2019
Previous
123...545556...707172
Next