ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue
  Policy Learning
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
Kam-Fai Wong
91
65
0
31 Oct 2017
Action-depedent Control Variates for Policy Optimization via Stein's
  Identity
Action-depedent Control Variates for Policy Optimization via Stein's Identity
Hao Liu
Yihao Feng
Yi Mao
Dengyong Zhou
Jian-wei Peng
Qiang Liu
94
4
0
30 Oct 2017
Predicting Head Movement in Panoramic Video: A Deep Reinforcement
  Learning Approach
Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach
Yuhang Song
Mai Xu
Jianyi Wang
Minglang Qiao
Liangyu Huo
Zulin Wang
105
207
0
30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep
  Reinforcement Learning
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
114
29
0
28 Oct 2017
Generalization Tower Network: A Novel Deep Neural Network Architecture
  for Multi-Task Learning
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning
Yuhang Song
Mai Xu
Songyang Zhang
Liangyu Huo
57
3
0
27 Oct 2017
Understanding Early Word Learning in Situated Artificial Agents
Understanding Early Word Learning in Situated Artificial Agents
Felix Hill
S. Clark
Karl Moritz Hermann
Phil Blunsom
LM&Ro
93
32
0
26 Oct 2017
DoShiCo Challenge: Domain Shift in Control Prediction
DoShiCo Challenge: Domain Shift in Control Prediction
Klaas Kelchtermans
Tinne Tuytelaars
22
0
0
26 Oct 2017
Meta Learning Shared Hierarchies
Meta Learning Shared Hierarchies
Kevin Frans
Jonathan Ho
Xi Chen
Pieter Abbeel
John Schulman
84
355
0
26 Oct 2017
Consequentialist conditional cooperation in social dilemmas with
  imperfect information
Consequentialist conditional cooperation in social dilemmas with imperfect information
A. Peysakhovich
Adam Lerer
89
65
0
19 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of
  Robots by Deep Reinforcement Learning
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning
A. Kume
Eiichi Matsumoto
K. Takahashi
W. Ko
Jethro Tan
69
11
0
17 Oct 2017
Deep Imitation Learning for Complex Manipulation Tasks from Virtual
  Reality Teleoperation
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation
Tianhao Zhang
Zoe McCarthy
Owen Jow
Dennis Lee
Xi Chen
Ken Goldberg
Pieter Abbeel
SSL
156
663
0
12 Oct 2017
Arguing Machines: Human Supervision of Black Box AI Systems That Make
  Life-Critical Decisions
Arguing Machines: Human Supervision of Black Box AI Systems That Make Life-Critical Decisions
Alex Fridman
Li Ding
Benedikt Jenik
B. Reimer
51
14
0
12 Oct 2017
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action
  Control
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action Control
Seungyul Han
Y. Sung
OffRL
33
8
0
12 Oct 2017
Emergent Complexity via Multi-Agent Competition
Emergent Complexity via Multi-Agent Competition
Trapit Bansal
J. Pachocki
Szymon Sidor
Ilya Sutskever
Igor Mordatch
86
392
0
10 Oct 2017
MSC: A Dataset for Macro-Management in StarCraft II
MSC: A Dataset for Macro-Management in StarCraft II
Huikai Wu
Yanqi Zong
Junge Zhang
Kaiqi Huang
59
16
0
09 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on
  Rough Terrain Challenge
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge
Doo Re Song
Chuanyu Yang
C. McGreavy
Zhibin Li
167
30
0
08 Oct 2017
Socially Compliant Navigation through Raw Depth Inputs with Generative
  Adversarial Imitation Learning
Socially Compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning
L. Tai
Jingwei Zhang
Ming-Yuan Liu
Wolfram Burgard
GAN
67
180
0
06 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
112
2,283
0
06 Oct 2017
Detecting Adversarial Attacks on Neural Network Policies with Visual
  Foresight
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight
Yen-Chen Lin
Ming-Yuan Liu
Min Sun
Jia-Bin Huang
AAML
104
49
0
02 Oct 2017
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative
  Multi-agent Reinforcement Learning
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning
Xiangxiang Chu
Hangjun Ye
74
56
0
01 Oct 2017
Vision-based deep execution monitoring
Vision-based deep execution monitoring
Francesco Puja
S. Grazioso
A. Tammaro
Valsamis Ntouskos
Marta Sanzari
F. Pirri
34
1
0
29 Sep 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
86
273
0
28 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
188
791
0
28 Sep 2017
Towards continuous control of flippers for a multi-terrain robot using
  deep reinforcement learning
Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning
Giuseppe Paolo
L. Tai
Ming-Yuan Liu
17
7
0
25 Sep 2017
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
Siyi Li
Tianbo Liu
Fangqiu Yi
Dit-Yan Yeung
Shaojie Shen
53
38
0
24 Sep 2017
Expanding Motor Skills through Relay Neural Networks
Expanding Motor Skills through Relay Neural Networks
Visak C. V. Kumar
Sehoon Ha
Chenxi Liu
27
2
0
22 Sep 2017
Avoidance of Manual Labeling in Robotic Autonomous Navigation Through
  Multi-Sensory Semi-Supervised Learning
Avoidance of Manual Labeling in Robotic Autonomous Navigation Through Multi-Sensory Semi-Supervised Learning
Junhong Xu
Shangyue Zhu
Hanqing Guo
Shaoen Wu
SSL
31
3
0
22 Sep 2017
Learning Human Behaviors for Robot-Assisted Dressing
Learning Human Behaviors for Robot-Assisted Dressing
Alexander Clegg
Wenhao Yu
Jie Tan
Charles C. Kemp
Greg Turk
Chenxi Liu
38
3
0
20 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative
  Adversarial Inverse Reinforcement Learning
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson
Wei-Di Chang
Pierre-Luc Bacon
David Meger
Joelle Pineau
Doina Precup
GAN
77
73
0
20 Sep 2017
Deep Reinforcement Learning that Matters
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
155
1,970
0
19 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
114
558
0
18 Sep 2017
Memory Augmented Control Networks
Memory Augmented Control Networks
Arbaaz Khan
Clark Zhang
Nikolay Atanasov
Konstantinos Karydis
Vijay Kumar
Daniel D. Lee
82
77
0
17 Sep 2017
The Uncertainty Bellman Equation and Exploration
The Uncertainty Bellman Equation and Exploration
Brendan O'Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
90
193
0
15 Sep 2017
Transforming Cooling Optimization for Green Data Center via Deep
  Reinforcement Learning
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
Yuanlong Li
Yonggang Wen
K. Guan
Dacheng Tao
AI4CE
88
180
0
15 Sep 2017
Shared Learning : Enhancing Reinforcement in $Q$-Ensembles
Shared Learning : Enhancing Reinforcement in QQQ-Ensembles
Rakesh R Menon
Balaraman Ravindran
33
0
0
14 Sep 2017
A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping
A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping
Debang Li
Huikai Wu
Junge Zhang
Kaiqi Huang
OffRL
56
9
0
14 Sep 2017
When Waiting is not an Option : Learning Options with a Deliberation
  Cost
When Waiting is not an Option : Learning Options with a Deliberation Cost
J. Harb
Pierre-Luc Bacon
Martin Klissarov
Doina Precup
71
150
0
14 Sep 2017
A Study of AI Population Dynamics with Million-agent Reinforcement
  Learning
A Study of AI Population Dynamics with Million-agent Reinforcement Learning
Yaodong Yang
Lantao Yu
Yiwei Bai
Jun Wang
Weinan Zhang
Ying Wen
Yong Yu
64
7
0
13 Sep 2017
Pre-training Neural Networks with Human Demonstrations for Deep
  Reinforcement Learning
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
3DHOffRL
84
58
0
12 Sep 2017
Deep Reinforcement Learning with Surrogate Agent-Environment Interface
Deep Reinforcement Learning with Surrogate Agent-Environment Interface
Songli Wang
Yutao Jing
27
1
0
12 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in
  TensorFlow
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow
Danijar Hafner
James Davidson
Vincent Vanhoucke
OffRL
59
49
0
08 Sep 2017
Prosocial learning agents solve generalized Stag Hunts better than
  selfish ones
Prosocial learning agents solve generalized Stag Hunts better than selfish ones
A. Peysakhovich
Adam Lerer
114
109
0
08 Sep 2017
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement
  Learning
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning
Simyung Chang
Y. Yoo
Jaeseok Choi
Nojun Kwak
OffRL
15
1
0
05 Sep 2017
Mean Actor Critic
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
92
45
0
01 Sep 2017
Deep Learning for Video Game Playing
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
101
208
0
25 Aug 2017
Learning the Enigma with Recurrent Neural Networks
Learning the Enigma with Recurrent Neural Networks
S. Greydanus
81
39
0
24 Aug 2017
Reinforcement Learning in POMDPs with Memoryless Options and
  Option-Observation Initiation Sets
Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets
Denis Steckelmacher
D. Roijers
Anna Harutyunyan
Peter Vrancx
Hélène Plisnier
A. Nowé
113
20
0
22 Aug 2017
Teaching UAVs to Race: End-to-End Regression of Agile Controls in
  Simulation
Teaching UAVs to Race: End-to-End Regression of Agile Controls in Simulation
Matthias Mueller
Vincent Casser
Neil G. Smith
D. L. Michels
Guohao Li
84
10
0
19 Aug 2017
Sim4CV: A Photo-Realistic Simulator for Computer Vision Applications
Sim4CV: A Photo-Realistic Simulator for Computer Vision Applications
Matthias Muller
Vincent Casser
Jean Lahoud
Neil G. Smith
Guohao Li
VGen
74
181
0
19 Aug 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
173
2,830
0
19 Aug 2017
Previous
123...676869707172
Next