ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time
  Budget
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
11
2
0
31 May 2017
Experience Replay Using Transition Sequences
Experience Replay Using Transition Sequences
Thommen George Karimpanal
Roland Bouffanais
OffRL
39
14
0
30 May 2017
End-to-end Active Object Tracking via Reinforcement Learning
End-to-end Active Object Tracking via Reinforcement Learning
Wenhan Luo
Peng Sun
Fangwei Zhong
Wei Liu
Yadong Mu
Yizhou Wang
95
86
0
30 May 2017
Constrained Policy Optimization
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
196
1,339
0
30 May 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Guan-Horng Liu
Avinash Siravuru
Sai P. Selvaraj
Manuela Veloso
George Kantor
89
69
0
30 May 2017
Convergent Tree Backup and Retrace with Function Approximation
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati
Pierre-Luc Bacon
Doina Precup
Pascal Vincent
106
40
0
25 May 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning
The Marginal Value of Adaptive Gradient Methods in Machine Learning
Ashia Wilson
Rebecca Roelofs
Mitchell Stern
Nathan Srebro
Benjamin Recht
ODL
125
1,035
0
23 May 2017
Enhanced Experience Replay Generation for Efficient Reinforcement
  Learning
Enhanced Experience Replay Generation for Efficient Reinforcement Learning
Vincent Huang
Tobias Ley
Martha Vlachou-Konchylaki
Wenfeng Hu
OnRLGANSyDa
41
10
0
23 May 2017
Visual Semantic Planning using Deep Successor Representations
Visual Semantic Planning using Deep Successor Representations
Yuke Zhu
Daniel Gordon
Eric Kolve
Dieter Fox
Li Fei-Fei
Abhinav Gupta
Roozbeh Mottaghi
Ali Farhadi
112
142
0
23 May 2017
Neural Network Memory Architectures for Autonomous Robot Navigation
Neural Network Memory Architectures for Autonomous Robot Navigation
Steven W. Chen
Nikolay Atanasov
Arbaaz Khan
Konstantinos Karydis
Daniel D. Lee
Vijay Kumar
51
7
0
23 May 2017
Pairwise Confusion for Fine-Grained Visual Classification
Pairwise Confusion for Fine-Grained Visual Classification
Abhimanyu Dubey
O. Gupta
Pei Guo
Ramesh Raskar
Ryan Farrell
Nikhil Naik
54
10
0
22 May 2017
A unified view of entropy-regularized Markov decision processes
A unified view of entropy-regularized Markov decision processes
Gergely Neu
Anders Jonsson
Vicencc Gómez
121
264
0
22 May 2017
Guide Actor-Critic for Continuous Control
Guide Actor-Critic for Continuous Control
Voot Tangkaratt
A. Abdolmaleki
Masashi Sugiyama
67
17
0
22 May 2017
Shallow Updates for Deep Reinforcement Learning
Shallow Updates for Deep Reinforcement Learning
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
OffRL
72
48
0
21 May 2017
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep
  Reinforcement Learning
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning
Sahil Sharma
J. GirishRaguvir
S. Ramesh
Balaraman Ravindran
38
6
0
21 May 2017
Learning to Factor Policies and Action-Value Functions: Factored Action
  Space Representations for Deep Reinforcement learning
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
56
36
0
20 May 2017
Relaxed Wasserstein with Applications to GANs
Relaxed Wasserstein with Applications to GANs
Xin Guo
Johnny Hong
Tianyi Lin
Nan Yang
GAN
114
35
0
19 May 2017
Atari games and Intel processors
Atari games and Intel processors
R. Adamski
T. Grel
Maciek Klimek
Henryk Michalewski
34
5
0
19 May 2017
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement
  Learning
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
Nat Dilokthanakul
Christos Kaplanis
Nick Pawlowski
Murray Shanahan
87
92
0
18 May 2017
Delving into adversarial attacks on deep policies
Delving into adversarial attacks on deep policies
Jernej Kos
Basel Alomair
AAML
72
228
0
18 May 2017
Probabilistically Safe Policy Transfer
Probabilistically Safe Policy Transfer
David Held
Zoe McCarthy
Michael Zhang
Fred Shentu
Pieter Abbeel
86
19
0
15 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRMSSL
183
2,456
0
15 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDLOffRL
92
121
0
14 May 2017
Efficient Parallel Methods for Deep Reinforcement Learning
Efficient Parallel Methods for Deep Reinforcement Learning
Alfredo V. Clemente
Humberto Nicolás Castejón Martínez
A. Chandra
85
115
0
13 May 2017
Metacontrol for Adaptive Imagination-Based Optimization
Metacontrol for Adaptive Imagination-Based Optimization
Jessica B. Hamrick
A. J. Ballard
Razvan Pascanu
Oriol Vinyals
N. Heess
Peter W. Battaglia
76
69
0
07 May 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural
  Networks for Environmental Awareness
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness
Nikolai Smolyanskiy
A. Kamenev
Jeffrey Smith
Stan Birchfield
144
223
0
07 May 2017
Traffic Light Control Using Deep Policy-Gradient and Value-Function
  Based Reinforcement Learning
Traffic Light Control Using Deep Policy-Gradient and Value-Function Based Reinforcement Learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
95
309
0
28 Apr 2017
Mapping Instructions and Visual Observations to Actions with
  Reinforcement Learning
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning
Dipendra Kumar Misra
John Langford
Yoav Artzi
86
247
0
28 Apr 2017
General Video Game AI: Learning from Screen Capture
General Video Game AI: Learning from Screen Capture
Kamolwan Kunanusont
Simon Lucas
Diego Perez-Liebana
60
20
0
23 Apr 2017
Equivalence Between Policy Gradients and Soft Q-Learning
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
132
349
0
21 Apr 2017
Beating Atari with Natural Language Guided Reinforcement Learning
Beating Atari with Natural Language Guided Reinforcement Learning
Russell Kaplan
Chris Sauer
A. Sosa
LM&Ro
86
69
0
18 Apr 2017
Investigating Recurrence and Eligibility Traces in Deep Q-Networks
Investigating Recurrence and Eligibility Traces in Deep Q-Networks
J. Harb
Doina Precup
54
21
0
18 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for
  Reinforcement Learning
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
76
58
0
15 Apr 2017
Virtual to Real Reinforcement Learning for Autonomous Driving
Virtual to Real Reinforcement Learning for Autonomous Driving
Xinlei Pan
Yurong You
Ziyan Wang
Cewu Lu
OffRL
121
338
0
13 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren
Xiaoyu Wang
Ning Zhang
Xutao Lv
Li Li
65
324
0
12 Apr 2017
Deep Q-learning from Demonstrations
Deep Q-learning from Demonstrations
Todd Hester
Matej Vecerík
Olivier Pietquin
Marc Lanctot
Tom Schaul
...
Gabriel Dulac-Arnold
Ian Osband
J. Agapiou
Joel Z Leibo
A. Gruslys
OffRL
94
157
0
12 Apr 2017
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
I. Popov
N. Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej Vecerík
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
OffRL
99
265
0
10 Apr 2017
Stein Variational Policy Gradient
Stein Variational Policy Gradient
Yang Liu
Prajit Ramachandran
Qiang Liu
Jian-wei Peng
80
141
0
07 Apr 2017
Recurrent Environment Simulators
Recurrent Environment Simulators
Silvia Chiappa
S. Racanière
Daan Wierstra
S. Mohamed
85
211
0
07 Apr 2017
Learned Watershed: End-to-End Learning of Seeded Segmentation
Learned Watershed: End-to-End Learning of Seeded Segmentation
Steffen Wolf
Lukas Schott
Ullrich Kothe
Fred Hamprecht
56
35
0
07 Apr 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level
  Coordination in Learning to Play StarCraft Combat Games
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
110
336
0
29 Mar 2017
Socially Aware Motion Planning with Deep Reinforcement Learning
Socially Aware Motion Planning with Deep Reinforcement Learning
Yu Fan Chen
Michael Everett
Miao Liu
Jonathan P. How
121
683
0
26 Mar 2017
Combining Neural Networks and Tree Search for Task and Motion Planning
  in Challenging Environments
Combining Neural Networks and Tree Search for Task and Motion Planning in Challenging Environments
Chris Paxton
Vasumathi Raman
Gregory Hager
Marin Kobilarov
78
123
0
22 Mar 2017
An End-to-End Approach to Natural Language Object Retrieval via
  Context-Aware Deep Reinforcement Learning
An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning
Fan Wu
Zhongwen Xu
Yi Yang
ObjD
60
11
0
22 Mar 2017
Learning to Navigate Cloth using Haptics
Learning to Navigate Cloth using Haptics
Alexander Clegg
Wenhao Yu
Zackory M. Erickson
Jie Tan
Chenxi Liu
Greg Turk
86
23
0
20 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
174
1,545
0
10 Mar 2017
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
Yen-Chen Lin
Zhang-Wei Hong
Yuan-Hong Liao
Meng-Li Shih
Ming-Yuan Liu
Min Sun
AAML
141
419
0
08 Mar 2017
Combining Self-Supervised Learning and Imitation for Vision-Based Rope
  Manipulation
Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation
Ashvin Nair
Dian Chen
Pulkit Agrawal
Phillip Isola
Pieter Abbeel
Jitendra Malik
Sergey Levine
SSL
83
312
0
06 Mar 2017
Neural Episodic Control
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRLBDL
113
346
0
06 Mar 2017
Context-Based Concurrent Experience Sharing in Multiagent Systems
Context-Based Concurrent Experience Sharing in Multiagent Systems
Dan Garant
Bruno Castro da Silva
V. Lesser
Chongjie Zhang
22
4
0
06 Mar 2017
Previous
123...69707172
Next