Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Yuping Luo
Huazhe Xu
Yuanzhi Li
Yuandong Tian
Trevor Darrell
Tengyu Ma
OffRL
121
227
0
10 Jul 2018
CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving
Xiaodan Liang
Tairui Wang
Luona Yang
Eric Xing
96
272
0
10 Jul 2018
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
128
813
0
10 Jul 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL, SSL
560
10,398
0
10 Jul 2018
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Rosanne Liu
Joel Lehman
Piero Molino
F. Such
Eric Frank
Alexander Sergeev
J. Yosinski
125
895
0
09 Jul 2018
End-to-End Race Driving with Deep Reinforcement Learning
M. Jaritz
Raoul de Charette
Marin Toromanoff
E. Perot
F. Nashashibi
60
169
0
06 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Chen Liang
Mohammad Norouzi
Jonathan Berant
Quoc V. Le
Ni Lao
134
134
0
06 Jul 2018
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
103
155
0
06 Jul 2018
Variance Reduction for Reinforcement Learning in Input-Driven Environments
Hongzi Mao
S. Venkatakrishnan
Malte Schwarzkopf
Mohammad Alizadeh
OffRL
102
95
0
06 Jul 2018
Deep Reinforcement Learning for Doom using Unsupervised Auxiliary Tasks
Georgios Papoudakis
Kyriakos C. Chatzidimitriou
P. Mitkas
51
8
0
05 Jul 2018
Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization
Alexandre Laterre
Yunguan Fu
Mohamed Khalil Jabri
A. Cohen
David Kas
Karl Hajjar
T. Dahl
Amine Kerkeni
Karim Beguir
128
80
0
04 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
74
104
0
04 Jul 2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
164
732
0
03 Jul 2018
Solving Atari Games Using Fractals And Entropy
Sergio Hernandez Cerezo
Guillem Duran Ballester
Spiros Baxevanakis
15
2
0
03 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Xiangxiang Chu
105
9
0
02 Jul 2018
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
115
659
0
01 Jul 2018
Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation
Niels Justesen
R. Torrado
Philip Bontrager
Ahmed Khalifa
Julian Togelius
S. Risi
196
186
0
28 Jun 2018
Guided evolutionary strategies: Augmenting random search with surrogate gradients
Niru Maheswaranathan
Luke Metz
George Tucker
Dami Choi
Jascha Narain Sohl-Dickstein
76
20
0
26 Jun 2018
Learning Existing Social Conventions via Observationally Augmented Self-Play
Adam Lerer
A. Peysakhovich
OffRL
61
2
0
26 Jun 2018
Many-Goals Reinforcement Learning
Vivek Veeriah
Junhyuk Oh
Satinder Singh
KELM
75
53
0
22 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
130
222
0
20 Jun 2018
Reinforcement Learning using Augmented Neural Networks
Jack Shannon
M. Grzes
11
1
0
20 Jun 2018
Learning from Chunk-based Feedback in Neural Machine Translation
Pavel Petrushkov
Shahram Khadivi
E. Matusov
64
19
0
19 Jun 2018
Task-Relevant Object Discovery and Categorization for Playing First-person Shooter Games
Junchi Liang
Abdeslam Boularias
47
2
0
17 Jun 2018
Evolving simple programs for playing Atari games
Dennis G. Wilson
Sylvain Cussat-Blanc
H. Luga
J. Miller
74
62
0
14 Jun 2018
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
88
251
0
14 Jun 2018
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
87
480
0
14 Jun 2018
Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning
Kunkun Pang
Mingzhi Dong
Yang Wu
Timothy M. Hospedales
OffRL
46
91
0
12 Jun 2018
Multi-Agent Deep Reinforcement Learning with Human Strategies
Thanh Nguyen
Ngoc Duy Nguyen
S. Nahavandi
82
12
0
12 Jun 2018
Implicit Policy for Reinforcement Learning
Yunhao Tang
Shipra Agrawal
64
14
0
10 Jun 2018
Distributional Advantage Actor-Critic
Shangda Li
Selina Bing
Steven Yang
OffRL
25
6
0
10 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV, BDL
97
380
0
08 Jun 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
SSL, BDL, AIFin
87
146
0
07 Jun 2018
Deep Reinforcement Learning for General Video Game AI
R. Torrado
Philip Bontrager
Julian Togelius
Jialin Liu
Diego Perez-Liebana
87
131
0
06 Jun 2018
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDL, OffRL
104
263
0
06 Jun 2018
Meta-Learning by the Baldwin Effect
Chrisantha Fernando
Jakub Sygnowski
Simon Osindero
Jane X. Wang
Tom Schaul
Denis Teplyashin
Pablo Sprechmann
Alexander Pritzel
Andrei A. Rusu
87
39
0
06 Jun 2018
Learning to Understand Goal Specifications by Modelling Reward
Dzmitry Bahdanau
Felix Hill
Jan Leike
Edward Hughes
Seyedarian Hosseini
Pushmeet Kohli
Edward Grefenstette
190
159
0
05 Jun 2018
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces
G. Z. Holland
Erik Talvitie
Michael Bowling
AI4CE
72
43
0
05 Jun 2018
Mix&Match - Agent Curricula for Reinforcement Learning
Wojciech M. Czarnecki
Siddhant M. Jayakumar
Max Jaderberg
Leonard Hasenclever
Yee Whye Teh
Simon Osindero
N. Heess
Razvan Pascanu
82
72
0
05 Jun 2018
TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
OffRL
71
19
0
04 Jun 2018
DAQN: Deep Auto-encoder and Q-Network
Daiki Kimura
58
18
0
02 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions
Fernando Martínez-Plumed
S. Avin
Miles Brundage
Allan Dafoe
Seán Ó hÉigeartaigh
José Hernández-Orallo
64
3
0
02 Jun 2018
Internal Model from Observations for Reward Shaping
Daiki Kimura
Subhajit Chaudhury
Ryuki Tachibana
Sakyasingha Dasgupta
106
22
0
02 Jun 2018
Efficient Entropy for Policy Gradient with Multidimensional Action Space
Yiming Zhang
Q. Vuong
Kenny Song
Xiao-Yue Gong
George Andriopoulos
62
17
0
02 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems
C. Stanton
Jeff Clune
LRM
62
41
0
01 Jun 2018
Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling
Kenny Young
R. Sutton
Shuo Yang
56
10
0
01 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
90
7
0
01 Jun 2018
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Shani Gamrian
Yoav Goldberg
124
108
0
31 May 2018
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
George Andriopoulos
77
20
0
29 May 2018
Learning to Transcribe by Ear
Rainer Kelz
Gerhard Widmer
16
0
0
29 May 2018