ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Fast Abstractive Summarization with Reinforce-Selected Sentence
  Rewriting
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
Yen-Chun Chen
Joey Tianyi Zhou
BDL
210
584
0
28 May 2018
Reward Constrained Policy Optimization
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
97
546
0
28 May 2018
Reliability and Learnability of Human Bandit Feedback for
  Sequence-to-Sequence Reinforcement Learning
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
91
88
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
77
84
0
26 May 2018
Self-Net: Lifelong Learning via Continual Self-Modeling
Self-Net: Lifelong Learning via Continual Self-Modeling
Blake Camp
J. Mandivarapu
Rolando Estrada
CLLSSL
71
16
0
25 May 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
99
68
0
25 May 2018
Object-Oriented Dynamics Predictor
Object-Oriented Dynamics Predictor
Guangxiang Zhu
Zhiao Huang
Chongjie Zhang
AI4CE
89
35
0
25 May 2018
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for
  Reinforcement Learning
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
OffRL
95
187
0
25 May 2018
Meta-Gradient Reinforcement Learning
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
117
327
0
24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat3DVOffRL
92
211
0
24 May 2018
Deep Reinforcement Learning of Marked Temporal Point Processes
Deep Reinforcement Learning of Marked Temporal Point Processes
U. Upadhyay
A. De
Manuel Gomez Rodriguez
BDLOffRL
85
112
0
23 May 2018
Variational Inference for Data-Efficient Model Learning in POMDPs
Variational Inference for Data-Efficient Model Learning in POMDPs
Sebastian Tschiatschek
Kai Arulkumaran
Jan Stühmer
Katja Hofmann
55
15
0
23 May 2018
Gradient Energy Matching for Distributed Asynchronous Gradient Descent
Gradient Energy Matching for Distributed Asynchronous Gradient Descent
Joeri Hermans
Gilles Louppe
53
5
0
22 May 2018
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy
  Gradients
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients
Arbaaz Khan
Clark Zhang
Daniel D. Lee
Vijay Kumar
Alejandro Ribeiro
64
30
0
22 May 2018
Guided Feature Transformation (GFT): A Neural Language Grounding Module
  for Embodied Agents
Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents
Haonan Yu
Xiaochen Lian
Haichao Zhang
Wenyuan Xu
LM&Ro
58
21
0
22 May 2018
Multiple-Step Greedy Policies in Online and Approximate Reinforcement
  Learning
Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
119
14
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
132
232
0
21 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
73
66
0
20 May 2018
Episodic Memory Deep Q-Networks
Episodic Memory Deep Q-Networks
Zichuan Lin
Tianqi Zhao
Guangwen Yang
Lintao Zhang
OffRL
61
87
0
19 May 2018
End-to-end driving simulation via angle branched network
End-to-end driving simulation via angle branched network
Qing Wang
Long Chen
Wei Tian
48
9
0
19 May 2018
Solving the Rubik's Cube Without Human Knowledge
Solving the Rubik's Cube Without Human Knowledge
Stephen Marcus McAleer
Forest Agostinelli
Alexander Shmakov
Pierre Baldi
57
41
0
18 May 2018
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for
  Map-less Navigation by Leveraging Prior Demonstrations
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations
Mark Pfeiffer
Samarth Shukla
M. Turchetta
Cesar Cadena
Andreas Krause
Roland Siegwart
Juan I. Nieto
78
159
0
18 May 2018
Learning Time-Sensitive Strategies in Space Fortress
Akshat Agarwal
Ryan Hope
Katia Sycara
60
0
0
17 May 2018
Fast Retinomorphic Event Stream for Video Recognition and Reinforcement
  Learning
Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning
Wanjia Liu
Huaijin Chen
Rishab Goel
Yuzhong Huang
Ashok Veeraraghavan
Ankit B. Patel
OffRL
34
2
0
16 May 2018
Spark-MPI: Approaching the Fifth Paradigm of Cognitive Applications
Spark-MPI: Approaching the Fifth Paradigm of Cognitive Applications
N. Malitsky
R. Castain
Matt Cowan
60
7
0
16 May 2018
Visual Representations for Semantic Target Driven Navigation
Visual Representations for Semantic Target Driven Navigation
Arsalan Mousavian
Alexander Toshev
Marek Fiser
Jana Kosecka
Ayzaan Wahid
James Davidson
89
202
0
15 May 2018
Low-pass Recurrent Neural Networks - A memory architecture for
  longer-term correlation discovery
Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery
T. Stepleton
Razvan Pascanu
Will Dabney
Siddhant M. Jayakumar
Hubert Soyer
Rémi Munos
83
4
0
13 May 2018
Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent
  for Reinforcement Learning Control
Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
K. Young
Baoxiang Wang
Matthew E. Taylor
OffRL
84
15
0
10 May 2018
Policy Optimization with Second-Order Advantage Information
Policy Optimization with Second-Order Advantage Information
Jiajin Li
Baoxiang Wang
41
6
0
09 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
98
42
0
09 May 2018
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Yu-Jhe Li
Hsin-Yu Chang
Yu-Jing Lin
Po-Wei Wu
Y. Wang
GAN
31
5
0
05 May 2018
Motion Planning Among Dynamic, Decision-Making Agents with Deep
  Reinforcement Learning
Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning
Michael Everett
Yu Fan Chen
Jonathan P. How
255
520
0
04 May 2018
A Reinforcement Learning Approach to Interactive-Predictive Neural
  Machine Translation
A Reinforcement Learning Approach to Interactive-Predictive Neural Machine Translation
Tsz Kin Lam
Julia Kreutzer
Stefan Riezler
79
32
0
03 May 2018
AGI Safety Literature Review
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
86
116
0
03 May 2018
Falsification of Cyber-Physical Systems Using Deep Reinforcement
  Learning
Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning
Takumi Akazaki
Shuang Liu
Yoriyuki Yamagata
Yihai Duan
Jianye Hao
AI4CE
77
92
0
01 May 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for
  Neural Sequence Prediction
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
Zihang Dai
Qizhe Xie
Eduard H. Hovy
41
6
0
29 Apr 2018
Decoupling Dynamics and Reward for Transfer Learning
Decoupling Dynamics and Reward for Transfer Learning
Amy Zhang
Harsh Satija
Joelle Pineau
OOD
80
72
0
27 Apr 2018
Deep Reinforcement Learning to Acquire Navigation Skills for
  Wheel-Legged Robots in Complex Environments
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
102
44
0
27 Apr 2018
Generative Temporal Models with Spatial Memory for Partially Observed
  Environments
Generative Temporal Models with Spatial Memory for Partially Observed Environments
Marco Fraccaro
Danilo Jimenez Rezende
Yori Zwols
Alexander Pritzel
S. M. Ali Eslami
Fabio Viola
122
28
0
25 Apr 2018
Driving Policy Transfer via Modularity and Abstraction
Driving Policy Transfer via Modularity and Abstraction
Matthias Muller
Alexey Dosovitskiy
Guohao Li
V. Koltun
99
225
0
25 Apr 2018
Crawling in Rogue's dungeons with (partitioned) A3C
Crawling in Rogue's dungeons with (partitioned) A3C
Andrea Asperti
Daniele Cortesi
Francesco Sovrano
67
12
0
23 Apr 2018
Attention Based Natural Language Grounding by Navigating Virtual
  Environment
Attention Based Natural Language Grounding by Navigating Virtual Environment
B. Akilesh
Abhishek Sinha
Mausoom Sarkar
Balaji Krishnamurthy
LM&Ro
48
11
0
23 Apr 2018
Distributed Distributional Deterministic Policy Gradients
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
114
480
0
23 Apr 2018
A Study on Overfitting in Deep Reinforcement Learning
A Study on Overfitting in Deep Reinforcement Learning
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
OffRLOnRL
63
391
0
18 Apr 2018
The Limits and Potentials of Deep Learning for Robotics
The Limits and Potentials of Deep Learning for Robotics
Niko Sünderhauf
Oliver Brock
Walter J. Scheirer
R. Hadsell
Dieter Fox
...
B. Upcroft
Pieter Abbeel
Wolfram Burgard
Michael Milford
Peter Corke
89
530
0
18 Apr 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
57
22
0
17 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng
Junhyuk Oh
Satinder Singh
77
210
0
17 Apr 2018
Leveraging Statistical Multi-Agent Online Planning with Emergent Value
  Function Approximation
Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation
Thomy Phan
Lenz Belzner
Thomas Gabor
Kyrill Schmid
OffRL
60
15
0
17 Apr 2018
Automated vehicle's behavior decision making using deep reinforcement
  learning and high-fidelity simulation environment
Automated vehicle's behavior decision making using deep reinforcement learning and high-fidelity simulation environment
Yingjun Ye
Xiaohui Zhang
Jian Sun
52
133
0
17 Apr 2018
Rafiki: Machine Learning as an Analytics Service System
Rafiki: Machine Learning as an Analytics Service System
Wei Wang
Sheng Wang
Jinyang Gao
Meihui Zhang
Gang Chen
Teck Khim Ng
Beng Chin Ooi
105
113
0
17 Apr 2018
Previous
123...636465...707172
Next