ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
ViZDoom Competitions: Playing Doom from Pixels
ViZDoom Competitions: Playing Doom from Pixels
Marek Wydmuch
Michal Kempka
Wojciech Ja'skowski
58
119
0
10 Sep 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
78
8
0
10 Sep 2018
Adaptive Behavior Generation for Autonomous Driving using Deep
  Reinforcement Learning with Compact Semantic States
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States
Peter Wolf
Karl Kurzer
Tobias Wingert
Florian Kuhnt
Johann Marius Zöllner
60
56
0
10 Sep 2018
Learning Adaptive Display Exposure for Real-Time Advertising
Learning Adaptive Display Exposure for Real-Time Advertising
Weixun Wang
Junqi Jin
Jianye Hao
Chunjie Chen
Chuan Yu
...
Xiaotian Hao
Yixi Wang
Han Li
Jian Xu
Kun Gai
43
6
0
10 Sep 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization
  (VR-MCCFR) for Extensive Form Games using Baselines
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravcík
Rudolf Kadlec
Michael Bowling
158
64
0
09 Sep 2018
Learning Invariances for Policy Generalization
Learning Invariances for Policy Generalization
Rémi Tachet des Combes
Philip Bachman
H. V. Seijen
83
12
0
07 Sep 2018
Improving On-policy Learning with Statistical Reward Accumulation
Improving On-policy Learning with Statistical Reward Accumulation
Yubin Deng
K. Yu
Dahua Lin
Xiaoou Tang
Chen Change Loy
OffRL
31
0
0
07 Sep 2018
Challenges of Context and Time in Reinforcement Learning: Introducing
  Space Fortress as a Benchmark
Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark
Akshat Agarwal
Ryan Hope
Katia Sycara
OffRL
34
9
0
06 Sep 2018
ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning
  Models
ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models
Yueh-hua Wu
Fan-Yun Sun
Yen-Yu Chang
Shou-De Lin
57
5
0
06 Sep 2018
Emergence of Human-comparable Balancing Behaviors by Deep Reinforcement
  Learning
Emergence of Human-comparable Balancing Behaviors by Deep Reinforcement Learning
Chuanyu Yang
Taku Komura
Zhibin Li
55
20
0
06 Sep 2018
How to Combine Tree-Search Methods in Reinforcement Learning
How to Combine Tree-Search Methods in Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
61
32
0
06 Sep 2018
Transferring Deep Reinforcement Learning with Adversarial Objective and
  Augmentation
Transferring Deep Reinforcement Learning with Adversarial Objective and Augmentation
Shu-Hsuan Hsu
I-Chao Shen
Bing-Yu Chen
21
2
0
04 Sep 2018
A Minimum Discounted Reward Hamilton-Jacobi Formulation for Computing
  Reachable Sets
A Minimum Discounted Reward Hamilton-Jacobi Formulation for Computing Reachable Sets
Anayo K. Akametalu
Shromona Ghosh
J. F. Fisac
Claire Tomlin
35
13
0
03 Sep 2018
Emergence of Communication in an Interactive World with Consistent
  Speakers
Emergence of Communication in an Interactive World with Consistent Speakers
Ben Bogin
Mor Geva
Jonathan Berant
79
43
0
03 Sep 2018
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement
  Learning
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning
Hugo Caselles-Dupré
Louis Annabi
Oksana Hagen
Michael Garcia Ortiz
David Filliat
82
13
0
03 Sep 2018
Visual Transfer between Atari Games using Competitive Reinforcement
  Learning
Visual Transfer between Atari Games using Competitive Reinforcement Learning
Akshita Mittel
Sowmya P. Munukutla
Himanshi Yadav
44
11
0
02 Sep 2018
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset
He Huang
Yujing Shen
Jiankai Sun
Cewu Lu
3DV
53
2
0
25 Aug 2018
LIFT: Reinforcement Learning in Computer Systems by Learning From
  Demonstrations
LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations
Michael Schaarschmidt
A. Kuhnle
Ben Ellis
Kai Fricke
Felix Gessert
Eiko Yoneki
OffRL
55
41
0
23 Aug 2018
Reinforcement Learning for Autonomous Defence in Software-Defined
  Networking
Reinforcement Learning for Autonomous Defence in Software-Defined Networking
Yi Han
Benjamin I. P. Rubinstein
Tamas Abraham
T. Alpcan
O. Vel
S. Erfani
David Hubczenko
C. Leckie
Paul Montague
AAML
55
69
0
17 Aug 2018
Experiential Robot Learning with Accelerated Neuroevolution
Experiential Robot Learning with Accelerated Neuroevolution
Ahmed Aly
J. Dugan
34
1
0
16 Aug 2018
Automatic Derivation Of Formulas Using Reforcement Learning
Automatic Derivation Of Formulas Using Reforcement Learning
Minzhong Luo
Li Liu
21
5
0
15 Aug 2018
Visual Sensor Network Reconfiguration with Deep Reinforcement Learning
Visual Sensor Network Reconfiguration with Deep Reinforcement Learning
Paul Jasek
Bernard Abayowa
22
2
0
13 Aug 2018
Learning to Represent Bilingual Dictionaries
Learning to Represent Bilingual Dictionaries
Muhao Chen
Yingtao Tian
Haochen Chen
Kai-Wei Chang
Steven Skiena
C. Zaniolo
44
13
0
10 Aug 2018
End-to-end Active Object Tracking and Its Real-world Deployment via
  Reinforcement Learning
End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning
Wenhan Luo
Peng Sun
Fangwei Zhong
Wen Liu
Tong Zhang
Yizhou Wang
75
127
0
10 Aug 2018
Policy Optimization as Wasserstein Gradient Flows
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
88
68
0
09 Aug 2018
Learning to Share and Hide Intentions using Information Regularization
Learning to Share and Hide Intentions using Information Regularization
D. Strouse
Max Kleiman-Weiner
J. Tenenbaum
M. Botvinick
D. Schwab
76
60
0
06 Aug 2018
An Efficient Deep Reinforcement Learning Model for Urban Traffic Control
An Efficient Deep Reinforcement Learning Model for Urban Traffic Control
Yilun Lin
Xingyuan Dai
Li Li
Feiyue Wang
39
59
0
06 Aug 2018
Neural Arithmetic Logic Units
Neural Arithmetic Logic Units
Andrew Trask
Felix Hill
Scott E. Reed
Jack W. Rae
Chris Dyer
Phil Blunsom
NAI
96
206
0
01 Aug 2018
Egocentric Spatial Memory
Egocentric Spatial Memory
Mengmi Zhang
K. Ma
S. Yen
J. Lim
Qi Zhao
Jiashi Feng
EgoV
57
4
0
31 Jul 2018
Learning to Interrupt: A Hierarchical Deep Reinforcement Learning
  Framework for Efficient Exploration
Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration
Tingguang Li
Jin Pan
Delong Zhu
Max Meng
41
14
0
30 Jul 2018
Visual Analogies between Atari Games for Studying Transfer Learning in
  RL
Visual Analogies between Atari Games for Studying Transfer Learning in RL
D. Sobol
Lior Wolf
Yaniv Taigman
OffRL
44
7
0
29 Jul 2018
Variational Option Discovery Algorithms
Variational Option Discovery Algorithms
Joshua Achiam
Harrison Edwards
Dario Amodei
Pieter Abbeel
DRL
78
180
0
26 Jul 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
62
2
0
26 Jul 2018
Attend Before you Act: Leveraging human visual attention for continual
  learning
Attend Before you Act: Leveraging human visual attention for continual learning
Khimya Khetarpal
Doina Precup
39
7
0
25 Jul 2018
Variational Bayesian Reinforcement Learning with Regret Bounds
Variational Bayesian Reinforcement Learning with Regret Bounds
Brendan O'Donoghue
115
41
0
25 Jul 2018
Backprop-Q: Generalized Backpropagation for Stochastic Computation
  Graphs
Backprop-Q: Generalized Backpropagation for Stochastic Computation Graphs
Xiaoran Xu
Songpeng Zu
Yuan Zhang
Hanning Zhou
Wei Feng
BDL
59
4
0
25 Jul 2018
Multi-Agent Reinforcement Learning: A Report on Challenges and
  Approaches
Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches
Sanyam Kapoor
45
31
0
25 Jul 2018
Learning to Play Pong using Policy Gradient Learning
Learning to Play Pong using Policy Gradient Learning
S. Phon-Amnuaisuk
OffRL
14
0
0
23 Jul 2018
Implementation of Q Learning and Deep Q Network For Controlling a Self
  Balancing Robot Model
Implementation of Q Learning and Deep Q Network For Controlling a Self Balancing Robot Model
Md. Muhaimin Rahman
SM Hasanur Rashid
M. M. Hossain
SSL
6
33
0
22 Jul 2018
Asynchronous Advantage Actor-Critic Agent for Starcraft II
Asynchronous Advantage Actor-Critic Agent for Starcraft II
Basel Alghanem
G. KeerthanaP.
OffRL
31
5
0
22 Jul 2018
Recent Advances in Deep Learning: An Overview
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
106
117
0
21 Jul 2018
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain
Khimya Khetarpal
Doina Precup
70
27
0
21 Jul 2018
FuzzerGym: A Competitive Framework for Fuzzing and Learning
FuzzerGym: A Competitive Framework for Fuzzing and Learning
W. Drozd
Michael D. Wagner
66
33
0
19 Jul 2018
Gradient Band-based Adversarial Training for Generalized Attack Immunity
  of A3C Path Finding
Gradient Band-based Adversarial Training for Generalized Attack Immunity of A3C Path Finding
Tong Chen
Wenjia Niu
Yingxiao Xiang
XiaoXuan Bai
Jiqiang Liu
Zhen Han
Gang Li
AAML
62
24
0
18 Jul 2018
Learning to Listen, Read, and Follow: Score Following as a Reinforcement
  Learning Game
Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game
Matthias Dorfer
Florian Henkel
Gerhard Widmer
66
34
0
17 Jul 2018
Online Robust Policy Learning in the Presence of Unknown Adversaries
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron J. Havens
Zhanhong Jiang
Soumik Sarkar
AAML
120
44
0
16 Jul 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
Visual Reinforcement Learning with Imagined Goals
Visual Reinforcement Learning with Imagined Goals
Ashvin Nair
Vitchyr H. Pong
Murtaza Dalal
Shikhar Bahl
Steven Lin
Sergey Levine
SSL
112
544
0
12 Jul 2018
Training Neural Networks Using Features Replay
Training Neural Networks Using Features Replay
Zhouyuan Huo
Bin Gu
Heng-Chiao Huang
94
70
0
12 Jul 2018
Learning Deployable Navigation Policies at Kilometer Scale from a Single
  Traversal
Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal
Jake Bruce
Niko Sünderhauf
Piotr Wojciech Mirowski
R. Hadsell
Michael Milford
95
35
0
11 Jul 2018
Previous
123...616263...707172
Next