ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Re-understanding Finite-State Representations of Recurrent Policy
  Networks
Re-understanding Finite-State Representations of Recurrent Policy Networks
Mohamad H. Danesh
Anurag Koul
Alan Fern
Saeed Khorram
69
22
0
06 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning
  Machines
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
21
4
0
04 Jun 2020
Jointly Learning Environments and Control Policies with Projected
  Stochastic Gradient Ascent
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent
Adrien Bolland
Ioannis Boukas
M. Berger
D. Ernst
20
3
0
02 Jun 2020
Encoding formulas as deep networks: Reinforcement learning for zero-shot
  execution of LTL formulas
Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas
Yen-Ling Kuo
Boris Katz
Andrei Barbu
78
41
0
01 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
Hyperparameter optimization with REINFORCE and Transformers
Hyperparameter optimization with REINFORCE and Transformers
C. Krishna
Ashish Gupta
Swarnim Narayan
Himanshu Rai
Diksha Manchanda
58
2
0
01 Jun 2020
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement
  Learning
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning
Parvin Malekzadeh
Mohammad Salimibeni
Arash Mohammadi
A. Assa
Konstantinos N. Plataniotis
OffRL
45
12
0
30 May 2020
Deep Reinforcement learning for real autonomous mobile robot navigation
  in indoor environments
Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments
H. Surmann
Christian Jestel
Robin Marchel
Franziska Musberg
Houssem Elhadj
Mahbube Ardani
49
85
0
28 May 2020
Domain Knowledge Integration By Gradient Matching For Sample-Efficient
  Reinforcement Learning
Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning
Parth Chadha
17
0
0
28 May 2020
The Adversarial Resilience Learning Architecture for AI-based Modelling,
  Exploration, and Operation of Complex Cyber-Physical Systems
The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems
Eric M. S. P. Veith
Nils Wenninghoff
Emilie Frost
47
5
0
27 May 2020
MOPO: Model-based Offline Policy Optimization
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
119
776
0
27 May 2020
Towards intervention-centric causal reasoning in learning agents
Towards intervention-centric causal reasoning in learning agents
B. Lansdell
LRMCML
21
0
0
26 May 2020
Efficient Use of heuristics for accelerating XCS-based Policy Learning
  in Markov Games
Efficient Use of heuristics for accelerating XCS-based Policy Learning in Markov Games
Hao Chen
Chang Wang
Jian Huang
Jianxing Gong
114
5
0
26 May 2020
Generator and Critic: A Deep Reinforcement Learning Approach for Slate
  Re-ranking in E-commerce
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce
Jianxiong Wei
Anxiang Zeng
Yueqiu Wu
P. Guo
Q. Hua
Qingpeng Cai
OffRL
74
9
0
25 May 2020
Learning to Simulate Dynamic Environments with GameGAN
Learning to Simulate Dynamic Environments with GameGAN
Seung Wook Kim
Yuhao Zhou
Jonah Philion
Antonio Torralba
Sanja Fidler
GAN
104
106
0
25 May 2020
Gradient Monitored Reinforcement Learning
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
99
11
0
25 May 2020
Policy Entropy for Out-of-Distribution Classification
Policy Entropy for Out-of-Distribution Classification
Andreas Sedlmeier
Robert Muller
Steffen Illium
Claudia Linnhoff-Popien
OODDOffRL
59
14
0
25 May 2020
Learning visual servo policies via planner cloning
Learning visual servo policies via planner cloning
Ulrich Viereck
Kate Saenko
Robert Platt
OffRL
35
2
0
24 May 2020
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
Jianfeng Liu
Feiyang Pan
Ling Luo
OffRL
67
23
0
24 May 2020
Learning from Naturalistic Driving Data for Human-like Autonomous
  Highway Driving
Learning from Naturalistic Driving Data for Human-like Autonomous Highway Driving
Donghao Xu
Zhezhang Ding
Xu He
Huijing Zhao
M. Moze
François Aioun
F. Guillemard
39
55
0
23 May 2020
Evaluating Generalisation in General Video Game Playing
Evaluating Generalisation in General Video Game Playing
Martin Balla
Simon Lucas
Diego Perez-Liebana
44
2
0
22 May 2020
Learning Combinatorial Optimization on Graphs: A Survey with
  Applications to Networking
Learning Combinatorial Optimization on Graphs: A Survey with Applications to Networking
N. Vesselinova
Rebecca Steinert
Daniel F. Perez-Ramirez
Magnus Boman
GNNAI4CE
90
146
0
22 May 2020
Consensus Driven Learning
Consensus Driven Learning
K. Crandall
Dustin J. Webb
FedML
19
0
0
20 May 2020
A Metric Learning Approach to Anomaly Detection in Video Games
A Metric Learning Approach to Anomaly Detection in Video Games
Benedict Wilkins
C. Watkins
Kostas Stathis
37
1
0
20 May 2020
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation
  under Markovian Noise
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise
Yue Wang
Shaofeng Zou
53
21
0
20 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR
  Control in Active Distribution Networks
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
36
98
0
20 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
169
87
0
20 May 2020
Human Instruction-Following with Deep Reinforcement Learning via
  Transfer-Learning from Text
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
101
82
0
19 May 2020
Privileged Information Dropout in Reinforcement Learning
Privileged Information Dropout in Reinforcement Learning
Pierre-Alexandre Kamienny
Kai Arulkumaran
Feryal M. P. Behbahani
Wendelin Boehmer
Shimon Whiteson
61
10
0
19 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular
  Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
98
86
0
18 May 2020
Multi-Objective level generator generation with Marahel
Multi-Objective level generator generation with Marahel
Ahmed Khalifa
Julian Togelius
55
9
0
17 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
94
88
0
16 May 2020
Stealthy and Efficient Adversarial Attacks against Deep Reinforcement
  Learning
Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning
Jianwen Sun
Tianwei Zhang
Xiaofei Xie
Lei Ma
Yan Zheng
Kangjie Chen
Yang Liu
AAML
72
118
0
14 May 2020
Reinforced Coloring for End-to-End Instance Segmentation
Reinforced Coloring for End-to-End Instance Segmentation
Tuan Tran Anh
Khoa Nguyen-Tuan
Tran Minh Quan
Won-Ki Jeong
SSegISeg
37
2
0
14 May 2020
On the Global Convergence Rates of Softmax Policy Gradient Methods
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
176
294
0
13 May 2020
From Simulation to Real World Maneuver Execution using Deep
  Reinforcement Learning
From Simulation to Real World Maneuver Execution using Deep Reinforcement Learning
Alessandro Paolo Capasso
Giulio Bacchiani
A. Broggi
56
4
0
13 May 2020
Proxy Experience Replay: Federated Distillation for Distributed
  Reinforcement Learning
Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning
Han Cha
Jihong Park
Hyesung Kim
M. Bennis
Seong-Lyun Kim
71
26
0
13 May 2020
Training spiking neural networks using reinforcement learning
Training spiking neural networks using reinforcement learning
Sneha Aenugu
OffRL
16
2
0
12 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
88
58
0
12 May 2020
Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and
  Competitive Environments
Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments
Baiming Chen
Mengdi Xu
Zuxin Liu
Liang-Sheng Li
Ding Zhao
70
37
0
11 May 2020
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement
  Learning Problems
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems
Alexis Asseman
Nicolas Antoine
A. Ozcan
34
4
0
10 May 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSLOffRL
89
27
0
07 May 2020
Guided Policy Search Model-based Reinforcement Learning for Urban
  Autonomous Driving
Guided Policy Search Model-based Reinforcement Learning for Urban Autonomous Driving
Zhuo Xu
Jianyu Chen
Masayoshi Tomizuka
30
12
0
06 May 2020
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical
  Systems
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems
Anthony Corso
Robert J. Moss
Mark Koren
Ritchie Lee
Mykel J. Kochenderfer
97
176
0
06 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through
  Informed Policy Regularization
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
87
18
0
06 May 2020
Robotic Arm Control and Task Training through Deep Reinforcement
  Learning
Robotic Arm Control and Task Training through Deep Reinforcement Learning
Andrea Franceschetti
E. Tosello
Nicola Castaman
Stefano Ghidoni
56
32
0
06 May 2020
Generalized Planning With Deep Reinforcement Learning
Generalized Planning With Deep Reinforcement Learning
Or Rivlin
Tamir Hazan
E. Karpas
OffRL
55
48
0
05 May 2020
Discrete-to-Deep Supervised Policy Learning
Discrete-to-Deep Supervised Policy Learning
B. Kurniawan
Peter Vamplew
Michael Papasimeon
Richard Dazeley
Cameron Foale
OffRL
16
3
0
05 May 2020
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning
  for Smart Grids
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning for Smart Grids
Joash Lee
Wenbo Wang
Dusit Niyato
22
9
0
05 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
193
149
0
04 May 2020
Previous
123...434445...707172
Next