ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,672 papers shown
Title
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL
AI4CE
36
156
0
30 Jun 2020
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Hang Zhao
Qijin She
Chenyang Zhu
Yifan Yang
Kai Xu
OffRL
27
120
0
26 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
33
55
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
30
27
0
23 Jun 2020
dm_control: Software and Tasks for Continuous Control
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
42
400
0
22 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
47
125
0
22 Jun 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement
  Learning
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
30
82
0
20 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
38
3
0
18 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
10
28
0
18 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from
  Demonstrations
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
25
4
0
17 Jun 2020
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Thomas Asikis
Lucas Böttcher
Nino Antulov-Fantulin
35
43
0
17 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
43
74
0
16 Jun 2020
Solving the Order Batching and Sequencing Problem using Deep
  Reinforcement Learning
Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning
Bram Cals
Yingqian Zhang
R. Dijkman
Claudy van Dorst
OffRL
25
29
0
16 Jun 2020
Agent Modelling under Partial Observability for Deep Reinforcement
  Learning
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
31
61
0
16 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
46
592
0
16 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
221
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
18
18
0
14 Jun 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
187
6,682
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Systematic Generalisation through Task Temporal Logic and Deep
  Reinforcement Learning
Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
NAI
AI4CE
36
29
0
12 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
215
0
10 Jun 2020
Deep Learning for Change Detection in Remote Sensing Images:
  Comprehensive Review and Meta-Analysis
Deep Learning for Change Detection in Remote Sensing Images: Comprehensive Review and Meta-Analysis
Lazhar Khelifi
M. Mignotte
34
259
0
10 Jun 2020
Stealing Deep Reinforcement Learning Models for Fun and Profit
Stealing Deep Reinforcement Learning Models for Fun and Profit
Kangjie Chen
Shangwei Guo
Tianwei Zhang
Xiaofei Xie
Yang Liu
MLAU
MIACV
OffRL
24
45
0
09 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
26
42
0
08 Jun 2020
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
28
7
0
08 Jun 2020
Deep Reinforcement Learning for Human-Like Driving Policies in Collision
  Avoidance Tasks of Self-Driving Cars
Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars
Ran Emuna
A. Borowsky
Armin Biess
42
22
0
07 Jun 2020
Re-understanding Finite-State Representations of Recurrent Policy
  Networks
Re-understanding Finite-State Representations of Recurrent Policy Networks
Mohamad H. Danesh
Anurag Koul
Alan Fern
Saeed Khorram
38
21
0
06 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning
  Machines
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
19
4
0
04 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
67
225
0
01 Jun 2020
The Adversarial Resilience Learning Architecture for AI-based Modelling,
  Exploration, and Operation of Complex Cyber-Physical Systems
The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems
Eric M. S. P. Veith
Nils Wenninghoff
Emilie Frost
28
5
0
27 May 2020
Efficient Use of heuristics for accelerating XCS-based Policy Learning
  in Markov Games
Efficient Use of heuristics for accelerating XCS-based Policy Learning in Markov Games
Hao Chen
Chang Wang
Jian Huang
Jianxing Gong
16
5
0
26 May 2020
Learning to Simulate Dynamic Environments with GameGAN
Learning to Simulate Dynamic Environments with GameGAN
Seung Wook Kim
Yuhao Zhou
Jonah Philion
Antonio Torralba
Sanja Fidler
GAN
28
102
0
25 May 2020
Gradient Monitored Reinforcement Learning
Gradient Monitored Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Gavneet Singh Chadha
Andreas Schwung
S. Ding
33
11
0
25 May 2020
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
Jianfeng Liu
Feiyang Pan
Ling Luo
OffRL
22
23
0
24 May 2020
Learning from Naturalistic Driving Data for Human-like Autonomous
  Highway Driving
Learning from Naturalistic Driving Data for Human-like Autonomous Highway Driving
Donghao Xu
Zhezhang Ding
Xu He
Huijing Zhao
M. Moze
François Aioun
F. Guillemard
14
51
0
23 May 2020
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR
  Control in Active Distribution Networks
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks
Haotian Liu
Wenchuan Wu
OffRL
19
96
0
20 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
30
83
0
20 May 2020
Human Instruction-Following with Deep Reinforcement Learning via
  Transfer-Learning from Text
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
31
81
0
19 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular
  Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
38
83
0
18 May 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
44
87
0
16 May 2020
On the Global Convergence Rates of Softmax Policy Gradient Methods
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
47
276
0
13 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
32
57
0
12 May 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSL
OffRL
33
27
0
07 May 2020
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical
  Systems
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems
Anthony Corso
Robert J. Moss
Mark Koren
Ritchie Lee
Mykel J. Kochenderfer
36
173
0
06 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through
  Informed Policy Regularization
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
30
19
0
06 May 2020
Robotic Arm Control and Task Training through Deep Reinforcement
  Learning
Robotic Arm Control and Task Training through Deep Reinforcement Learning
Andrea Franceschetti
E. Tosello
Nicola Castaman
Stefano Ghidoni
18
32
0
06 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
95
146
0
04 May 2020
Formal Policy Synthesis for Continuous-Space Systems via Reinforcement
  Learning
Formal Policy Synthesis for Continuous-Space Systems via Reinforcement Learning
Milad Kazemi
Sadegh Soudjani
35
29
0
04 May 2020
Noise Pollution in Hospital Readmission Prediction: Long Document
  Classification with Reinforcement Learning
Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning
Liyan Xu
J. Hogan
R. Patzer
Jinho Choi
22
4
0
04 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A
  Survey
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
33
456
0
02 May 2020
Previous
123...192021...323334
Next