ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,612 papers shown
Title
TAP-Net: Transport-and-Pack using Reinforcement Learning
TAP-Net: Transport-and-Pack using Reinforcement Learning
Huang Ruizhen
XU Juzhan
Bin Chen
Minglun Gong
Hao Zhang
Hui Huang
26
25
0
03 Sep 2020
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown
  Dynamics
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun
Da Huo
Furong Huang
AAML
OffRL
OnRL
37
49
0
02 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments
  using A3C learning and Residual Recurrent Neural Networks
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
27
176
0
01 Sep 2020
PlotThread: Creating Expressive Storyline Visualizations using
  Reinforcement Learning
PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning
Tan Tang
Renzhong Li
Xinke Wu
Shuhan Liu
Johannes Knittel
Steffen Koch
Thomas Ertl
Lingyun Yu
Peiran Ren
Yingcai Wu
41
52
0
01 Sep 2020
Real-world Video Adaptation with Reinforcement Learning
Real-world Video Adaptation with Reinforcement Learning
Hongzi Mao
Shannon Chen
Drew Dimmery
Shaun Singh
Drew Blaisdell
Yuandong Tian
Mohammad Alizadeh
E. Bakshy
OffRL
8
76
0
28 Aug 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity
  Edge Devices
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
31
2
0
27 Aug 2020
Constrained Markov Decision Processes via Backward Value Functions
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
46
51
0
26 Aug 2020
Exploiting Scene-specific Features for Object Goal Navigation
Exploiting Scene-specific Features for Object Goal Navigation
Tommaso Campari
Paolo Eccher
Luciano Serafini
Lamberto Ballan
28
28
0
21 Aug 2020
Ubiquitous Distributed Deep Reinforcement Learning at the Edge:
  Analyzing Byzantine Agents in Discrete Action Spaces
Ubiquitous Distributed Deep Reinforcement Learning at the Edge: Analyzing Byzantine Agents in Discrete Action Spaces
Wenshuai Zhao
Jorge Peña Queralta
Qingqing Li
Tomi Westerlund
14
6
0
18 Aug 2020
Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning
  with Average and Discounted Rewards
Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards
Umer Siddique
Paul Weng
Matthieu Zimmer
FaML
OffRL
22
84
0
18 Aug 2020
Reducing Sampling Error in Batch Temporal Difference Learning
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
25
12
0
15 Aug 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect
  Information
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
42
19
0
14 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
29
72
0
08 Aug 2020
Robust Deep Reinforcement Learning through Adversarial Loss
Robust Deep Reinforcement Learning through Adversarial Loss
Tuomas P. Oikarinen
Wang Zhang
Alexandre Megretski
Luca Daniel
Tsui-Wei Weng
AAML
49
94
0
05 Aug 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
26
42
0
02 Aug 2020
MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement
  Learning in Mixed Dynamic Environments
MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments
Zuxin Liu
Baiming Chen
Hongyi Zhou
G. Koushik
M. Hebert
Ding Zhao
AI4CE
58
86
0
30 Jul 2020
Understanding the Stability of Deep Control Policies for Biped
  Locomotion
Understanding the Stability of Deep Control Policies for Biped Locomotion
Hwangpil Park
R. Yu
Yoonsang Lee
Kyungho Lee
Jehee Lee
22
9
0
30 Jul 2020
Modular Transfer Learning with Transition Mismatch Compensation for
  Excessive Disturbance Rejection
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
46
1
0
29 Jul 2020
Adaptive Bitrate Video Streaming for Wireless nodes: A Survey
Adaptive Bitrate Video Streaming for Wireless nodes: A Survey
Kamran Nishat
O. Gnawali
A. Abdelhadi
14
1
0
27 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
25
4
0
24 Jul 2020
Value-Decomposition Multi-Agent Actor-Critics
Value-Decomposition Multi-Agent Actor-Critics
Jianyu Su
Stephen C. Adams
Peter A. Beling
68
101
0
24 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
27
77
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
27
70
0
16 Jul 2020
Information Freshness-Aware Task Offloading in Air-Ground Integrated
  Edge Computing Systems
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Xianfu Chen
Celimuge Wu
Tao Chen
Zhi Liu
Honggang Zhang
M. Bennis
Hang Liu
Yusheng Ji
24
71
0
15 Jul 2020
Relational-Grid-World: A Novel Relational Reasoning Environment and An
  Agent Model for Relational Information Extraction
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
Faruk Küçüksubasi
Elif Surer
24
2
0
12 Jul 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and
  Hyperparameter Optimization
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization
Yimin Huang
Yujun Li
Hanrong Ye
Zhenguo Li
Zhihua Zhang
34
7
0
11 Jul 2020
Self-Supervised Policy Adaptation during Deployment
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
41
159
0
08 Jul 2020
Tracking-by-Trackers with a Distilled and Reinforced Model
Tracking-by-Trackers with a Distilled and Reinforced Model
Matteo Dunnhofer
N. Martinel
C. Micheloni
VOT
OffRL
27
4
0
08 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
30
72
0
04 Jul 2020
Adaptive Discretization for Model-Based Reinforcement Learning
Adaptive Discretization for Model-Based Reinforcement Learning
Sean R. Sinclair
Tianyu Wang
Gauri Jain
Siddhartha Banerjee
Chao Yu
OffRL
19
21
0
01 Jul 2020
Robustifying the Deployment of tinyML Models for Autonomous
  mini-vehicles
Robustifying the Deployment of tinyML Models for Autonomous mini-vehicles
Miguel de Prado
Manuele Rusci
Romain Donze
Alessandro Capotondi
Serge Monnerat
Luca Benini and
Nuria Pazos
30
39
0
01 Jul 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL
AI4CE
31
156
0
30 Jun 2020
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Hang Zhao
Qijin She
Chenyang Zhu
Yifan Yang
Kai Xu
OffRL
27
120
0
26 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
33
55
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
30
27
0
23 Jun 2020
dm_control: Software and Tasks for Continuous Control
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
42
399
0
22 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
40
125
0
22 Jun 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement
  Learning
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
30
82
0
20 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
33
3
0
18 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from
  Demonstrations
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
23
4
0
17 Jun 2020
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Neural Ordinary Differential Equation Control of Dynamics on Graphs
Thomas Asikis
Lucas Böttcher
Nino Antulov-Fantulin
33
43
0
17 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
43
74
0
16 Jun 2020
Solving the Order Batching and Sequencing Problem using Deep
  Reinforcement Learning
Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning
Bram Cals
Yingqian Zhang
R. Dijkman
Claudy van Dorst
OffRL
25
29
0
16 Jun 2020
Agent Modelling under Partial Observability for Deep Reinforcement
  Learning
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
29
61
0
16 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
46
592
0
16 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
220
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
18
18
0
14 Jun 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
181
6,679
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Systematic Generalisation through Task Temporal Logic and Deep
  Reinforcement Learning
Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
NAI
AI4CE
33
28
0
12 Jun 2020
Previous
123...181920...313233
Next