ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Area-wide traffic signal control based on a deep graph Q-Network (DGQN)
  trained in an asynchronous manner
Area-wide traffic signal control based on a deep graph Q-Network (DGQN) trained in an asynchronous manner
Gyeongjun Kim
Keemin Sohn
GNN
14
9
0
05 Aug 2020
Learning Transition Models with Time-delayed Causal Relations
Learning Transition Models with Time-delayed Causal Relations
Junchi Liang
Abdeslam Boularias
OffRL
47
3
0
04 Aug 2020
On The Plurality of Graphs
On The Plurality of Graphs
N. Fitzgerald
Jacopo Tagliabue
28
1
0
03 Aug 2020
Tracking the Race Between Deep Reinforcement Learning and Imitation
  Learning -- Extended Version
Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version
Timo P. Gros
Daniel Holler
Jörg Hoffmann
V. Wolf
17
12
0
03 Aug 2020
Proximal Deterministic Policy Gradient
Proximal Deterministic Policy Gradient
Marco Maggipinto
Gian Antonio Susto
Pratik Chaudhari
OffRL
41
5
0
03 Aug 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
91
43
0
02 Aug 2020
MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement
  Learning in Mixed Dynamic Environments
MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments
Zuxin Liu
Baiming Chen
Hongyi Zhou
G. Koushik
M. Hebert
Ding Zhao
AI4CE
107
89
0
30 Jul 2020
Understanding the Stability of Deep Control Policies for Biped
  Locomotion
Understanding the Stability of Deep Control Policies for Biped Locomotion
Hwangpil Park
R. Yu
Yoonsang Lee
Kyungho Lee
Jehee Lee
52
9
0
30 Jul 2020
Modular Transfer Learning with Transition Mismatch Compensation for
  Excessive Disturbance Rejection
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
89
1
0
29 Jul 2020
Adaptive Bitrate Video Streaming for Wireless nodes: A Survey
Adaptive Bitrate Video Streaming for Wireless nodes: A Survey
Kamran Nishat
O. Gnawali
A. Abdelhadi
35
1
0
27 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
95
4
0
24 Jul 2020
Value-Decomposition Multi-Agent Actor-Critics
Value-Decomposition Multi-Agent Actor-Critics
Jianyu Su
Stephen C. Adams
Peter A. Beling
135
106
0
24 Jul 2020
Bridging the Imitation Gap by Adaptive Insubordination
Bridging the Imitation Gap by Adaptive Insubordination
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
93
36
0
23 Jul 2020
Learning Infinite-horizon Average-reward MDPs with Linear Function
  Approximation
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Rahul Jain
94
43
0
23 Jul 2020
Time Perception: A Review on Psychological, Computational and Robotic
  Models
Time Perception: A Review on Psychological, Computational and Robotic Models
Hamit Basgol
I. Ayhan
Emre Ugur
32
14
0
23 Jul 2020
Attend and Segment: Attention Guided Active Semantic Segmentation
Attend and Segment: Attention Guided Active Semantic Segmentation
Soroush Seifi
Tinne Tuytelaars
71
13
0
22 Jul 2020
Learning Object Relation Graph and Tentative Policy for Visual
  Navigation
Learning Object Relation Graph and Tentative Policy for Visual Navigation
Heming Du
Xin Yu
Liang Zheng
84
131
0
21 Jul 2020
Soft Expert Reward Learning for Vision-and-Language Navigation
Soft Expert Reward Learning for Vision-and-Language Navigation
Hu Wang
Qi Wu
Chunhua Shen
57
51
0
21 Jul 2020
Lagrangian Duality in Reinforcement Learning
Lagrangian Duality in Reinforcement Learning
Pranay Pasula
OffRL
30
0
0
20 Jul 2020
Quick Question: Interrupting Users for Microtasks with Reinforcement
  Learning
Quick Question: Interrupting Users for Microtasks with Reinforcement Learning
Bo-Jhang Ho
Bharathan Balaji
Mehmet Köseoğlu
S. Sandha
Siyou Pei
Mani B. Srivastava
37
6
0
18 Jul 2020
Discovering Reinforcement Learning Algorithms
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
94
129
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
92
78
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
78
72
0
16 Jul 2020
Mixture of Step Returns in Bootstrapped DQN
Mixture of Step Returns in Bootstrapped DQN
Po-Han Chiang
Hsuan-Kung Yang
Zhang-Wei Hong
Chun-Yi Lee
45
4
0
16 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation
Active Visual Information Gathering for Vision-Language Navigation
Hanqing Wang
Wenguan Wang
Tianmin Shu
Wei Liang
Jianbing Shen
145
73
0
15 Jul 2020
Information Freshness-Aware Task Offloading in Air-Ground Integrated
  Edge Computing Systems
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Xianfu Chen
Celimuge Wu
Tao Chen
Zhi Liu
Honggang Zhang
M. Bennis
Hang Liu
Yusheng Ji
85
72
0
15 Jul 2020
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep
  Reinforcement Learning
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning
Sabrina Hoppe
Marc Toussaint
OffRL
56
7
0
15 Jul 2020
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient
  Descent
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
51
25
0
15 Jul 2020
Machine Learning for Offensive Security: Sandbox Classification Using
  Decision Trees and Artificial Neural Networks
Machine Learning for Offensive Security: Sandbox Classification Using Decision Trees and Artificial Neural Networks
William W. Pearce
Nick Landers
Nancy Fulda
11
4
0
14 Jul 2020
Relational-Grid-World: A Novel Relational Reasoning Environment and An
  Agent Model for Relational Information Extraction
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
Faruk Küçüksubasi
Elif Surer
34
2
0
12 Jul 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and
  Hyperparameter Optimization
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization
Yimin Huang
Yujun Li
Hanrong Ye
Zhenguo Li
Zhihua Zhang
70
7
0
11 Jul 2020
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied
  Tasks
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks
Unnat Jain
Luca Weihs
Eric Kolve
Ali Farhadi
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
86
58
0
09 Jul 2020
Self-Supervised Policy Adaptation during Deployment
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
110
163
0
08 Jul 2020
Tracking-by-Trackers with a Distilled and Reinforced Model
Tracking-by-Trackers with a Distilled and Reinforced Model
Matteo Dunnhofer
N. Martinel
C. Micheloni
VOTOffRL
66
4
0
08 Jul 2020
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
66
3
0
06 Jul 2020
Integrating Distributed Architectures in Highly Modular RL Libraries
Integrating Distributed Architectures in Highly Modular RL Libraries
Albert Bou
Sebastian Dittert
Gianni De Fabritiis
76
0
0
06 Jul 2020
Learning Implicit Credit Assignment for Cooperative Multi-Agent
  Reinforcement Learning
Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
Meng Zhou
Ziyu Liu
Pengwei Sui
Yixuan Li
Yuk Ying Chung
72
27
0
06 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
97
72
0
04 Jul 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via
  Metagradient
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
77
21
0
03 Jul 2020
Efficient Marginalization of Discrete and Structured Latent Variables
  via Sparsity
Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity
Gonçalo M. Correia
Vlad Niculae
Wilker Aziz
André F. T. Martins
BDL
162
23
0
03 Jul 2020
Expected Eligibility Traces
Expected Eligibility Traces
H. V. Hasselt
Sephora Madjiheurem
Matteo Hessel
David Silver
André Barreto
Diana Borsa
64
38
0
03 Jul 2020
Towards Generalization and Data Efficient Learning of Deep Robotic
  Grasping
Towards Generalization and Data Efficient Learning of Deep Robotic Grasping
Zhixin Chen
Mengxiang Lin
Zhixin Jia
Shibo Jian
48
6
0
02 Jul 2020
Decentralized Deep Reinforcement Learning for Network Level Traffic
  Signal Control
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control
Jinqiu Guo
27
1
0
02 Jul 2020
Adaptive Discretization for Model-Based Reinforcement Learning
Adaptive Discretization for Model-Based Reinforcement Learning
Sean R. Sinclair
Tianyu Wang
Gauri Jain
Siddhartha Banerjee
Chao Yu
OffRL
89
21
0
01 Jul 2020
Gradient Temporal-Difference Learning with Regularized Corrections
Gradient Temporal-Difference Learning with Regularized Corrections
Sina Ghiassian
Andrew Patterson
Shivam Garg
Dhawal Gupta
Adam White
Martha White
177
42
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo DÉramo
Jan Peters
Joni Pajarinen
OffRL
81
11
0
01 Jul 2020
Robustifying the Deployment of tinyML Models for Autonomous
  mini-vehicles
Robustifying the Deployment of tinyML Models for Autonomous mini-vehicles
Miguel de Prado
Manuele Rusci
Romain Donze
Alessandro Capotondi
Serge Monnerat
Luca Benini and
Nuria Pazos
97
40
0
01 Jul 2020
A Novel RL-assisted Deep Learning Framework for Task-informative Signals
  Selection and Classification for Spontaneous BCIs
A Novel RL-assisted Deep Learning Framework for Task-informative Signals Selection and Classification for Spontaneous BCIs
Wonjun Ko
Eunjin Jeon
Heung-Il Suk
40
15
0
01 Jul 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDLAI4CE
121
164
0
30 Jun 2020
Lachesis: Automatic Partitioning for UDF-Centric Analytics
Lachesis: Automatic Partitioning for UDF-Centric Analytics
Jia Zou
A. Das
Pratik Barhate
Arun Iyengar
Binhang Yuan
Dimitrije Jankov
Chis Jermaine
24
4
0
30 Jun 2020
Previous
123...414243...707172
Next