ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Causal Campbell-Goodhart's law and Reinforcement Learning
Causal Campbell-Goodhart's law and Reinforcement Learning
Hal Ashton
CML
71
4
0
02 Nov 2020
Learning a Deep Reinforcement Learning Policy Over the Latent Space of a
  Pre-trained GAN for Semantic Age Manipulation
Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation
K. Shubham
Gopalakrishnan Venkatesh
Reijul Sachdev
Akshi
D. Jayagopi
G. Srinivasaraghavan
GAN
61
6
0
02 Nov 2020
Reinforcement Learning with Efficient Active Feature Acquisition
Reinforcement Learning with Efficient Active Feature Acquisition
Haiyan Yin
Yingzhen Li
Sinno Jialin Pan
Cheng Zhang
Sebastian Tschiatschek
OffRL
57
14
0
02 Nov 2020
Cooperative Heterogeneous Deep Reinforcement Learning
Cooperative Heterogeneous Deep Reinforcement Learning
Han Zheng
Pengfei Wei
Jing Jiang
Guodong Long
Qinghua Lu
Chengqi Zhang
96
12
0
02 Nov 2020
Observation Space Matters: Benchmark and Optimization Algorithm
Observation Space Matters: Benchmark and Optimization Algorithm
J. Kim
Sehoon Ha
OODOffRL
49
11
0
02 Nov 2020
Fast Reinforcement Learning with Incremental Gaussian Mixture Models
Fast Reinforcement Learning with Incremental Gaussian Mixture Models
R. Pinto
24
1
0
02 Nov 2020
A Policy Gradient Algorithm for Learning to Learn in Multiagent
  Reinforcement Learning
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Dong-Ki Kim
Miao Liu
Matthew D Riemer
Chuangchuang Sun
Marwa Abdulhai
Golnaz Habibi
Sebastian Lopez-Cot
Gerald Tesauro
Jonathan P. How
61
56
0
31 Oct 2020
Finding the Near Optimal Policy via Adaptive Reduced Regularization in
  MDPs
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs
Wenhao Yang
Xiang Li
Guangzeng Xie
Zhihua Zhang
91
5
0
31 Oct 2020
Machine versus Human Attention in Deep Reinforcement Learning Tasks
Machine versus Human Attention in Deep Reinforcement Learning Tasks
Sihang Guo
Ruohan Zhang
Bo Liu
Yifeng Zhu
M. Hayhoe
D. Ballard
Peter Stone
OffRL
101
28
0
29 Oct 2020
Reinforcement Learning of Causal Variables Using Mediation Analysis
Reinforcement Learning of Causal Variables Using Mediation Analysis
Tue Herlau
Rasmus Larsen
OODCML
66
8
0
29 Oct 2020
Low-Variance Policy Gradient Estimation with World Models
Low-Variance Policy Gradient Estimation with World Models
Michal Nauman
Floris den Hengst
OffRL
55
1
0
29 Oct 2020
Learning to Unknot
Learning to Unknot
Sergei Gukov
James Halverson
Fabian Ruehle
P. Sułkowski
98
59
0
28 Oct 2020
DeepFoldit -- A Deep Reinforcement Learning Neural Network Folding
  Proteins
DeepFoldit -- A Deep Reinforcement Learning Neural Network Folding Proteins
Dimitra N. Panou
M. Reczko
77
3
0
28 Oct 2020
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via
  Latent Model Ensembles
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
112
14
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
116
40
0
27 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time
  Systems with Lipschitz Continuous Controls
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Jeongho Kim
Jaeuk Shin
Insoon Yang
61
35
0
27 Oct 2020
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement
  Learning
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning
Thomas Carta
Subhajit Chaudhury
Kartik Talamadupula
Michiaki Tatsubori
32
3
0
26 Oct 2020
Deep reinforced learning enables solving rich discrete-choice life cycle
  models to analyze social security reforms
Deep reinforced learning enables solving rich discrete-choice life cycle models to analyze social security reforms
A. Tanskanen
29
1
0
26 Oct 2020
Learning Multi-Agent Coordination for Enhancing Target Coverage in
  Directional Sensor Networks
Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor Networks
Jing Xu
Fangwei Zhong
Yizhou Wang
83
50
0
25 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based
  Reinforcement Learning
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
51
0
0
24 Oct 2020
Deep Neural Mobile Networking
Deep Neural Mobile Networking
Chaoyun Zhang
81
1
0
23 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement
  Learning
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
140
18
0
23 Oct 2020
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced
  Reinforcement Learning
Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning
Shen Ren
Qianxiao Li
Liye Zhang
Zheng Qin
Bo Yang
28
0
0
22 Oct 2020
Detecting Rewards Deterioration in Episodic Reinforcement Learning
Detecting Rewards Deterioration in Episodic Reinforcement Learning
Ido Greenberg
Shie Mannor
OffRL
66
13
0
22 Oct 2020
Deep Reinforcement Learning with Stacked Hierarchical Attention for
  Text-based Games
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games
Yunqiu Xu
Meng Fang
Ling-Hao Chen
Yali Du
Qiufeng Wang
Chengqi Zhang
OffRL
102
44
0
22 Oct 2020
Sample Efficient Reinforcement Learning with REINFORCE
Sample Efficient Reinforcement Learning with REINFORCE
Junzi Zhang
Jongho Kim
Brendan O'Donoghue
Stephen P. Boyd
131
113
0
22 Oct 2020
Logistic Q-Learning
Logistic Q-Learning
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
108
40
0
21 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for
  Payment Fraud Systems in Retail Banking
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
77
21
0
21 Oct 2020
Visual Navigation in Real-World Indoor Environments Using End-to-End
  Deep Reinforcement Learning
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning
Jonáš Kulhánek
Erik Derner
Robert Babuška
75
41
0
21 Oct 2020
Iterative Amortized Policy Optimization
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
117
21
0
20 Oct 2020
Negotiating Team Formation Using Deep Reinforcement Learning
Negotiating Team Formation Using Deep Reinforcement Learning
Yoram Bachrach
Richard Everett
Edward Hughes
Angeliki Lazaridou
Joel Z Leibo
Marc Lanctot
Michael Bradley Johanson
Wojciech M. Czarnecki
T. Graepel
109
36
0
20 Oct 2020
Quality of service based radar resource management using deep
  reinforcement learning
Quality of service based radar resource management using deep reinforcement learning
S. Durst
S. Brüggenwirth
23
12
0
20 Oct 2020
Integrating LEO Satellites and Multi-UAV Reinforcement Learning for
  Hybrid FSO/RF Non-Terrestrial Networks
Integrating LEO Satellites and Multi-UAV Reinforcement Learning for Hybrid FSO/RF Non-Terrestrial Networks
Ju-Hyung Lee
Jihong Park
M. Bennis
Young-Chai Ko
86
51
0
20 Oct 2020
Multi-Radar Tracking Optimization for Collaborative Combat
Multi-Radar Tracking Optimization for Collaborative Combat
Nouredine Nour
Reda Belhaj-Soullami
Cédric L. R. Buron
A. Peres
F. Barbaresco
19
2
0
20 Oct 2020
Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing
Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing
Sayyed Jaffar Ali Raza
Apan Dastider
Mingjie Lin
23
1
0
20 Oct 2020
Improving Dialog Systems for Negotiation with Personality Modeling
Improving Dialog Systems for Negotiation with Personality Modeling
Runzhe Yang
Jingxiao Chen
Karthik Narasimhan
106
51
0
20 Oct 2020
Proximal Policy Gradient: PPO with Policy Gradient
Proximal Policy Gradient: PPO with Policy Gradient
Ju-Seung Byun
Byungmoon Kim
Huamin Wang
OffRL
46
8
0
20 Oct 2020
Watch-And-Help: A Challenge for Social Perception and Human-AI
  Collaboration
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration
Xavier Puig
Tianmin Shu
Shuang Li
Zilin Wang
Yuan-Hong Liao
J. Tenenbaum
Sanja Fidler
Antonio Torralba
LM&Ro
163
130
0
19 Oct 2020
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for
  Autonomous Driving
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
198
198
0
19 Oct 2020
What About Inputing Policy in Value Function: Policy Representation and
  Policy-extended Value Function Approximator
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
89
7
0
19 Oct 2020
Language and Visual Entity Relationship Graph for Agent Navigation
Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Yuankai Qi
Qi Wu
Stephen Gould
LM&Ro
229
135
0
19 Oct 2020
Belief-Grounded Networks for Accelerated Robot Learning under Partial
  Observability
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Hai V. Nguyen
Brett Daley
Xinchao Song
Chris Amato
Robert Platt
83
14
0
19 Oct 2020
Average-reward model-free reinforcement learning: a systematic review
  and literature mapping
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
94
30
0
18 Oct 2020
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning
  with Intrinsic-Extrinsic Modeling
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling
Xin Ye
Yezhou Yang
83
15
0
16 Oct 2020
PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent
  Learning -- Lifelong
PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Mehul Damani
Zhiyao Luo
Emerson Wenzel
Guillaume Sartoretti
AI4CE
168
127
0
16 Oct 2020
Autonomous Control of a Particle Accelerator using Deep Reinforcement
  Learning
Autonomous Control of a Particle Accelerator using Deep Reinforcement Learning
X. Pang
S. Thulasidasan
L. Rybarcyk
57
10
0
16 Oct 2020
Cooperative-Competitive Reinforcement Learning with History-Dependent
  Rewards
Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards
Keyang He
Bikramjit Banerjee
Prashant Doshi
58
10
0
15 Oct 2020
Multi-Agent Trust Region Policy Optimization
Multi-Agent Trust Region Policy Optimization
Hepeng Li
Haibo He
106
42
0
15 Oct 2020
MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement
  Learning Agents
MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning Agents
Stephen Chung
46
5
0
15 Oct 2020
A game-theoretic analysis of networked system control for common-pool
  resource management using multi-agent reinforcement learning
A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning
Arnu Pretorius
Scott A. Cameron
Elan Van Biljon
Tom Makkink
Shahil Mawjee
J. D. Plessis
Jonathan P. Shock
Alexandre Laterre
Karim Beguir
67
12
0
15 Oct 2020
Previous
123...383940...707172
Next