ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Gradient-based Adaptive Markov Chain Monte Carlo
Gradient-based Adaptive Markov Chain Monte Carlo
Michalis K. Titsias
P. Dellaportas
BDL
102
22
0
04 Nov 2019
Learning to Scaffold the Development of Robotic Manipulation Skills
Learning to Scaffold the Development of Robotic Manipulation Skills
Lin Shao
Toki Migimatsu
Jeannette Bohg
97
30
0
03 Nov 2019
Online Robustness Training for Deep Reinforcement Learning
Online Robustness Training for Deep Reinforcement Learning
Marc Fischer
M. Mirman
Steven Stalder
Martin Vechev
OnRL
110
41
0
03 Nov 2019
Maximum Entropy Diverse Exploration: Disentangling Maximum Entropy
  Reinforcement Learning
Maximum Entropy Diverse Exploration: Disentangling Maximum Entropy Reinforcement Learning
Andrew Cohen
Lei Yu
Xingye Qiao
Xiangrong Tong
62
2
0
03 Nov 2019
Learning from Trajectories via Subgoal Discovery
Learning from Trajectories via Subgoal Discovery
S. Paul
J. Baar
Amit K. Roy-Chowdhury
151
48
0
03 Nov 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
121
32
0
01 Nov 2019
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion
  Frames
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames
Erik Wijmans
Abhishek Kadian
Ari S. Morcos
Stefan Lee
Irfan Essa
Devi Parikh
Manolis Savva
Dhruv Batra
114
485
0
01 Nov 2019
A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive
  Disturbances
A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive Disturbances
Wenjie Lu
Dikai Liu
21
0
0
01 Nov 2019
A Narration-based Reward Shaping Approach using Grounded Natural
  Language Commands
A Narration-based Reward Shaping Approach using Grounded Natural Language Commands
Nicholas R. Waytowich
Sean L. Barton
Vernon J. Lawhern
Garrett A. Warnell
OffRL
72
17
0
31 Oct 2019
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement
  Learning and Hierarchical Actions Filtering
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
60
40
0
31 Oct 2019
DADI: Dynamic Discovery of Fair Information with Adversarial
  Reinforcement Learning
DADI: Dynamic Discovery of Fair Information with Adversarial Reinforcement Learning
Michiel A. Bakker
Duy Patrick Tu
Humberto Riverón Valdés
Krishna P. Gummadi
Kush R. Varshney
Adrian Weller
Alex Pentland
75
5
0
30 Oct 2019
Thompson Sampling via Local Uncertainty
Thompson Sampling via Local Uncertainty
Zhendong Wang
Mingyuan Zhou
80
19
0
30 Oct 2019
Learning to Manipulate Deformable Objects without Demonstrations
Learning to Manipulate Deformable Objects without Demonstrations
Yilin Wu
Wilson Yan
Thanard Kurutach
Lerrel Pinto
Pieter Abbeel
OffRL
78
202
0
29 Oct 2019
Generalization of Reinforcement Learners with Working and Episodic
  Memory
Generalization of Reinforcement Learners with Working and Episodic Memory
Meire Fortunato
Melissa Tan
Ryan Faulkner
Steven Hansen
Adria Puigdomenech Badia
Gavin Buttimore
Charlie Deck
Joel Z Leibo
Charles Blundell
119
71
0
29 Oct 2019
Feedback Linearization for Unknown Systems via Reinforcement Learning
Feedback Linearization for Unknown Systems via Reinforcement Learning
T. Westenbroek
David Fridovich-Keil
Eric Mazumdar
Shreyas Arora
Valmik Prabhu
S. Shankar Sastry
Claire Tomlin
51
28
0
29 Oct 2019
Learning to Predict Without Looking Ahead: World Models Without Forward
  Prediction
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
C. Freeman
Luke Metz
David R Ha
89
36
0
29 Oct 2019
Learning Transferable Graph Exploration
Learning Transferable Graph Exploration
H. Dai
Yujia Li
Chenglong Wang
Rishabh Singh
Po-Sen Huang
Pushmeet Kohli
79
22
0
28 Oct 2019
Generalization in Reinforcement Learning with Selective Noise Injection
  and Information Bottleneck
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
Maximilian Igl
K. Ciosek
Yingzhen Li
Sebastian Tschiatschek
Cheng Zhang
Sam Devlin
Katja Hofmann
OffRL
95
174
0
28 Oct 2019
Certified Adversarial Robustness for Deep Reinforcement Learning
Certified Adversarial Robustness for Deep Reinforcement Learning
Björn Lütjens
Michael Everett
Jonathan P. How
AAML
109
96
0
28 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
60
27
0
28 Oct 2019
PRNet: Self-Supervised Learning for Partial-to-Partial Registration
PRNet: Self-Supervised Learning for Partial-to-Partial Registration
Yue Wang
Justin Solomon
SSL3DPC
157
390
0
27 Oct 2019
Convergent Policy Optimization for Safe Reinforcement Learning
Convergent Policy Optimization for Safe Reinforcement Learning
Ming Yu
Zhuoran Yang
Mladen Kolar
Zhaoran Wang
102
96
0
26 Oct 2019
Comparing Observation and Action Representations for Deep Reinforcement
  Learning in $μ$RTS
Comparing Observation and Action Representations for Deep Reinforcement Learning in μμμRTS
Shengyi Huang
Santiago Ontañón
65
8
0
26 Oct 2019
Collision Avoidance in Pedestrian-Rich Environments with Deep
  Reinforcement Learning
Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement Learning
Michael Everett
Yu Fan Chen
Jonathan P. How
OffRL
159
176
0
24 Oct 2019
Robust Model Predictive Shielding for Safe Reinforcement Learning with
  Stochastic Dynamics
Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics
Shuo Li
Osbert Bastani
84
86
0
24 Oct 2019
Attention-based Curiosity-driven Exploration in Deep Reinforcement
  Learning
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
Patrik Reizinger
Marton Szemenyei
45
16
0
23 Oct 2019
Partially Detected Intelligent Traffic Signal Control: Environmental
  Adaptation
Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation
Rusheng Zhang
Romain Leteurtre
Benjamin Striner
Ammar S. Alanazi
Abdullah A. Alghafis
Ozan K. Tonguz
42
13
0
23 Oct 2019
Improving the Gating Mechanism of Recurrent Neural Networks
Improving the Gating Mechanism of Recurrent Neural Networks
Albert Gu
Çağlar Gülçehre
T. Paine
Matthew W. Hoffman
Razvan Pascanu
AI4CE
35
2
0
22 Oct 2019
Collaborative Graph Walk for Semi-supervised Multi-Label Node
  Classification
Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification
Uchenna Akujuobi
Yufei Han
Qiannan Zhang
Xiangliang Zhang
55
17
0
22 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
64
80
0
21 Oct 2019
Regularization Matters in Policy Optimization
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
83
33
0
21 Oct 2019
A New Framework for Multi-Agent Reinforcement Learning -- Centralized
  Training and Exploration with Decentralized Execution via Policy Distillation
A New Framework for Multi-Agent Reinforcement Learning -- Centralized Training and Exploration with Decentralized Execution via Policy Distillation
Gang Chen
64
41
0
21 Oct 2019
Diverse Behavior Is What Game AI Needs: Generating Varied Human-Like Playing Styles Using Evolutionary Multi-Objective Deep Reinforcement Learning
R. Shen
Yan Zheng
Jianye Hao
Yinfeng Chen
Changjie Fan
SyDa
51
3
0
20 Oct 2019
Autonomous Industrial Management via Reinforcement Learning:
  Self-Learning Agents for Decision-Making -- A Review
Autonomous Industrial Management via Reinforcement Learning: Self-Learning Agents for Decision-Making -- A Review
L. E. Leal
Magnus Westerlund
A. Chapman
22
0
0
20 Oct 2019
A Structured Prediction Approach for Generalization in Cooperative
  Multi-Agent Reinforcement Learning
A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning
Nicolas Carion
Gabriel Synnaeve
A. Lazaric
Nicolas Usunier
65
29
0
19 Oct 2019
OffWorld Gym: open-access physical robotics environment for real-world
  reinforcement learning benchmark and research
OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research
Ashish Kumar
Toby Buckley
John B. Lanier
Qiaozhi Wang
A. Kavelaars
Ilya Kuzovkin
OffRL
90
14
0
18 Oct 2019
On the Sample Complexity of Actor-Critic Method for Reinforcement
  Learning with Function Approximation
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
171
84
0
18 Oct 2019
Reinforcement Learning for Robotic Manipulation using Simulated
  Locomotion Demonstrations
Reinforcement Learning for Robotic Manipulation using Simulated Locomotion Demonstrations
Ozsel Kilinc
Giovanni Montana
54
39
0
16 Oct 2019
Parallel Exploration via Negatively Correlated Search
Parallel Exploration via Negatively Correlated Search
Peng Yang
Qi Yang
K. Tang
Xin Yao
127
14
0
16 Oct 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
175
108
0
15 Oct 2019
Dynamic Attention Networks for Task Oriented Grounding
Dynamic Attention Networks for Task Oriented Grounding
S. Dasgupta
Badri N. Patro
Vinay P. Namboodiri
86
1
0
14 Oct 2019
Neural Program Synthesis By Self-Learning
Neural Program Synthesis By Self-Learning
Yifan Xu
Luke Dai
Udaikaran Singh
Kening Zhang
Zhuowen Tu
58
6
0
13 Oct 2019
Rethinking Exposure Bias In Language Modeling
Rethinking Exposure Bias In Language Modeling
Yifan Xu
Kening Zhang
Haoyu Dong
Yuezhou Sun
Wenlong Zhao
Zhuowen Tu
72
5
0
13 Oct 2019
Stabilizing Transformers for Reinforcement Learning
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
115
368
0
13 Oct 2019
Curiosity-Driven Recommendation Strategy for Adaptive Learning via Deep
  Reinforcement Learning
Curiosity-Driven Recommendation Strategy for Adaptive Learning via Deep Reinforcement Learning
Ruijian Han
Kani Chen
Chunxi Tan
24
14
0
12 Oct 2019
Orchestrating the Development Lifecycle of Machine Learning-Based IoT
  Applications: A Taxonomy and Survey
Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey
Bin Qian
Jie Su
Z. Wen
D. N. Jha
Yinhao Li
...
Albert Y. Zomaya
Omer F. Rana
Lizhe Wang
Maciej Koutny
R. Ranjan
75
4
0
11 Oct 2019
Decoupling Hierarchical Recurrent Neural Networks With Locally
  Computable Losses
Decoupling Hierarchical Recurrent Neural Networks With Locally Computable Losses
Asier Mujika
Felix Weissenberger
Angelika Steger
49
0
0
11 Oct 2019
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary
  Rewards
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Siyuan Li
Rui Wang
Minxue Tang
Chongjie Zhang
77
83
0
10 Oct 2019
Imitation Learning from Observations by Minimizing Inverse Dynamics
  Disagreement
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
Chao Yang
Xiaojian Ma
Wenbing Huang
F. Sun
Huaping Liu
Junzhou Huang
Chuang Gan
119
71
0
10 Oct 2019
Prescribed Generative Adversarial Networks
Prescribed Generative Adversarial Networks
Adji Bousso Dieng
Francisco J. R. Ruiz
David M. Blei
Michalis K. Titsias
GANDRL
83
62
0
09 Oct 2019
Previous
123...495051...707172
Next