ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Learning to Plan via a Multi-Step Policy Regression Method
Learning to Plan via a Multi-Step Policy Regression Method
Stefan Sylvius Wagner
Michael Janschek
Tobias Uelwer
Stefan Harmeling
19
0
0
18 Jun 2021
Adapting the Function Approximation Architecture in Online Reinforcement
  Learning
Adapting the Function Approximation Architecture in Online Reinforcement Learning
John D. Martin
Joseph Modayil
60
2
0
17 Jun 2021
RHNAS: Realizable Hardware and Neural Architecture Search
RHNAS: Realizable Hardware and Neural Architecture Search
Yash Akhauri
Adithya Niranjan
J. P. Muñoz
Suvadeep Banerjee
A. Davare
P. Cocchini
A. Sorokin
R. Iyer
Nilesh Jain
49
3
0
17 Jun 2021
A learning agent that acquires social norms from public sanctions in
  decentralized multi-agent settings
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
Eugene Vinitsky
Raphael Köster
J. Agapiou
Edgar A. Duénez-Guzmán
A. Vezhnevets
Joel Z Leibo
81
41
0
16 Jun 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
78
6
0
16 Jun 2021
Analysis and Optimisation of Bellman Residual Errors with Neural
  Function Approximation
Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation
Martin Gottwald
Sven Gronauer
Hao Shen
Klaus Diepold
36
3
0
16 Jun 2021
Real-time Adversarial Perturbations against Deep Reinforcement Learning
  Policies: Attacks and Defenses
Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and Defenses
Buse G. A. Tekgul
Shelly Wang
Samuel Marchal
Nadarajah Asokan
AAMLOffRL
83
6
0
16 Jun 2021
Minimizing Communication while Maximizing Performance in Multi-Agent
  Reinforcement Learning
Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning
V. Vijay
Hassam Sheikh
Somdeb Majumdar
Mariano Phielipp
51
5
0
15 Jun 2021
Deep Reinforcement Learning for Conservation Decisions
Deep Reinforcement Learning for Conservation Decisions
Marcus Lapeyrolerie
Melissa S. Chapman
Kari E. A. Norman
C. Boettiger
OffRL
124
18
0
15 Jun 2021
On Multi-objective Policy Optimization as a Tool for Reinforcement
  Learning: Case Studies in Offline RL and Finetuning
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning
A. Abdolmaleki
Sandy H. Huang
Giulia Vezzani
Bobak Shahriari
Jost Tobias Springenberg
...
András Gyorgy
Csaba Szepesvári
R. Hadsell
N. Heess
Martin Riedmiller
OffRL
56
5
0
15 Jun 2021
Vision-Language Navigation with Random Environmental Mixup
Vision-Language Navigation with Random Environmental Mixup
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
135
88
0
15 Jun 2021
Unsupervised Learning of Visual 3D Keypoints for Control
Unsupervised Learning of Visual 3D Keypoints for Control
Boyuan Chen
Pieter Abbeel
Deepak Pathak
3DPCSSL
92
40
0
14 Jun 2021
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function
  Approximation
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat
Pascal Bianchi
Julien Lehmann
95
9
0
14 Jun 2021
Learning-Aided Heuristics Design for Storage System
Learning-Aided Heuristics Design for Storage System
Yingtian Tang
H. Lu
Xijun Li
Lei Chen
Mingxuan Yuan
Jia Zeng
27
2
0
14 Jun 2021
Variational Policy Search using Sparse Gaussian Process Priors for
  Learning Multimodal Optimal Actions
Variational Policy Search using Sparse Gaussian Process Priors for Learning Multimodal Optimal Actions
Hikaru Sasaki
Takamitsu Matsubara
24
6
0
14 Jun 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
Saurabh Kumar
Ramki Gummadi
Dale Schuurmans
92
15
0
13 Jun 2021
A New Formalism, Method and Open Issues for Zero-Shot Coordination
A New Formalism, Method and Open Issues for Zero-Shot Coordination
Johannes Treutlein
Michael Dennis
Caspar Oesterheld
Jakob N. Foerster
OffRL
87
35
0
11 Jun 2021
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning
Yonggan Fu
Yongan Zhang
Chaojian Li
Zhongzhi Yu
Yingyan Lin
52
6
0
11 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
118
64
0
11 Jun 2021
GDI: Rethinking What Makes Reinforcement Learning Different From
  Supervised Learning
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning
Jiajun Fan
Changnan Xiao
Yue Huang
OffRL
91
10
0
11 Jun 2021
Taylor Expansion of Discount Factors
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
63
5
0
11 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Helen Zhou
Ji Liu
71
118
0
11 Jun 2021
DECORE: Deep Compression with Reinforcement Learning
DECORE: Deep Compression with Reinforcement Learning
Manoj Alwani
Yang Wang
Vashisht Madhavan
AI4CE
75
44
0
11 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Helen Zhou
SSL
94
15
0
10 Jun 2021
RLCorrector: Reinforced Proofreading for Cell-level Microscopy Image
  Segmentation
RLCorrector: Reinforced Proofreading for Cell-level Microscopy Image Segmentation
K. Nguyen
Ganghee Jang
T. Tuan
Won-Ki Jeong
112
2
0
10 Jun 2021
Eye of the Beholder: Improved Relation Generalization for Text-based
  Reinforcement Learning Agents
Eye of the Beholder: Improved Relation Generalization for Text-based Reinforcement Learning Agents
K. Murugesan
Subhajit Chaudhury
Kartik Talamadupula
99
5
0
09 Jun 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via
  Relabeling Experience and Unsupervised Pre-training
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
70
289
0
09 Jun 2021
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion
  Attacks in Deep RL
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL
Yanchao Sun
Ruijie Zheng
Yongyuan Liang
Furong Huang
AAML
110
69
0
09 Jun 2021
Vector Quantized Models for Planning
Vector Quantized Models for Planning
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
94
51
0
08 Jun 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
80
11
0
08 Jun 2021
Linear Convergence of Entropy-Regularized Natural Policy Gradient with
  Linear Function Approximation
Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation
Semih Cayci
Niao He
R. Srikant
108
36
0
08 Jun 2021
Amortized Generation of Sequential Algorithmic Recourses for Black-box
  Models
Amortized Generation of Sequential Algorithmic Recourses for Black-box Models
Sahil Verma
Keegan E. Hines
John P. Dickerson
94
24
0
07 Jun 2021
Correcting Momentum in Temporal Difference Learning
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio
Joelle Pineau
Doina Precup
72
10
0
07 Jun 2021
Offline Policy Comparison under Limited Historical Agent-Environment
  Interactions
Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Anton Dereventsov
Joseph Daws
Clayton Webster
OffRL
65
3
0
07 Jun 2021
Launchpad: A Programming Model for Distributed Machine Learning Research
Launchpad: A Programming Model for Distributed Machine Learning Research
Fan Yang
Gabriel Barth-Maron
Piotr Stańczyk
Matthew Hoffman
Siqi Liu
M. Kroiss
Aedan Pope
Alban Rrustemi
71
24
0
07 Jun 2021
RegMix: Data Mixing Augmentation for Regression
RegMix: Data Mixing Augmentation for Regression
Seonghyeon Hwang
Steven Euijong Whang
UQCV
57
9
0
07 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
47
50
0
06 Jun 2021
Learning Routines for Effective Off-Policy Reinforcement Learning
Learning Routines for Effective Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
28
1
0
05 Jun 2021
MALib: A Parallel Framework for Population-based Multi-agent
  Reinforcement Learning
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Ming Zhou
Bo Liu
Hanjing Wang
Muning Wen
Runzhe Wu
Ying Wen
Yaodong Yang
Weinan Zhang
Jun Wang
OffRL
61
49
0
05 Jun 2021
Efficient Classification of Very Large Images with Tiny Objects
Efficient Classification of Very Large Images with Tiny Objects
Fanjie Kong
Ricardo Henao
102
35
0
04 Jun 2021
UAV Swarm Path Planning with Reinforcement Learning for Field
  prospecting
UAV Swarm Path Planning with Reinforcement Learning for Field prospecting
Alejandro Puente-Castro
Daniel Rivero
A. Pazos
Enrique Fernández-Blanco
50
39
0
04 Jun 2021
Detecting and Adapting to Novelty in Games
Detecting and Adapting to Novelty in Games
Xiangyu Peng
Jonathan C. Balloch
Mark O. Riedl
TTA
51
10
0
04 Jun 2021
Hierarchical Representation Learning for Markov Decision Processes
Hierarchical Representation Learning for Markov Decision Processes
Lorenzo Steccanella
Simone Totaro
Anders Jonsson
62
4
0
03 Jun 2021
Robot in a China Shop: Using Reinforcement Learning for
  Location-Specific Navigation Behaviour
Robot in a China Shop: Using Reinforcement Learning for Location-Specific Navigation Behaviour
Xihan Bian
Oscar Alejandro Mendez Maldonado
Simon Hadfield
42
3
0
02 Jun 2021
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Johan Bjorck
Carla P. Gomes
Kilian Q. Weinberger
106
23
0
02 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement
  Learning
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
71
5
0
01 Jun 2021
Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
Jiahui Li
Kun Kuang
Baoxiang Wang
Furui Liu
Long Chen
Leilei Gan
Jun Xiao
OffRL
129
66
0
01 Jun 2021
Deep Reinforcement Learning in Quantitative Algorithmic Trading: A
  Review
Deep Reinforcement Learning in Quantitative Algorithmic Trading: A Review
Tidor-Vlad Pricope
AIFin
80
36
0
31 May 2021
Reducing the Deployment-Time Inference Control Costs of Deep
  Reinforcement Learning Agents via an Asymmetric Architecture
Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric Architecture
Chin-Jui Chang
Yu-Wei Chu
Chao-Hsien Ting
Hao-Kang Liu
Zhang-Wei Hong
Chun-Yi Lee
AI4CE
27
1
0
30 May 2021
Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems
Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems
Xijun Li
Weilin Luo
Mingxuan Yuan
Jun Wang
Jiawen Lu
Jie Wang
Jinhu Lu
Jia Zeng
68
42
0
27 May 2021
Previous
123...323334...707172
Next