ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
A Survey on Reinforcement Learning in Aviation Applications
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
54
56
0
03 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial
  Observability
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert Platt
OffRL
97
20
0
03 Nov 2022
Reinforcement Learning Applied to Trading Systems: A Survey
Reinforcement Learning Applied to Trading Systems: A Survey
L. Felizardo
Francisco Caio Lima Paiva
Anna Helena Reali Costa
E. Del-Moral-Hernandez
AIFin
51
1
0
01 Nov 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
71
6
0
31 Oct 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight
  Grouping for Multi-Agent Reinforcement Learning
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
56
2
0
29 Oct 2022
In-context Reinforcement Learning with Algorithm Distillation
In-context Reinforcement Learning with Algorithm Distillation
Michael Laskin
Luyu Wang
Junhyuk Oh
Emilio Parisotto
Stephen Spencer
...
Ethan A. Brooks
Maxime Gazeau
Himanshu Sahni
Satinder Singh
Volodymyr Mnih
OffRL
85
133
0
25 Oct 2022
Local Connection Reinforcement Learning Method for Efficient Control of
  Robotic Peg-in-Hole Assembly
Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly
Yuhang Gai
Jiwen Zhang
Dan Wu
Ken Chen
OffRL
58
1
0
24 Oct 2022
On Many-Actions Policy Gradient
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
70
0
0
24 Oct 2022
AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight
  Experience Replay
AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay
Adarsh Sehgal
Muskan Sehgal
Hung M. La
31
2
0
24 Oct 2022
Climate Change Policy Exploration using Reinforcement Learning
Climate Change Policy Exploration using Reinforcement Learning
Theodore Wolf
49
1
0
23 Oct 2022
Multi-Objective GFlowNets
Multi-Objective GFlowNets
Moksh Jain
Sharath Chandra Raparthy
Alex Hernandez-Garcia
Jarrid Rector-Brooks
Yoshua Bengio
Santiago Miret
Emmanuel Bengio
111
91
0
23 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
74
6
0
22 Oct 2022
Biologically Plausible Variational Policy Gradient with Spiking
  Recurrent Winner-Take-All Networks
Biologically Plausible Variational Policy Gradient with Spiking Recurrent Winner-Take-All Networks
Zhile Yang
Shangqi Guo
Ying Fang
Jian K. Liu
25
1
0
21 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
73
9
0
21 Oct 2022
Self-Supervised Learning via Maximum Entropy Coding
Self-Supervised Learning via Maximum Entropy Coding
Xin Liu
Zhongdao Wang
Yali Li
Shengjin Wang
SSL
137
43
0
20 Oct 2022
Krylov-Bellman boosting: Super-linear policy evaluation in general state
  spaces
Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces
Eric Xia
Martin J. Wainwright
OffRL
57
2
0
20 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic
  Manipulator Control
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
94
5
0
20 Oct 2022
Trust Region Policy Optimization with Optimal Transport Discrepancies:
  Duality and Algorithm for Continuous Actions
Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions
Antonio Terpin
Nicolas Lanzetti
Batuhan Yardim
Florian Dorfler
Giorgia Ramponi
70
5
0
20 Oct 2022
Emerging Threats in Deep Learning-Based Autonomous Driving: A
  Comprehensive Survey
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Huiyun Cao
Wenlong Zou
Yinkun Wang
Ting Song
Mengjun Liu
AAML
100
6
0
19 Oct 2022
Hierarchical Reinforcement Learning for Furniture Layout in Virtual
  Indoor Scenes
Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
Xinhan Di
Pengqian Yu
3DV
33
0
0
19 Oct 2022
Topology Optimization via Machine Learning and Deep Learning: A Review
Topology Optimization via Machine Learning and Deep Learning: A Review
S. Shin
Dongju Shin
Namwoo Kang
AI4CE
88
69
0
19 Oct 2022
Commonsense Knowledge from Scene Graphs for Textual Environments
Commonsense Knowledge from Scene Graphs for Textual Environments
Tsunehiko Tanaka
Daiki Kimura
Michiaki Tatsubori
69
2
0
19 Oct 2022
ULN: Towards Underspecified Vision-and-Language Navigation
ULN: Towards Underspecified Vision-and-Language Navigation
Weixi Feng
Tsu-Jui Fu
Yujie Lu
William Yang Wang
113
5
0
18 Oct 2022
Finite-time analysis of single-timescale actor-critic
Finite-time analysis of single-timescale actor-critic
Xu-yang Chen
Lin Zhao
OffRL
85
24
0
18 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
72
2
0
18 Oct 2022
Towards More Efficient Shared Autonomous Mobility: A Learning-Based
  Fleet Repositioning Approach
Towards More Efficient Shared Autonomous Mobility: A Learning-Based Fleet Repositioning Approach
Monika Filipovska
Michael F. Hyland
Haimanti Bala
65
0
0
16 Oct 2022
DyFEn: Agent-Based Fee Setting in Payment Channel Networks
DyFEn: Agent-Based Fee Setting in Payment Channel Networks
Kian Asgari
Aida Mohammadian
M. Tefagh
31
7
0
15 Oct 2022
CUP: Critic-Guided Policy Reuse
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
86
8
0
15 Oct 2022
Just Round: Quantized Observation Spaces Enable Memory Efficient
  Learning of Dynamic Locomotion
Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion
Lev Grossman
Brian Plancher
MQ
69
4
0
14 Oct 2022
A Scalable Finite Difference Method for Deep Reinforcement Learning
A Scalable Finite Difference Method for Deep Reinforcement Learning
Matthew Allen
John C. Raisbeck
Hakho Lee
52
0
0
14 Oct 2022
COFFEE: Counterfactual Fairness for Personalized Text Generation in
  Explainable Recommendation
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang
Qifan Wang
Yi-Chia Wang
Maziar Sanjabi
Jingzhou Liu
Hamed Firooz
Hongning Wang
Shaoliang Nie
107
6
0
14 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter
  Market Simulations
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
75
17
0
13 Oct 2022
Policy Gradient With Serial Markov Chain Reasoning
Policy Gradient With Serial Markov Chain Reasoning
Edoardo Cetin
Oya Celiktutan
BDLLRM
61
2
0
13 Oct 2022
Simulated Contextual Bandits for Personalization Tasks from
  Recommendation Datasets
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Anton Dereventsov
A. Bibin
66
1
0
12 Oct 2022
DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a
  Real Steam Turbine System
DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System
M. Modirrousta
M. A. Shoorehdeli
M. Yari
A. Ghahremani
31
2
0
12 Oct 2022
Point Cloud Scene Completion with Joint Color and Semantic Estimation
  from Single RGB-D Image
Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image
Zhaoxuan Zhang
Xiaoguang Han
B. Dong
Tong Li
Baocai Yin
Xin Yang
3DPC3DV
71
8
0
12 Oct 2022
Discovered Policy Optimisation
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
111
79
0
11 Oct 2022
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based
  Policy Learning
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Arrasy Rahman
Ignacio Carlucho
Niklas Höpner
Stefano V. Albrecht
114
11
0
11 Oct 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation
  Approach
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
Peng Mi
Li Shen
Tianhe Ren
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
Dacheng Tao
AAML
123
72
0
11 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep
  Reinforcement Learning in Complex Problems
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
40
0
0
10 Oct 2022
Experiential Explanations for Reinforcement Learning
Experiential Explanations for Reinforcement Learning
Amal Alabdulkarim
Madhuri Singh
Gennie Mansi
Kaely Hall
Mark O. Riedl
Mark O. Riedl
OffRL
146
3
0
10 Oct 2022
How to Enable Uncertainty Estimation in Proximal Policy Optimization
How to Enable Uncertainty Estimation in Proximal Policy Optimization
Eugene Bykovets
Yannick Metz
Mennatallah El-Assady
Daniel A. Keim
J. M. Buhmann
UQCV
91
1
0
07 Oct 2022
Self-Adaptive Driving in Nonstationary Environments through Conjectural
  Online Lookahead Adaptation
Self-Adaptive Driving in Nonstationary Environments through Conjectural Online Lookahead Adaptation
Tao Li
Haozhe Lei
Quanyan Zhu
124
11
0
06 Oct 2022
Deep Inventory Management
Deep Inventory Management
Dhruv Madeka
Kari Torkkola
Carson Eisenach
Anna Luo
Dean Phillips Foster
Sham M. Kakade
BDL
85
15
0
06 Oct 2022
A New Path: Scaling Vision-and-Language Navigation with Synthetic
  Instructions and Imitation Learning
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
106
48
0
06 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
85
3
0
06 Oct 2022
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement
  Learning
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning
Carlos M. Casas
B. Carro
Antonio J. Sánchez-Esguevillas
26
2
0
06 Oct 2022
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera
  based on Neuromorphic Reinforcement Learning
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera based on Neuromorphic Reinforcement Learning
Junjie Jiang
Delei Kong
Kuanxu Hou
Xinjie Huang
Zhuang Hao
Zheng Fang
78
9
0
05 Oct 2022
Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of
  Connected Autonomous Vehicles in Challenging Scenarios
Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios
Zhili Zhang
Songyang Han
Jiangwei Wang
Fei Miao
91
20
0
05 Oct 2022
Game Theoretic Rating in N-player general-sum games with Equilibria
Game Theoretic Rating in N-player general-sum games with Equilibria
Luke Marris
Marc Lanctot
I. Gemp
Shayegan Omidshafiei
Stephen Marcus McAleer
Jerome T. Connor
K. Tuyls
T. Graepel
76
3
0
05 Oct 2022
Previous
123...192021...707172
Next