ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
AISYN: AI-driven Reinforcement Learning-Based Logic Synthesis Framework
AISYN: AI-driven Reinforcement Learning-Based Logic Synthesis Framework
Ghasem Pasandi
Sreedhar Pratty
James Forsyth
49
6
0
08 Feb 2023
DITTO: Offline Imitation Learning with World Models
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
82
18
0
06 Feb 2023
Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for
  Parkinson Disease Treatment
Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for Parkinson Disease Treatment
Qitong Gao
Stephen L. Schimdt
Afsana Chowdhury
Guangyu Feng
Jennifer J. Peters
Katherine Genty
W. Grill
Dennis A. Turner
Miroslav Pajic
OffRL
81
11
0
05 Feb 2023
Open Problems and Modern Solutions for Deep Reinforcement Learning
Open Problems and Modern Solutions for Deep Reinforcement Learning
Weiqin Chen
OffRL
112
0
0
05 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLLOffRL
159
0
0
04 Feb 2023
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Qingpeng Cai
Zhenghai Xue
Chi Zhang
Wanqi Xue
Shuchang Liu
...
Tianyou Zuo
Wentao Xie
Dong Zheng
Peng Jiang
Kun Gai
OffRLCML
75
44
0
03 Feb 2023
Multiple Thinking Achieving Meta-Ability Decoupling for Object
  Navigation
Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation
Ronghao Dang
Lu Chen
Liuyi Wang
Zongtao He
Chengju Liu
Qi Chen
LRM
58
8
0
03 Feb 2023
Learning to Optimize for Reinforcement Learning
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
115
7
0
03 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViTMedImAI4TSAI4CE
112
10
0
01 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
93
1
0
01 Feb 2023
Learning Cut Selection for Mixed-Integer Linear Programming via
  Hierarchical Sequence Model
Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model
Zhihai Wang
Xijun Li
Jie Wang
Yufei Kuang
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
86
42
0
01 Feb 2023
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for
  Dynamic Environments
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments
Tan Chong Min John
Mehul Motani
43
2
0
31 Jan 2023
Toward Efficient Gradient-Based Value Estimation
Toward Efficient Gradient-Based Value Estimation
Arsalan Sharifnassab
R. Sutton
60
3
0
31 Jan 2023
CRC-RL: A Novel Visual Feature Representation Architecture for
  Unsupervised Reinforcement Learning
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
64
1
0
31 Jan 2023
Enabling surrogate-assisted evolutionary reinforcement learning via
  policy embedding
Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding
Lan Tang
Xiaxi Li
Jinyuan Zhang
Guiying Li
Peng Yang
Ke Tang
111
1
0
31 Jan 2023
V2N Service Scaling with Deep Reinforcement Learning
V2N Service Scaling with Deep Reinforcement Learning
Cyril Shih-Huan Hsu
Jorge Martín-Pérez
Chrysa Papagianni
Paola Grosso
OffRL
23
2
0
30 Jan 2023
A Novel Framework for Policy Mirror Descent with General
  Parameterization and Linear Convergence
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence
Carlo Alfano
Rui Yuan
Patrick Rebeschini
145
15
0
30 Jan 2023
Direct Preference-based Policy Optimization without Reward Modeling
Direct Preference-based Policy Optimization without Reward Modeling
Gaon An
Junhyeok Lee
Xingdong Zuo
Norio Kosaka
KyungHyun Kim
Hyun Oh Song
OffRL
74
29
0
30 Jan 2023
StriderNET: A Graph Reinforcement Learning Approach to Optimize Atomic
  Structures on Rough Energy Landscapes
StriderNET: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes
Vaibhav Bihani
S. Manchanda
Srikanth Sastry
Sayan Ranu
N. M. A. Krishnan
GNNOffRLAI4CE
70
5
0
29 Jan 2023
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement
  Learning via Multi-Level Monte Carlo Actor-Critic
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic
Wesley A Suttle
Amrit Singh Bedi
Bhrij Patel
Brian M Sadler
Alec Koppel
Dinesh Manocha
100
16
0
28 Jan 2023
Neural Episodic Control with State Abstraction
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
88
16
0
27 Jan 2023
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Ido Greenberg
Shie Mannor
Gal Chechik
E. Meirom
OffRLOOD
95
9
0
26 Jan 2023
FedHQL: Federated Heterogeneous Q-Learning
FedHQL: Federated Heterogeneous Q-Learning
Flint Xiaofeng Fan
Yining Ma
Zhongxiang Dai
Cheston Tan
Bryan Kian Hsiang Low
Roger Wattenhofer
FedML
78
8
0
26 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement
  Learning
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
88
10
0
26 Jan 2023
Story Shaping: Teaching Agents Human-like Behavior with Stories
Story Shaping: Teaching Agents Human-like Behavior with Stories
Xiangyu Peng
Christopher Cui
Wei Zhou
Renee Jia
Mark O. Riedl
82
6
0
24 Jan 2023
On The Convergence Of Policy Iteration-Based Reinforcement Learning With
  Monte Carlo Policy Evaluation
On The Convergence Of Policy Iteration-Based Reinforcement Learning With Monte Carlo Policy Evaluation
Anna Winnicki
R. Srikant
90
9
0
23 Jan 2023
The configurable tree graph (CT-graph): measurable problems in partially
  observable and distal reward environments for lifelong reinforcement learning
The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning
Andrea Soltoggio
Eseoghene Ben-Iwhiwhu
Christos Peridis
Pawel Ladosz
Jeffery Dick
Praveen K. Pilly
Soheil Kolouri
OffRL
90
3
0
21 Jan 2023
Quasi-optimal Reinforcement Learning with Continuous Actions
Quasi-optimal Reinforcement Learning with Continuous Actions
Yuhan Li
Wenzhuo Zhou
Ruoqing Zhu
OffRL
83
5
0
21 Jan 2023
AccDecoder: Accelerated Decoding for Neural-enhanced Video Analytics
AccDecoder: Accelerated Decoding for Neural-enhanced Video Analytics
Tingting Yuan
Liang Mi
Weijun Wang
Haipeng Dai
Xiaoming Fu
73
16
0
20 Jan 2023
Asynchronously Trained Distributed Topographic Maps
Asynchronously Trained Distributed Topographic Maps
Abbas Siddiqui
Dionysios Georgiadis
8
0
0
20 Jan 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning
  Systems
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
83
19
0
18 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative
  Reward Co-Training
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
63
1
0
18 Jan 2023
DQNAS: Neural Architecture Search using Reinforcement Learning
DQNAS: Neural Architecture Search using Reinforcement Learning
Anshumaan Chauhan
S. Bhattacharyya
S. Vadivel
OOD
39
3
0
17 Jan 2023
DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using
  Velocity Obstacles
DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using Velocity Obstacles
Zhanteng Xie
P. Dames
113
68
0
16 Jan 2023
Planning for Learning Object Properties
Planning for Learning Object Properties
Leonardo Lamanna
Luciano Serafini
Mohamadreza Faridghasemnia
A. Saffiotti
A. Saetti
Alfonso Gerevini
P. Traverso
70
7
0
15 Jan 2023
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning
  in Financial Markets
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets
Shuo Sun
Molei Qin
Xinrun Wang
Bo An
FaMLOffRLAIFin
88
5
0
14 Jan 2023
Mean-Field Control based Approximation of Multi-Agent Reinforcement
  Learning in Presence of a Non-decomposable Shared Global State
Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global State
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
79
8
0
13 Jan 2023
Asynchronous training of quantum reinforcement learning
Asynchronous training of quantum reinforcement learning
Samuel Yen-Chi Chen
OffRL
93
24
0
12 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
79
3
0
10 Jan 2023
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of
  Residential Loads
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads
Vincent Mai
Philippe Maisonneuve
Tianyu Zhang
Hadi Nekoei
Liam Paull
Antoine Lesage-Landry
AI4CE
50
5
0
06 Jan 2023
Centralized Cooperative Exploration Policy for Continuous Control Tasks
Centralized Cooperative Exploration Policy for Continuous Control Tasks
Chong Li
Chen Gong
Qiang He
Xinwen Hou
Yu Liu
80
1
0
06 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health
Deep Spectral Q-learning with Application to Mobile Health
Yuhe Gao
C. Shi
R. Song
75
0
0
03 Jan 2023
Symbolic Visual Reinforcement Learning: A Scalable Framework with
  Object-Level Abstraction and Differentiable Expression Search
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
102
10
0
30 Dec 2022
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability
  Optimization in the Metaverse over Wireless Communications
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse over Wireless Communications
Wen-li Yu
Terence Jie Chua
Jun Zhao
OffRL
81
23
0
30 Dec 2022
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
93
8
0
30 Dec 2022
On Deep Recurrent Reinforcement Learning for Active Visual Tracking of
  Space Noncooperative Objects
On Deep Recurrent Reinforcement Learning for Active Visual Tracking of Space Noncooperative Objects
D. Zhou
Guanghui Sun
Zhao-jie Zhang
Ligang Wu
64
10
0
29 Dec 2022
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in
  Spiking Policy Network
Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Qingyu Wang
Bo Xu
OffRL
381
5
0
29 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
148
30
0
29 Dec 2022
Towards automating Codenames spymasters with deep reinforcement learning
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
63
2
0
28 Dec 2022
Variance Reduction for Score Functions Using Optimal Baselines
Variance Reduction for Score Functions Using Optimal Baselines
Ronan L. Keane
H. Gao
54
0
0
27 Dec 2022
Previous
123...171819...707172
Next