1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots
Yu Wang
Wenchuan Jia
Yi Sun
Dong He
57
0
0
25 Sep 2024
Symbolic State Partitioning for Reinforcement Learning
Mohsen Ghaffari
Mahsa Varshosaz
E. Johnsen
Andrzej Wasowski
60
1
0
25 Sep 2024
Neural Coordination and Capacity Control for Inventory Management
Carson Eisenach
Udaya Ghai
Dhruv Madeka
Kari Torkkola
Dean Phillips Foster
Sham Kakade
59
0
0
24 Sep 2024
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler
Yunrong Guo
Ofir Nabati
Gal Chechik
Xue Bin Peng
VGen
AI4CE
97
36
0
22 Sep 2024
Causal Reinforcement Learning for Optimisation of Robot Dynamics in Unknown Environments
Julian Gerald Dcruz
Sam Mahoney
Jia Yun Chua
Adoundeth Soukhabandith
John Mugabe
Weisi Guo
Miguel Arana-Catania
105
0
0
20 Sep 2024
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Alihan Hüyük
A. R. Koblitz
Atefeh Mohajeri
M. Andrews
OffRL
125
0
0
19 Sep 2024
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
Jean Seong Bjorn Choe
Bumkyu Choi
Jong-kook Kim
73
2
0
13 Sep 2024
Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies
Alexei Pisacane
Victor-Alexandru Darvariu
Mirco Musolesi
48
0
0
12 Sep 2024
An Introduction to Quantum Reinforcement Learning (QRL)
Samuel Yen-Chi Chen
64
0
0
09 Sep 2024
GOPT: Generalizable Online 3D Bin Packing via Transformer-based Deep Reinforcement Learning
Heng Xiong
Changrong Guo
Jian Peng
Kai Ding
Wenjie Chen
Xuchong Qiu
Long Bai
Jianfeng Xu
OffRL
62
5
0
09 Sep 2024
QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE
Junjie Zhao
Chengxi Zhang
Min Qin
Peng Yang
OOD
108
5
0
08 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
86
0
0
06 Sep 2024
BEVNav: Robot Autonomous Navigation Via Spatial-Temporal Contrastive Learning in Bird's-Eye View
Jiahao Jiang
Yuxiang Yang
Yingqi Deng
Chenlong Ma
Jing Zhang
SSL
80
4
0
03 Sep 2024
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization
Gao Tianci
Dmitriev D. Dmitry
Konstantin A. Neusypin
Yang Bo
Rao Shengren
OffRL
102
2
0
02 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
134
0
0
02 Sep 2024
Discovery of False Data Injection Schemes on Frequency Controllers with Reinforcement Learning
Romesh Prasad
Malik Hassanaly
Xiangyu Zhang
Abhijeet Sahu
AAML
25
1
0
30 Aug 2024
Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization
Talha Bozkus
Urbashi Mitra
62
2
0
29 Aug 2024
Statistical QoS Provision in Business-Centric Networks
Chang Wu
Yuang Chen
Hancheng Lu
101
0
0
28 Aug 2024
Equivariant Reinforcement Learning under Partial Observability
Hai Nguyen
Andrea Baisero
David Klee
Dian Wang
Robert Platt
Christopher Amato
88
16
0
26 Aug 2024
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU
Peng Zhu
Yuante Li
Yifan Hu
Qinyuan Liu
Dawei Cheng
Yuqi Liang
AIFin
AI4TS
167
6
0
26 Aug 2024
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
96
10
0
21 Aug 2024
Optimizing Interpretable Decision Tree Policies for Reinforcement Learning
D. Vos
Sicco Verwer
OffRL
61
3
0
21 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
193
4
0
20 Aug 2024
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
181
0
0
19 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
134
15
0
19 Aug 2024
Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework
Chengkai Xu
Zihao Deng
Jiaqi Liu
Chao Huang
Peng Hang
60
3
0
18 Aug 2024
HistoGym: A Reinforcement Learning Environment for Histopathological Image Analysis
Zhi-Bo Liu
Xiaobo Pang
Jizhao Wang
Shuai Liu
Chen Li
91
1
0
16 Aug 2024
Neural Reward Machines
Elena Umili
F. Argenziano
Roberto Capobianco
NAI
71
2
0
16 Aug 2024
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
113
5
0
14 Aug 2024
Personalized Dynamic Difficulty Adjustment -- Imitation Learning Meets Reinforcement Learning
Ronja Fuchs
Robin Gieseke
Alexander Dockhorn
43
2
0
13 Aug 2024
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
55
3
0
11 Aug 2024
Impacts of Darwinian Evolution on Pre-trained Deep Neural Networks
Guodong DU
Runhua Jiang
Senqiao Yang
HaoYang Li
Wei Chen
Keren Li
Sim Kuan Goh
Jing Li
69
4
0
10 Aug 2024
Emergence in Multi-Agent Systems: A Safety Perspective
Philipp Altmann
Julian Schonberger
Steffen Illium
Maximilian Zorn
Fabian Ritz
Tom Haider
Simon Burton
Thomas Gabor
66
1
0
08 Aug 2024
Model-Based Transfer Learning for Contextual Reinforcement Learning
Jung-Hoon Cho
Vindula Jayawardana
Sirui Li
Cathy Wu
OffRL
158
0
0
08 Aug 2024
e-Health CSIRO at RRG24: Entropy-Augmented Self-Critical Sequence Training for Radiology Report Generation
Aaron Nicolson
Jinghui Liu
Jason Dowling
Anthony N. Nguyen
Bevan Koopman
81
5
0
07 Aug 2024
Adversarial Safety-Critical Scenario Generation using Naturalistic Human Driving Priors
Kunkun Hao
Yonggang Luo
Wen Cui
Yuqiao Bai
Jucheng Yang
Songyang Yan
Yuxi Pan
Zijiang Yang
AAML
99
20
0
06 Aug 2024
Integrated Intention Prediction and Decision-Making with Spectrum Attention Net and Proximal Policy Optimization
Xiao Zhou
Chengzhen Meng
Wenru Liu
Zengqi Peng
Ming Liu
Jun Ma
109
1
0
06 Aug 2024
Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions
Amanda Jayanetti
Saman K. Halgamuge
Rajkumar Buyya
30
0
0
06 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
124
0
0
05 Aug 2024
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
70
2
0
04 Aug 2024
What comes after transformers? -- A selective survey connecting ideas in deep learning
Johannes Schneider
AI4CE
112
2
0
01 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
71
1
0
30 Jul 2024
Finite-Time Analysis of Asynchronous Multi-Agent TD Learning
Nicolò Dal Fabbro
Arman Adibi
Aritra Mitra
George J. Pappas
76
2
0
29 Jul 2024
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla
Ananye Agarwal
Deepak Pathak
OffRL
OnRL
86
5
0
29 Jul 2024
Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning
Samuel Yen-Chi Chen
108
9
0
25 Jul 2024
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Jean Seong Bjorn Choe
Jong-Kook Kim
61
2
0
25 Jul 2024
The Curious Case of Representational Alignment: Unravelling Visio-Linguistic Tasks in Emergent Communication
Tom Kouwenhoven
Max Peeperkorn
Bram van Dijk
Tessa Verhoef
58
4
0
25 Jul 2024
PateGail: A Privacy-Preserving Mobility Trajectory Generator with Imitation Learning
Huandong Wang
Changzheng Gao
Yuchen Wu
Depeng Jin
Lina Yao
Yong Li
70
23
0
23 Jul 2024
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
108
0
0
23 Jul 2024
Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN
Norman Becker
Daniel Reti
Evridiki V. Ntagiou
M. Wallum
Hans D. Schotten
96
2
0
22 Jul 2024