1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing
Cameron Redovian
115
0
0
11 Mar 2025
Pull-Based Query Scheduling for Goal-Oriented Semantic Communication
Pouya Agheli
Nikolaos Pappas
Marios Kountouris
71
0
0
09 Mar 2025
Precise Insulin Delivery for Artificial Pancreas: A Reinforcement Learning Optimized Adaptive Fuzzy Control Approach
Omar Mameche
Abdelhadi Abedou
Taqwa Mezaache
Mohamed Tadjine
127
0
0
09 Mar 2025
Probabilistic Shielding for Safe Reinforcement Learning
Edwin Hamel-De le Court
Francesco Belardinelli
Alex W. Goodall
109
0
0
09 Mar 2025
Deep Reinforcement Learning-Based Semi-Autonomous Control for Magnetic Micro-robot Navigation with Immersive Manipulation
Yudong Mao
Dandan Zhang
77
0
0
08 Mar 2025
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation
Mohit Prashant
Arvind Easwaran
Suman Das
Michael Yuhas
OffRL
104
1
0
07 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
126
2
0
06 Mar 2025
Can We Optimize Deep RL Policy Weights as Trajectory Modeling?
Hongyao Tang
OffRL
191
0
0
06 Mar 2025
Benchmarking Dynamic SLO Compliance in Distributed Computing Continuum Systems
Alfreds Lapkovskis
Boris Sedlak
Sindri Magnússon
Schahram Dustdar
Praveen Kumar Donta
151
2
0
05 Mar 2025
Learning to Negotiate via Voluntary Commitment
Shuhui Zhu
Baoxiang Wang
Sriram Ganapathi Subramanian
Pascal Poupart
116
0
0
05 Mar 2025
Autonomous Curriculum Design via Relative Entropy Based Task Modifications
Muhammed Yusuf Satici
Jianxun Wang
David L. Roberts
76
0
0
28 Feb 2025
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Muhammed Yusuf Satici
David L. Roberts
OffRL
71
0
0
28 Feb 2025
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments
Jinghao Xin
Zhichao Liang
Zihuan Zhang
Peng Wang
Ning Li
94
0
0
27 Feb 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
168
0
0
27 Feb 2025
Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning
Thomas Budiarjo
Santana Yuda Pradata
Kadek Gemilang Santiyuda
Muhammad Alfian Amrizal
Reza Pulungan
Hiroyuki Takizawa
111
0
0
27 Feb 2025
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Taiyi Wang
Zhihao Wu
Jianheng Liu
Jianye Hao
Jun Wang
Kun Shao
OffRL
122
29
0
24 Feb 2025
Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control
Yuli Zhang
Shangbo Wang
Dongyao Jia
Pengfei Fan
Ruiyuan Jiang
Hankang Gu
Andy H.F. Chow
92
0
0
23 Feb 2025
Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM
Yao Zhang
Yuyi Mao
Hui Wang
Zhiwen Yu
Song Guo
Jun Zhang
Liang Wang
B. Guo
101
0
0
23 Feb 2025
TeLL-Drive: Enhancing Autonomous Driving with Teacher LLM-Guided Deep Reinforcement Learning
Chengkai Xu
Jiaqi Liu
Shiyu Fang
Jian Sun
Dong Chen
Peng Hang
Jian Sun
234
1
0
21 Feb 2025
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Shicong Cen
Jincheng Mei
Katayoon Goshvadi
Hanjun Dai
Tong Yang
Sherry Yang
Dale Schuurmans
Yuejie Chi
Bo Dai
OffRL
152
37
0
20 Feb 2025
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?
Michael Doherty
Robin Matzner
Rasoul Sadeghi
Polina Bayvel
Alejandra Beghelli
208
0
0
18 Feb 2025
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen
Anas Badr
Ali Abohendy
Hatem Alsaadawy
Nadine Alsayad
143
2
0
14 Feb 2025
Provably Robust Federated Reinforcement Learning
Minghong Fang
Xilong Wang
Neil Zhenqiang Gong
FedML
131
0
0
12 Feb 2025
KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems
Jusheng Zhang
Zimeng Huang
Yijia Fan
Ningyuan Liu
Mingyan Li
Zhuojie Yang
Jiawei Yao
Jian Wang
Keze Wang
62
1
0
11 Feb 2025
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Jean Vassoyan
Nathanaël Beau
Roman Plaud
OffRL
157
2
0
10 Feb 2025
Circular Microalgae-Based Carbon Control for Net Zero
Federico Zocco
Joan García
W. Haddad
196
1
0
04 Feb 2025
Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information
Jinghai He
Cheng Hua
Chunyang Zhou
Zeyu Zheng
AIFin
85
2
0
29 Jan 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
174
16
0
28 Jan 2025
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
Xuan Chen
Yuzhou Nie
Wenbo Guo
Xiangyu Zhang
213
18
0
28 Jan 2025
RLER-TTE: An Efficient and Effective Framework for En Route Travel Time Estimation with Reinforcement Learning
Zhihan Zheng
Haitao Yuan
Minxiao Chen
Shangguang Wang
AI4TS
127
2
0
28 Jan 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
100
0
0
25 Jan 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
117
6
0
24 Jan 2025
Control-ITRA: Controlling the Behavior of a Driving Model
Vasileios Lioutas
Adam Scibior
Matthew Niedoba
Berend Zwartsenberg
Frank Wood
411
0
0
17 Jan 2025
CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving
Bhargava Uppuluri
Anjel Patel
Neil Mehta
Sridhar Kamath
Pratyush Chakraborty
122
1
0
10 Jan 2025
Highway Graph to Accelerate Reinforcement Learning
Zidu Yin
Zhen Zhang
Dong Gong
Stefano V. Albrecht
J. Q. Shi
OffRL
73
0
0
08 Jan 2025
Distributed Multi-Agent Reinforcement Learning with One-hop Neighbors and Compute Straggler Mitigation
Baoqian Wang
Junfei Xie
Nikolay Atanasov
97
10
0
03 Jan 2025
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
Hashmath Shaik
Alex Doboli
OffRL
ELM
467
0
0
31 Dec 2024
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Fangwei Zhong
Kui Wu
Churan Wang
Hao Chen
Hai Ci
Zhoujun Li
Yizhou Wang
VGen
90
2
0
31 Dec 2024
Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning
Xuan Zhou
Xiang Shi
Lele Zhang
Chong Chen
Hongbo Li
Lin Ma
Fang Deng
Jie Chen
57
0
0
27 Dec 2024
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matthew D Riemer
G. Subbaraj
Glen Berseth
Irina Rish
OffRL
140
2
0
18 Dec 2024
Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency
Taisuke Kobayashi
Takumi Aotani
176
5
0
17 Dec 2024
Lightweight Decentralized Neural Network-Based Strategies for Multi-Robot Patrolling
James Ward
Ryan McConville
Edmund R. Hunt
96
0
0
16 Dec 2024
Adaptive Reward Design for Reinforcement Learning
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
155
2
0
14 Dec 2024
Quantum-Train-Based Distributed Multi-Agent Reinforcement Learning
Kuan-Cheng Chen
Samuel Yen-Chi Chen
Chen-Yu Liu
Kin K Leung
133
7
0
12 Dec 2024
A Cross-Scene Benchmark for Open-World Drone Active Tracking
Haowei Sun
Jinwu Hu
Zhirui Zhang
Haoyuan Tian
Xinze Xie
Yufeng Wang
Zhuliang Yu
Xiaohua Xie
Mingkui Tan
134
0
0
01 Dec 2024
RL-SPH: Learning to Achieve Feasible Solutions for Integer Linear Programs
Tae-Hoon Lee
Min-Soo Kim
150
0
0
29 Nov 2024
Self-reconfiguration Strategies for Space-distributed Spacecraft
Tianle Liu
Zhixiang Wang
Yongwei Zhang
Ziwei Wang
Zihao Liu
Yizhai Zhang
Panfeng Huang
71
0
0
26 Nov 2024
Unsupervised Event Outlier Detection in Continuous Time
Somjit Nath
Yik Chau Lui
Siqi Liu
AI4TS
116
0
0
25 Nov 2024
Creating Hierarchical Dispositions of Needs in an Agent
Tofara Moyo
125
0
0
23 Nov 2024
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong
Shufan Wang
Daniel Jiang
Jian Li
FedML
151
1
0
22 Nov 2024