Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
An Auction-based Marketplace for Model Trading in Federated Learning
Yue Cui
Liuyi Yao
Yaliang Li
Ziqian Chen
Bolin Ding
Xiaofang Zhou
FedML
82
3
0
02 Feb 2024
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models
Sihao Hu
Tiansheng Huang
Ling Liu
LM&Ro
LLMAG
66
9
0
02 Feb 2024
COA-GPT: Generative Pre-trained Transformers for Accelerated Course of Action Development in Military Operations
Vinicius G. Goecks
Nicholas R. Waytowich
SLR
79
9
0
01 Feb 2024
Adaptive Primal-Dual Method for Safe Reinforcement Learning
Weiqin Chen
James Onyejizu
Long Vu
Lan Hoang
D. Subramanian
Koushik Kar
Sandipan Mishra
Santiago Paternain
56
1
0
01 Feb 2024
Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach
Zhiyuan Yao
Ionuţ Florescu
Chihoon Lee
OffRL
43
2
0
01 Feb 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
95
7
0
31 Jan 2024
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Dehua Zheng
...
Jing Dai
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
82
1
0
28 Jan 2024
Learning fast changing slow in spiking neural networks
Cristiano Capone
P. Muratore
OffRL
52
0
0
25 Jan 2024
Machine learning for industrial sensing and control: A survey and practical perspective
Nathan P. Lawrence
S. Damarla
Jong Woo Kim
Aditya Tulsyan
Faraz Amjad
Kai Wang
Benoît Chachuat
Jong Min Lee
Biao Huang
R. Bhushan Gopaluni
AI4CE
72
23
0
24 Jan 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
79
0
0
24 Jan 2024
Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management
M. Saifullah
K. G. Papakonstantinou
C. Andriotis
S. M. Stoffels
AI4CE
101
2
0
23 Jan 2024
Self-Labeling the Job Shop Scheduling Problem
Andrea Corsini
Angelo Porrello
Simone Calderara
Mauro DellÁmico
SSL
105
15
0
22 Jan 2024
Integrated Sensing, Communication, and Computing: An Information-oriented Resource Transaction Mechanism
Ning Chen
Zhipeng Cheng
Xuwei Fan
Zhang Liu
Bangzhen Huang
Jie Yang
Yifeng Zhao
Lianfen Huang
39
0
0
22 Jan 2024
Asynchronous Parallel Reinforcement Learning for Optimizing Propulsive Performance in Fin Ray Control
Xin-Yang Liu
Dariush Bodaghi
Q. Xue
Xudong Zheng
Jian-Xun Wang
115
0
0
21 Jan 2024
Episodic Reinforcement Learning with Expanded State-reward Space
Dayang Liang
Yaru Zhang
Yunlong Liu
OffRL
73
1
0
19 Jan 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
105
1
0
17 Jan 2024
Learned Best-Effort LLM Serving
Siddharth Jha
Coleman Hooper
Xiaoxuan Liu
Sehoon Kim
Kurt Keutzer
43
2
0
15 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL
ALM
53
7
0
14 Jan 2024
A Reinforcement Learning Environment for Directed Quantum Circuit Synthesis
Michael Kolle
Tom Schubert
Philipp Altmann
Maximilian Zorn
Jonas Stein
Claudia Linnhoff-Popien
49
10
0
13 Jan 2024
Quantum Advantage Actor-Critic for Reinforcement Learning
Michael Kolle
Mohamad Hgog
Fabian Ritz
Philipp Altmann
Maximilian Zorn
Jonas Stein
Claudia Linnhoff-Popien
92
8
0
13 Jan 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Junchen Wan
Fuzheng Zhang
Di Zhang
Ji-Rong Wen
KELM
106
35
0
11 Jan 2024
Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Xijun Li
Fangzhou Zhu
Hui-Ling Zhen
Weilin Luo
Meng Lu
...
Jia Zeng
Mingxuan Yuan
Jianye Hao
Jun Yao
Kun Mao
118
2
0
11 Jan 2024
An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control
Antonio Manjavacas
Alejandro Campoy-Nieves
Javier Jiménez Raboso
Miguel Molina-Solana
Juan Gómez-Romero
AI4CE
60
10
0
11 Jan 2024
CNN-DRL for Scalable Actions in Finance
Sina Montazeri
Akram Mirzaeinia
Haseebullah Jumakhan
Amir Mirzaeinia
AIFin
43
2
0
10 Jan 2024
Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process
Zhenan Fan
Bissan Ghaddar
Xinglu Wang
Linzi Xing
Yong Zhang
Zirui Zhou
AI4CE
101
13
0
06 Jan 2024
Trajectory-Oriented Policy Optimization with Sparse Rewards
Guojian Wang
Faguo Wu
Xiao Zhang
OffRL
45
1
0
04 Jan 2024
Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach
Chong Huang
Gaojie Chen
Pei Xiao
Yue Xiao
Zhu Han
Jonathon A. Chambers
132
22
0
02 Jan 2024
Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning
Mohamad Abed El Rahman Hammoud
Naila Raboudi
E. Titi
Omar Knio
Ibrahim Hoteit
AI4CE
86
3
0
01 Jan 2024
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles
Yuanzhao Zhai
Han Zhang
Yu Lei
Yue Yu
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
AI4CE
145
35
0
30 Dec 2023
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
69
3
0
27 Dec 2023
Preference as Reward, Maximum Preference Optimization with Importance Sampling
Zaifan Jiang
Xing Huang
Chao Wei
105
2
0
27 Dec 2023
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
Jingpu Yang
Helin Wang
Qirui Zhao
Zhecheng Shi
Zirui Song
Miao Fang
104
1
0
26 Dec 2023
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Xingzhou Lou
Junge Zhang
Timothy J. Norman
Kaiqi Huang
Yali Du
70
1
0
25 Dec 2023
DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge
Jiaming Lu
Jingqing Ruan
Haoyuan Jiang
Ziyue Li
Hangyu Mao
Rui Zhao
76
12
0
22 Dec 2023
Blox: A Modular Toolkit for Deep Learning Schedulers
Saurabh Agarwal
Amar Phanishayee
Shivaram Venkataraman
OffRL
60
4
0
19 Dec 2023
Multi-agent reinforcement learning using echo-state network and its application to pedestrian dynamics
Hisato Komatsu
88
1
0
19 Dec 2023
An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Youshao Xiao
Weichang Wu
Zhenglei Zhou
Fagui Mao
Shangchun Zhao
Lin Ju
Lei Liang
Xiaolu Zhang
Jun Zhou
83
6
0
19 Dec 2023
Challenges for Reinforcement Learning in Quantum Circuit Design
Philipp Altmann
Jonas Stein
Michael Kolle
Adelina Barligea
Thomas Gabor
Thomy Phan
Sebastian Feld
Claudia Linnhoff-Popien
75
6
0
18 Dec 2023
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob J. Hollenstein
Georg Martius
J. Piater
125
6
0
18 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
77
10
0
18 Dec 2023
Deep-Dispatch: A Deep Reinforcement Learning-Based Vehicle Dispatch Algorithm for Advanced Air Mobility
Elaheh Sabziyan Varnousfaderani
S. Shihab
E. F. Dulia
31
1
0
17 Dec 2023
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning
Huy Hoang
Tien Mai
Pradeep Varakantham
92
8
0
16 Dec 2023
Active Reinforcement Learning for Robust Building Control
Doseok Jang
Larry Yan
Lucas Spangher
C. Spanos
76
3
0
16 Dec 2023
Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles
Xiaoxue Yu
Rongpeng Li
Chengchao Liang
Zhifeng Zhao
79
0
0
15 Dec 2023
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Minyoung Hwang
Luca Weihs
Chanwoo Park
Kimin Lee
Aniruddha Kembhavi
Kiana Ehsani
82
20
0
14 Dec 2023
Improve Robustness of Reinforcement Learning against Observation Perturbations via
l
∞
l_\infty
l
∞
Lipschitz Policy Networks
Buqing Nie
Jingtian Ji
Yangqing Fu
Yue Gao
81
4
0
14 Dec 2023
Adaptive parameter sharing for multi-agent reinforcement learning
Dapeng Li
Na Lou
Bin Zhang
Zhiwei Xu
Guoliang Fan
84
3
0
14 Dec 2023
World Models via Policy-Guided Trajectory Diffusion
Marc Rigter
Jun Yamada
Ingmar Posner
104
21
0
13 Dec 2023
Efficiently Quantifying Individual Agent Importance in Cooperative MARL
Omayma Mahjoub
Ruan de Kock
Siddarth S. Singh
Wiem Khlifi
Abidine Vall
Kale-ab Tessera
Arnu Pretorius
FAtt
85
2
0
13 Dec 2023
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
Siddarth S. Singh
Omayma Mahjoub
Ruan de Kock
Wiem Khlifi
Abidine Vall
Kale-ab Tessera
Arnu Pretorius
106
1
0
13 Dec 2023
Previous
1
2
3
...
9
10
11
...
70
71
72
Next