2404.08003
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
9 April 2024
Guangchen Lan
Dong-Jun Han
Abolfazl Hashemi
Vaneet Aggarwal
Christopher G. Brinton
Papers citing
"Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis"
50 / 51 papers shown
Title
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong
Shufan Wang
Daniel Jiang
Jian Li
FedML
125
1
0
22 Nov 2024
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
Shicheng Liu
Minghui Zhu
81
1
0
21 Oct 2024
StraightLine: An End-to-End Resource-Aware Scheduler for Machine Learning Application Requests
Cheng-Wei Ching
Boyuan Guan
Hailu Xu
Liting Hu
VLM
AI4TS
HAI
49
0
0
25 Jul 2024
Advanced Multimodal Deep Learning Architecture for Image-Text Matching
Jinyin Wang
Haijing Zhang
Yihao Zhong
Yingbin Liang
Rongwei Ji
Yiru Cang
96
22
0
13 Jun 2024
Credit Card Fraud Detection Using Advanced Transformer Model
Chang Yu
Yongshun Xu
Jin Cao
Y. Zhang
Yinxin Jin
Mengran Zhu
63
44
0
06 Jun 2024
Advancements in Feature Extraction Recognition of Medical Imaging Systems Through Deep Learning Technique
Qishi Zhan
Dan Sun
Erdi Gao
Yuhan Ma
Yaxin Liang
Haowei Yang
77
8
0
23 May 2024
Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis
Yuxiang Hu
Haowei Yang
Ting Xu
Shuyao He
Jiajie Yuan
Haozhang Deng
62
12
0
23 May 2024
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks
Taiyuan Mei
Yun Zi
X. Cheng
Zijun Gao
Qi Wang
Haowei Yang
79
20
0
20 May 2024
Real-Time Pill Identification for the Visually Impaired Using Deep Learning
Bo Dang
Wenchao Zhao
Yufeng Li
Danqing Ma
Qixuan Yu
Elly Yijun Zhu
MedIm
52
41
0
08 May 2024
Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries
Swetha Ganesh
Jiayu Chen
Gugan Thoppe
Vaneet Aggarwal
FedML
95
1
0
15 Mar 2024
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
Chenyu Zhang
Han Wang
Aritra Mitra
James Anderson
60
19
0
27 Jan 2024
Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes
Washim Uddin Mondal
Vaneet Aggarwal
52
11
0
18 Oct 2023
Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates
Guangchen Lan
Han Wang
James Anderson
Christopher G. Brinton
Vaneet Aggarwal
FedML
63
27
0
09 Oct 2023
Decentralized Federated Learning: A Survey and Perspective
Liangqi Yuan
Ziran Wang
Lichao Sun
Philip S. Yu
Christopher G. Brinton
FedML
79
91
0
02 Jun 2023
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks
Honghao Wei
Xin Liu
Weina Wang
Lei Ying
51
10
0
25 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
50
21
0
18 May 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.4K
14,313
0
15 Mar 2023
Federated Temporal Difference Learning with Linear Function Approximation under Environmental Heterogeneity
Han Wang
A. Mitra
Hamed Hassani
George J. Pappas
James Anderson
FedML
70
23
0
04 Feb 2023
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Ilyas Fatkhullin
Anas Barakat
Anastasia Kireeva
Niao He
66
39
0
03 Feb 2023
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse over Wireless Communications
Wen-li Yu
Terence Jie Chua
Jun Zhao
OffRL
51
21
0
30 Dec 2022
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Yanli Liu
Kai Zhang
Tamer Basar
W. Yin
78
109
0
15 Nov 2022
Efficient and Light-Weight Federated Learning via Asynchronous Distributed Dropout
Chen Dun
Mirian Hipolito Garcia
C. Jermaine
Dimitrios Dimitriadis
Anastasios Kyrillidis
112
22
0
28 Oct 2022
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning
Pihe Hu
L. Pan
Yu Chen
Zhixuan Fang
Longbo Huang
13
5
0
30 Aug 2022
Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning
Anastasia Koloskova
Sebastian U. Stich
Martin Jaggi
FedML
40
80
0
16 Jun 2022
Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays
Konstantin Mishchenko
Francis R. Bach
Mathieu Even
Blake E. Woodworth
54
59
0
15 Jun 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
71
352
0
02 May 2022
FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence
Zhijie Xie
Shenghui Song
FedML
43
48
0
18 Apr 2022
Federated Reinforcement Learning with Environment Heterogeneity
Hao Jin
Yang Peng
Wenhao Yang
Shusen Wang
Zhihua Zhang
84
74
0
06 Apr 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
868
12,916
0
04 Mar 2022
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning
Xiao-Yang Liu
Zechu Li
Zhuoran Yang
Jiahao Zheng
Zhaoran Wang
A. Walid
Jian Guo
Michael I. Jordan
40
25
0
11 Dec 2021
On the Global Optimum Convergence of Momentum-based Policy Gradient
Yuhao Ding
Junzi Zhang
Javad Lavaei
46
18
0
19 Oct 2021
A general sample complexity analysis of vanilla policy gradient
Rui Yuan
Robert Mansel Gower
A. Lazaric
90
64
0
23 Jul 2021
Communication Efficient Parallel Reinforcement Learning
Mridul Agarwal
Bhargav Ganguly
Vaneet Aggarwal
68
10
0
22 Feb 2021
Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup
Han Shen
Kai Zhang
Min-Fong Hong
Tianyi Chen
54
29
0
31 Dec 2020
A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning
Marina Haliem
G. Mani
Vaneet Aggarwal
Bharat K. Bhargava
57
63
0
05 Oct 2020
Momentum-Based Policy Gradient Methods
Feihu Huang
Shangqian Gao
J. Pei
Heng-Chiao Huang
50
39
0
13 Jul 2020
VAFL: a Method of Vertical Asynchronous Federated Learning
Tianyi Chen
Xiao Jin
Yuejiao Sun
W. Yin
FedML
107
161
0
12 Jul 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
331
1,683
0
02 Feb 2020
Advances and Open Problems in Federated Learning
Peter Kairouz
H. B. McMahan
Brendan Avent
A. Bellet
M. Bennis
...
Zheng Xu
Qiang Yang
Felix X. Yu
Han Yu
Sen Zhao
FedML
AI4CE
232
6,247
0
10 Dec 2019
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
475
42,407
0
03 Dec 2019
Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning
Anis Elgabli
Jihong Park
Amrit Singh Bedi
Chaouki Ben Issaid
M. Bennis
Vaneet Aggarwal
51
67
0
23 Oct 2019
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction
Pan Xu
F. Gao
Quanquan Gu
61
88
0
18 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
80
241
0
29 Aug 2019
Federated Learning for Wireless Communications: Motivation, Opportunities and Challenges
Solmaz Niknam
Harpreet S. Dhillon
J. H. Reed
58
604
0
30 Jul 2019
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Kai Zhang
Alec Koppel
Haoqi Zhu
Tamer Basar
65
190
0
19 Jun 2019
Asynchronous Federated Optimization
Cong Xie
Oluwasanmi Koyejo
Indranil Gupta
FedML
68
567
0
10 Mar 2019
DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning
A. Al-Abbasi
A. Ghosh
Vaneet Aggarwal
56
150
0
09 Mar 2019
Stochastic Variance-Reduced Policy Gradient
Matteo Papini
Damiano Binaghi
Giuseppe Canonaco
Matteo Pirotta
Marcello Restelli
67
177
0
14 Jun 2018
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. B. McMahan
Eider Moore
Daniel Ramage
S. Hampson
Blaise Agüera y Arcas
FedML
394
17,453
0
17 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
191
8,851
0
04 Feb 2016