ResearchTrend.AI
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents

1 December 2019
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
arXiv:1912.00498

Papers citing "Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents"

20 / 20 papers shown
Matrix Low-Rank Approximation For Policy Gradient Methods
Sergio Rozada
A. Marques
64
2
0
27 May 2024
Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes
Dong-hwan Lee
Han-Dong Lim
Don Wan Kim
15
0
0
31 Jul 2023
Cooperative Actor-Critic via TD Error Aggregation
Martin Figura
Yixuan Lin
Ji Liu
V. Gupta
63
1
0
25 Jul 2022
Distributed Evolution Strategies for Black-box Stochastic Optimization
Xiaoyu He
Zibin Zheng
Chuan Chen
Yuren Zhou
Chuan Luo
Qingwei Lin
51
5
0
09 Apr 2022
Variance-Reduced Stochastic Quasi-Newton Methods for Decentralized Learning: Part I
Jiaojiao Zhang
Huikang Liu
Anthony Man-Cho So
Qing Ling
97
15
0
19 Jan 2022
Distributed Policy Gradient with Variance Reduction in Multi-Agent Reinforcement Learning
Xiaoxiao Zhao
Jinlong Lei
Li Li
Jie-bin Chen
OffRL
24
3
0
25 Nov 2021
Edge Artificial Intelligence for 6G: Vision, Enabling Technologies, and Applications
Khaled B. Letaief
Yuanming Shi
Jianmin Lu
Jianhua Lu
99
434
0
24 Nov 2021
Towards Learning Generalizable Driving Policies from Restricted Latent Representations
Behrad Toghi
Rodolfo Valiente
Ramtin Pedarsani
Y. P. Fallah
82
6
0
05 Nov 2021
Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming
Alec Koppel
Amrit Singh Bedi
Bhargav Ganguly
Vaneet Aggarwal
53
4
0
22 Oct 2021
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees
Siliang Zeng
Tianyi Chen
Alfredo García
Mingyi Hong
92
11
0
11 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
132
60
0
28 Sep 2021
Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey
A. Abdellatif
N. Mhaisen
Z. Chkirbene
Amr M. Mohamed
A. Erbad
Mohsen Guizani
OffRL, AI4TS
65
23
0
05 Aug 2021
Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits
Gourab Ghatak
Hardhik Mohanty
Aniq Ur Rahman
TTA
132
10
0
30 May 2021
MARL with General Utilities via Decentralized Shadow Reward Actor-Critic
Junyu Zhang
Amrit Singh Bedi
Mengdi Wang
Alec Koppel
57
8
0
29 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
98
93
0
04 May 2021
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Amal Feriani
Ekram Hossain
211
245
0
06 Nov 2020
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning
Sihan Zeng
Aqeel Anwar
Thinh T. Doan
A. Raychowdhury
Justin Romberg
88
40
0
08 Jun 2020
Task-Oriented Data Compression for Multi-Agent Communications Over Bit-Budgeted Channels
Arsham Mostaani
T. Vu
Symeon Chatzinotas
Björn E. Ottersten
61
11
0
28 May 2020
Adaptive Temporal Difference Learning with Linear Function Approximation
Tao Sun
Han Shen
Tianyi Chen
Dongsheng Li
77
23
0
20 Feb 2020
Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
61
15
0
07 Aug 2019