ResearchTrend.AI
arXiv: 2011.00382
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

31 October 2020
Dong-Ki Kim
Miao Liu
Matthew D Riemer
Chuangchuang Sun
Marwa Abdulhai
Golnaz Habibi
Sebastian Lopez-Cot
Gerald Tesauro
Jonathan P. How

Papers citing "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning"

11 / 11 papers shown
Multi-agent cooperation through learning-aware policy gradients
Alexander Meulemans
Seijin Kobayashi
J. Oswald
Nino Scherrer
Eric Elmoznino
Blake A. Richards
Guillaume Lajoie
Blaise Agüera y Arcas
João Sacramento
45
0
0
24 Oct 2024
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou
Weizhe Hong
Jonathan C. Kao
28
0
0
03 Jun 2024
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
Olivia Macmillan-Scott
Mirco Musolesi
37
1
0
28 Nov 2023
Meta-Value Learning: a General Framework for Learning with Learning Awareness
Tim Cooijmans
Milad Aghajohari
Aaron C. Courville
21
6
0
17 Jul 2023
Graph Neural Networks for Decentralized Multi-Agent Perimeter Defense
Elijah S. Lee
Lifeng Zhou
Alejandro Ribeiro
Vijay Kumar
AAML
GNN
42
13
0
23 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
43
2
0
12 Jul 2022
Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module
Wei-Cheng Tseng
Wei Wei
Da-Cheng Juan
Min Sun
36
2
0
14 Dec 2021
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
41
8
0
13 Dec 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Chuangchuang Sun
Dong-Ki Kim
Jonathan P. How
AAML
31
19
0
14 Sep 2021
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
362
11,684
0
09 Mar 2017