arXiv: 1903.06592

Policy Distillation and Value Matching in Multiagent Reinforcement Learning

15 March 2019
Samir Wadhwania
Dong-Ki Kim
Shayegan Omidshafiei
Jonathan P. How

Papers citing "Policy Distillation and Value Matching in Multiagent Reinforcement Learning"

Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
Beining Zhang
Aditya Kapoor
Mingfei Sun
184
0
0
08 Feb 2025
Learning Hierarchical Teaching Policies for Cooperative Agents
Dong-Ki Kim
Miao Liu
Shayegan Omidshafiei
Sebastian Lopez-Cot
Matthew D Riemer
Golnaz Habibi
Gerald Tesauro
Sami Mourad
Murray Campbell
Jonathan P. How
31
7
0
07 Mar 2019
Learning to Teach in Cooperative Multiagent Reinforcement Learning
Shayegan Omidshafiei
Dong-Ki Kim
Miao Liu
Gerald Tesauro
Matthew D Riemer
Chris Amato
Murray Campbell
Jonathan P. How
47
136
0
20 May 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
279
8,303
0
04 Jan 2018
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
130
4,468
0
07 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
86
2,069
0
24 May 2017
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
Shayegan Omidshafiei
Jason Pazis
Chris Amato
Jonathan P. How
J. Vian
107
498
0
17 Mar 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
113
701
0
15 Mar 2017
Coordinated Multi-Agent Imitation Learning
Hoang Minh Le
Yisong Yue
Peter Carr
P. Lucey
56
190
0
09 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
81
1,337
0
27 Feb 2017
Multi-Agent Cooperation and the Emergence of (Natural) Language
Angeliki Lazaridou
A. Peysakhovich
Marco Baroni
LLMAG
115
432
0
21 Dec 2016
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
186
1,143
0
25 May 2016
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
79
689
0
19 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
292
13,214
0
09 Sep 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
192
3,211
0
02 Nov 2010