Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1903.06592
Cited By
Policy Distillation and Value Matching in Multiagent Reinforcement Learning
15 March 2019
Samir Wadhwania
Dong-Ki Kim
Shayegan Omidshafiei
Jonathan P. How
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Policy Distillation and Value Matching in Multiagent Reinforcement Learning"
15 / 15 papers shown
Title
Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
Beining Zhang
Aditya Kapoor
Mingfei Sun
184
0
0
08 Feb 2025
Learning Hierarchical Teaching Policies for Cooperative Agents
Dong-Ki Kim
Miao Liu
Shayegan Omidshafiei
Sebastian Lopez-Cot
Matthew D Riemer
Golnaz Habibi
Gerald Tesauro
Sami Mourad
Murray Campbell
Jonathan P. How
31
7
0
07 Mar 2019
Learning to Teach in Cooperative Multiagent Reinforcement Learning
Shayegan Omidshafiei
Dong-Ki Kim
Miao Liu
Gerald Tesauro
Matthew D Riemer
Chris Amato
Murray Campbell
Jonathan P. How
47
136
0
20 May 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
279
8,303
0
04 Jan 2018
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
130
4,468
0
07 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
86
2,069
0
24 May 2017
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
Shayegan Omidshafiei
Jason Pazis
Chris Amato
Jonathan P. How
J. Vian
107
498
0
17 Mar 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
113
701
0
15 Mar 2017
Coordinated Multi-Agent Imitation Learning
Hoang Minh Le
Yisong Yue
Peter Carr
P. Lucey
56
190
0
09 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
81
1,337
0
27 Feb 2017
Multi-Agent Cooperation and the Emergence of (Natural) Language
Angeliki Lazaridou
A. Peysakhovich
Marco Baroni
LLMAG
115
432
0
21 Dec 2016
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
186
1,143
0
25 May 2016
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
79
689
0
19 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
292
13,214
0
09 Sep 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
192
3,211
0
02 Nov 2010
1