ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.07528
  4. Cited By
Emergent Tool Use From Multi-Agent Autocurricula

Emergent Tool Use From Multi-Agent Autocurricula

17 September 2019
Bowen Baker
I. Kanitscheider
Todor Markov
Yi Wu
Glenn Powell
Bob McGrew
Igor Mordatch
    LRM
ArXivPDFHTML

Papers citing "Emergent Tool Use From Multi-Agent Autocurricula"

50 / 153 papers shown
Title
Reinforcement Learning with Success Induced Task Prioritization
Reinforcement Learning with Success Induced Task Prioritization
Maria Nesterova
Alexey Skrynnik
Aleksandr I. Panov
16
2
0
30 Dec 2022
Decision Market Based Learning For Multi-agent Contextual Bandit
  Problems
Decision Market Based Learning For Multi-agent Contextual Bandit Problems
Wenlong Wang
T. Pfeiffer
21
1
0
01 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
30
13
0
01 Dec 2022
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
32
0
24 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from
  Cooperation to Team Competition
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
25
3
0
21 Nov 2022
Reward Gaming in Conditional Text Generation
Reward Gaming in Conditional Text Generation
Richard Yuanzhe Pang
Vishakh Padmakumar
Thibault Sellam
Ankur P. Parikh
He He
35
24
0
16 Nov 2022
Learning Task Requirements and Agent Capabilities for Multi-agent Task
  Allocation
Learning Task Requirements and Agent Capabilities for Multi-agent Task Allocation
Bo Fu
W. Smith
Denise M. Rizzo
Matthew Castanier
Maani Ghaffari
Kira Barton
21
4
0
07 Nov 2022
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Group Cohesion in Multi-Agent Scenarios as an Emergent Behavior
Gianluca Georg Alois Volkmer
Nabil Alsabah
AI4CE
18
0
0
03 Nov 2022
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon
  Manipulation
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
Shuo Cheng
Danfei Xu
56
37
0
23 Oct 2022
Co-Training an Observer and an Evading Target
Co-Training an Observer and an Evading Target
André Brandenburger
Folker Hoffmann
A. Charlish
40
1
0
20 Oct 2022
Adaptive patch foraging in deep reinforcement learning agents
Adaptive patch foraging in deep reinforcement learning agents
Nathan J. Wispinski
Andrew Butcher
K. Mathewson
Craig S. Chapman
M. Botvinick
P. Pilarski
21
8
0
14 Oct 2022
Large Language Models are Pretty Good Zero-Shot Video Game Bug Detectors
Large Language Models are Pretty Good Zero-Shot Video Game Bug Detectors
Mohammad Reza Taesiri
Finlay Macklon
Yihe Wang
Hengshuo Shen
C. Bezemer
ELM
LLMAG
MLLM
47
13
0
05 Oct 2022
Disentangling Transfer in Continual Reinforcement Learning
Disentangling Transfer in Continual Reinforcement Learning
Maciej Wołczyk
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
81
27
0
28 Sep 2022
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
T. Langerak
Sammy Christen
Mert Albaba
Christoph Gebhardt
Otmar Hilliges
OffRL
24
0
0
26 Sep 2022
Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Yuchen Xiao
Weihao Tan
Chris Amato
OffRL
64
19
0
20 Sep 2022
Learning to Deceive in Multi-Agent Hidden Role Games
Learning to Deceive in Multi-Agent Hidden Role Games
Matthew Aitchison
L. Benke
Penny Sweetser
OffRL
33
5
0
04 Sep 2022
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations
  Among Team Members
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members
Daphne Cornelisse
Thomas Rood
Mateusz Malinowski
Yoram Bachrach
Tal Kachman
37
10
0
18 Aug 2022
Human Decision Makings on Curriculum Reinforcement Learning with
  Difficulty Adjustment
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment
Yilei Zeng
Jiali Duan
Yong Li
Emilio Ferrara
Lerrel Pinto
Chloe Kuo
Stefanos Nikolaidis
51
3
0
04 Aug 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable
  Strategies in Two-Player Atari Games
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
33
3
0
18 Jul 2022
Big Learning
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
32
0
0
08 Jul 2022
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning
Matteo Bettini
Ryan Kortvelesy
J. Blumenkamp
Amanda Prorok
26
37
0
07 Jul 2022
Revisiting Some Common Practices in Cooperative Multi-Agent
  Reinforcement Learning
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
34
32
0
15 Jun 2022
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement
  Learning for Robotic Manipulation Tasks
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks
Josip Josifovski
M. Malmir
Noah Klarmann
B. L. Žagar
Nicolás Navarro-Guerrero
Alois C. Knoll
33
17
0
13 Jun 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy
  Optimization
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization
Marco Pleines
Matthias Pallasch
F. Zimmer
Mike Preuss
26
14
0
23 May 2022
Exploring the Benefits of Teams in Multiagent Learning
Exploring the Benefits of Teams in Multiagent Learning
David Radke
Kate Larson
Timothy B. Brecht
AI4TS
39
10
0
04 May 2022
The Importance of Credo in Multiagent Learning
The Importance of Credo in Multiagent Learning
David Radke
Kate Larson
Timothy B. Brecht
32
11
0
15 Apr 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy
  Optimization
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
25
28
0
04 Apr 2022
Blocks Assemble! Learning to Assemble with Large-Scale Structured
  Reinforcement Learning
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning
Seyed Kamyar Seyed Ghasemipour
Daniel Freeman
Byron David
S. Gu
Satoshi Kataoka
Igor Mordatch
OffRL
32
25
0
15 Mar 2022
Learning Markov Games with Adversarial Opponents: Efficient Algorithms
  and Fundamental Limits
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
Qinghua Liu
Yuanhao Wang
Chi Jin
AAML
32
15
0
14 Mar 2022
Whole-Body MPC and Dynamic Occlusion Avoidance: A Maximum Likelihood
  Visibility Approach
Whole-Body MPC and Dynamic Occlusion Avoidance: A Maximum Likelihood Visibility Approach
I. Ibrahim
Farbod Farshidian
Jan Preisig
P. Franklin
P. Rocco
Marco Hutter
27
3
0
04 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
46
11
0
01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
38
9
0
23 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
The Effects of Reward Misspecification: Mapping and Mitigating
  Misaligned Models
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Alexander Pan
Kush S. Bhatia
Jacob Steinhardt
53
172
0
10 Jan 2022
A Deeper Understanding of State-Based Critics in Multi-Agent
  Reinforcement Learning
A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Xueguang Lyu
Andrea Baisero
Yuchen Xiao
Chris Amato
OffRL
22
16
0
03 Jan 2022
Building Human-like Communicative Intelligence: A Grounded Perspective
Building Human-like Communicative Intelligence: A Grounded Perspective
M. Dubova
29
12
0
02 Jan 2022
Sequential memory improves sample and memory efficiency in Episodic
  Control
Sequential memory improves sample and memory efficiency in Episodic Control
Ismael T. Freire
A. F. Amil
P. Verschure
OffRL
16
3
0
29 Dec 2021
Collective Intelligence for Deep Learning: A Survey of Recent
  Developments
Collective Intelligence for Deep Learning: A Survey of Recent Developments
David R Ha
Yu Tang
AI4CE
31
69
0
29 Nov 2021
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement
  Learning
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning
Andrew Cohen
Ervin Teng
Vincent-Pierre Berges
Ruo-Ping Dong
Hunter Henry
Marwan Mattar
Alexander Zook
Sujoy Ganguly
24
33
0
10 Nov 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative
  Multi-Agent Problems
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems
Jiayu Chen
Yuanxin Zhang
Yuanfan Xu
Huimin Ma
Huazhong Yang
Jiaming Song
Yu Wang
Yi Wu
VLM
DRL
26
32
0
08 Nov 2021
Learning to Simulate Self-Driven Particles System with Coordinated
  Policy Optimization
Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization
Zhenghao Peng
Quanyi Li
Ka-Ming Hui
Chunxiao Liu
Bolei Zhou
44
59
0
26 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task
OPEn: An Open-ended Physics Environment for Learning Without a Task
Chuang Gan
Abhishek Bhandwaldar
Antonio Torralba
J. Tenenbaum
Phillip Isola
LRM
138
4
0
13 Oct 2021
Cooperative Assistance in Robotic Surgery through Multi-Agent
  Reinforcement Learning
Cooperative Assistance in Robotic Surgery through Multi-Agent Reinforcement Learning
Paul Maria Scheikl
B. Gyenes
Tornike Davitashvili
Rayan Younis
A. Schulze
Beat P. Müller-Stich
Gerhard Neumann
M. Wagner
F. Mathis-Ullrich
24
12
0
10 Oct 2021
When Can We Learn General-Sum Markov Games with a Large Number of
  Players Sample-Efficiently?
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
Ziang Song
Song Mei
Yu Bai
74
67
0
08 Oct 2021
SABER: Data-Driven Motion Planner for Autonomously Navigating
  Heterogeneous Robots
SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots
Alexander Schperberg
Stephanie Tsuei
Stefano Soatto
Dennis W. Hong
17
10
0
03 Aug 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
55
181
0
27 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting
  Pot
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
58
104
0
14 Jul 2021
Explore and Control with Adversarial Surprise
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
40
8
0
12 Jul 2021
Towards Distraction-Robust Active Visual Tracking
Towards Distraction-Robust Active Visual Tracking
Fangwei Zhong
Peng Sun
Wenhan Luo
Tingyun Yan
Yizhou Wang
AAML
30
33
0
18 Jun 2021
Reinforcement learning for pursuit and evasion of microswimmers at low
  Reynolds number
Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number
Francesco Borra
Luca Biferale
M. Cencini
A. Celani
24
21
0
16 Jun 2021
Previous
1234
Next