SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas

18 March 2025

Abstract

Sequential social dilemmas pose a significant challenge in the field of multi-agent reinforcement learning (MARL), requiring environments that accurately reflect the tension between individual and collective interests. Previous benchmarks and environments, such as Melting Pot, provide an evaluation protocol that measures generalization to new social partners in various test scenarios. However, running reinforcement learning algorithms in traditional environments requires substantial computational resources. In this paper, we introduce SocialJax, a suite of sequential social dilemma environments and algorithms implemented in JAX. JAX is a high-performance numerical computing library for Python that enables significant improvements in operational efficiency. Our experiments demonstrate that the SocialJax training pipeline achieves at least 50\texttimes{} speed-up in real-time performance compared to Melting Pot RLlib baselines. Additionally, we validate the effectiveness of baseline algorithms within SocialJax environments. Finally, we use Schelling diagrams to verify the social dilemma properties of these environments, ensuring that they accurately capture the dynamics of social dilemmas.

View on arXiv

@article{guo2025_2503.14576,
  title={ SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas },
  author={ Zihao Guo and Shuqing Shi and Richard Willis and Tristan Tomilin and Joel Z. Leibo and Yali Du },
  journal={arXiv preprint arXiv:2503.14576},
  year={ 2025 }
}

Comments on this paper