HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025 |
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy
OptimizationAdaptive Agents and Multi-Agent Systems (AAMAS), 2022 |