ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.15077
  4. Cited By
B2MAPO: A Batch-by-Batch Multi-Agent Policy Optimization to Balance
  Performance and Efficiency

B2MAPO: A Batch-by-Batch Multi-Agent Policy Optimization to Balance Performance and Efficiency

21 July 2024
Wenjing Zhang
Wei Zhang
Wenqing Hu
Yifan Wang
    OffRL
ArXivPDFHTML

Papers citing "B2MAPO: A Batch-by-Batch Multi-Agent Policy Optimization to Balance Performance and Efficiency"

1 / 1 papers shown
Title
Hierarchical Reinforcement Learning with Timed Subgoals
Hierarchical Reinforcement Learning with Timed Subgoals
Nico Gürtler
Le Chen
Georg Martius
51
22
0
06 Dec 2021
1