ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.03084
41
0

InterMamba: Efficient Human-Human Interaction Generation with Adaptive Spatio-Temporal Mamba

3 June 2025
Zizhao Wu
Yingying Sun
Yiming Chen
Xiaoling Gu
Ruyu Liu
Jiazhou Chen
    Mamba
ArXiv (abs)PDFHTML
Main:8 Pages
8 Figures
Bibliography:2 Pages
5 Tables
Abstract

Human-human interaction generation has garnered significant attention in motion synthesis due to its vital role in understanding humans as social beings. However, existing methods typically rely on transformer-based architectures, which often face challenges related to scalability and efficiency. To address these issues, we propose a novel, efficient human-human interaction generation method based on the Mamba framework, designed to meet the demands of effectively capturing long-sequence dependencies while providing real-time feedback. Specifically, we introduce an adaptive spatio-temporal Mamba framework that utilizes two parallel SSM branches with an adaptive mechanism to integrate the spatial and temporal features of motion sequences. To further enhance the model's ability to capture dependencies within individual motion sequences and the interactions between different individual sequences, we develop two key modules: the self-adaptive spatio-temporal Mamba module and the cross-adaptive spatio-temporal Mamba module, enabling efficient feature learning. Extensive experiments demonstrate that our method achieves state-of-the-art results on two interaction datasets with remarkable quality and efficiency. Compared to the baseline method InterGen, our approach not only improves accuracy but also requires a minimal parameter size of just 66M ,only 36% of InterGen's, while achieving an average inference speed of 0.57 seconds, which is 46% of InterGen's execution time.

View on arXiv
@article{wu2025_2506.03084,
  title={ InterMamba: Efficient Human-Human Interaction Generation with Adaptive Spatio-Temporal Mamba },
  author={ Zizhao Wu and Yingying Sun and Yiming Chen and Xiaoling Gu and Ruyu Liu and Jiazhou Chen },
  journal={arXiv preprint arXiv:2506.03084},
  year={ 2025 }
}
Comments on this paper