12
0

GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning

Abstract

Training a universal controller for robots with different morphologies is a promising research trend, since it can significantly enhance the robustness and resilience of the robotic system. However, diverse morphologies can yield different dimensions of state space and action space, making it difficult to comply with traditional policy networks. Existing methods address this issue by modularizing the robot configuration, while do not adequately extract and utilize the overall morphological information, which has been proven crucial for training a universal controller. To this end, we propose GCNT, a morphology-agnostic policy network based on improved Graph Convolutional Network (GCN) and Transformer. It exploits the fact that GCN and Transformer can handle arbitrary number of modules to achieve compatibility with diverse morphologies. Our key insight is that the GCN is able to efficiently extract morphology information of robots, while Transformer ensures that it is fully utilized by allowing each node of the robot to communicate this information directly. Experimental results show that our method can generate resilient locomotion behaviors for robots with different configurations, including zero-shot generalization to robot morphologies not seen during training. In particular, GCNT achieved the best performance on 8 tasks in the 2 standard benchmarks.

View on arXiv
@article{luo2025_2505.15211,
  title={ GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning },
  author={ Yingbo Luo and Meibao Yao and Xueming Xiao },
  journal={arXiv preprint arXiv:2505.15211},
  year={ 2025 }
}
Comments on this paper