Learning to Plan & Schedule with Reinforcement-Learned Bimanual Robot Skills
- OffRL
 

Long-horizon, contact-rich bimanual manipulation is a significant challenge: it requires complex coordination that mixes parallel execution with sequential collaboration between the arms. In this paper, we introduce a hierarchical framework that frames this challenge as an integrated skill planning & scheduling problem, going beyond purely sequential decision-making to support simultaneous skill invocation. Our approach is built on a library of single-arm and bimanual primitive skills, each trained with Reinforcement Learning (RL) in GPU-accelerated simulation. We then train a Transformer-based planner on a dataset of skill compositions to act as a high-level scheduler, jointly predicting the discrete schedule of skills and their continuous parameters. We demonstrate that our method achieves higher success rates on complex, contact-rich tasks than end-to-end RL approaches, and that it produces more efficient, coordinated behaviors than traditional sequential-only planners.
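As a rough illustration of how a planner can jointly predict a discrete skill schedule and its continuous parameters, the sketch below pairs a classification head with a regression head on per-slot Transformer tokens. This is a minimal sketch, not the paper's implementation: the module names, dimensions, skill vocabulary, and the one-scheduling-slot-per-arm design are all assumptions.

```python
# Hypothetical sketch of a skill-scheduling planner head (PyTorch).
# All names, shapes, and the skill vocabulary are illustrative assumptions;
# the paper's actual architecture may differ.
import torch
import torch.nn as nn

NUM_SKILLS = 8   # e.g. single-arm reach/grasp, bimanual lift, no-op, ...
PARAM_DIM = 7    # continuous skill parameters, e.g. a target end-effector pose
D_MODEL = 128


class SkillPlanner(nn.Module):
    """For each scheduling slot (one per arm), predicts which skill to
    invoke and the continuous parameters that condition it."""

    def __init__(self, obs_dim: int, num_slots: int = 2):
        super().__init__()
        self.obs_proj = nn.Linear(obs_dim, D_MODEL)
        # One learned query token per arm/slot.
        self.slot_embed = nn.Embedding(num_slots, D_MODEL)
        layer = nn.TransformerEncoderLayer(
            d_model=D_MODEL, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.skill_head = nn.Linear(D_MODEL, NUM_SKILLS)  # discrete schedule
        self.param_head = nn.Linear(D_MODEL, PARAM_DIM)   # continuous parameters

    def forward(self, obs: torch.Tensor):
        # obs: (batch, obs_dim) scene/task context.
        batch = obs.shape[0]
        ctx = self.obs_proj(obs).unsqueeze(1)                  # (B, 1, D)
        slots = self.slot_embed.weight.unsqueeze(0).expand(batch, -1, -1)
        tokens = self.encoder(torch.cat([ctx, slots], dim=1))  # (B, 1+S, D)
        slot_tokens = tokens[:, 1:]                            # drop context token
        return self.skill_head(slot_tokens), self.param_head(slot_tokens)


planner = SkillPlanner(obs_dim=32)
skill_logits, params = planner(torch.randn(4, 32))
print(skill_logits.shape, params.shape)  # (4, 2, 8) and (4, 2, 7)
```

Under these assumptions, one plausible training setup on a skill-composition dataset would apply a cross-entropy loss to the discrete head and a regression loss to the continuous head; emitting one slot per arm is what allows the scheduler to invoke two skills simultaneously rather than strictly in sequence.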
View on arXiv