ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.13134
86
0

RHINO: Learning Real-Time Humanoid-Human-Object Interaction from Human Demonstrations

18 February 2025
Jingxiao Chen
Xinyao Li
Jiahang Cao
Zhengbang Zhu
Wentao Dong
Minghuan Liu
Ying Wen
Yong Yu
L. Zhang
Weinan Zhang
ArXivPDFHTML
Abstract

Humanoid robots have shown success in locomotion and manipulation. Despite these basic abilities, humanoids are still required to quickly understand human instructions and react based on human interaction signals to become valuable assistants in human daily life. Unfortunately, most existing works only focus on multi-stage interactions, treating each task separately, and neglecting real-time feedback. In this work, we aim to empower humanoid robots with real-time reaction abilities to achieve various tasks, allowing human to interrupt robots at any time, and making robots respond to humans immediately. To support such abilities, we propose a general humanoid-human-object interaction framework, named RHINO, i.e., Real-time Humanoid-human Interaction and Object manipulation. RHINO provides a unified view of reactive motion, instruction-based manipulation, and safety concerns, over multiple human signal modalities, such as languages, images, and motions. RHINO is a hierarchical learning framework, enabling humanoids to learn reaction skills from human-human-object demonstrations and teleoperation data. In particular, it decouples the interaction process into two levels: 1) a high-level planner inferring human intentions from real-time human behaviors; and 2) a low-level controller achieving reactive motion behaviors and object manipulation skills based on the predicted intentions. We evaluate the proposed framework on a real humanoid robot and demonstrate its effectiveness, flexibility, and safety in various scenarios.

View on arXiv
@article{chen2025_2502.13134,
  title={ RHINO: Learning Real-Time Humanoid-Human-Object Interaction from Human Demonstrations },
  author={ Jingxiao Chen and Xinyao Li and Jiahang Cao and Zhengbang Zhu and Wentao Dong and Minghuan Liu and Ying Wen and Yong Yu and Liqing Zhang and Weinan Zhang },
  journal={arXiv preprint arXiv:2502.13134},
  year={ 2025 }
}
Comments on this paper