ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.15157
5
0

Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation

18 June 2025
Hanbit Oh
Andrea M. Salcedo-Vázquez
Ixchel G. Ramirez-Alpizar
Yukiyasu Domae
ArXiv (abs)PDFHTML
Main:7 Pages
7 Figures
Bibliography:2 Pages
2 Tables
Abstract

Imitation learning (IL) aims to enable robots to perform tasks autonomously by observing a few human demonstrations. Recently, a variant of IL, called In-Context IL, utilized off-the-shelf large language models (LLMs) as instant policies that understand the context from a few given demonstrations to perform a new task, rather than explicitly updating network models with large-scale demonstrations. However, its reliability in the robotics domain is undermined by hallucination issues such as LLM-based instant policy, which occasionally generates poor trajectories that deviate from the given demonstrations. To alleviate this problem, we propose a new robust in-context imitation learning algorithm called the robust instant policy (RIP), which utilizes a Student's t-regression model to be robust against the hallucinated trajectories of instant policies to allow reliable trajectory generation. Specifically, RIP generates several candidate robot trajectories to complete a given task from an LLM and aggregates them using the Student's t-distribution, which is beneficial for ignoring outliers (i.e., hallucinations); thereby, a robust trajectory against hallucinations is generated. Our experiments, conducted in both simulated and real-world environments, show that RIP significantly outperforms state-of-the-art IL methods, with at least 26%26\%26% improvement in task success rates, particularly in low-data scenarios for everyday tasks. Video results available atthis https URL.

View on arXiv
@article{oh2025_2506.15157,
  title={ Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation },
  author={ Hanbit Oh and Andrea M. Salcedo-Vázquez and Ixchel G. Ramirez-Alpizar and Yukiyasu Domae },
  journal={arXiv preprint arXiv:2506.15157},
  year={ 2025 }
}
Comments on this paper