ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.14135
7
0

GAF: Gaussian Action Field as a Dvnamic World Model for Robotic Mlanipulation

17 June 2025
Ying Chai
Litao Deng
Ruizhi Shao
Jiajun Zhang
Liangjun Xing
Hongwen Zhang
Yebin Liu
ArXiv (abs)PDFHTML
Main:8 Pages
9 Figures
Bibliography:5 Pages
4 Tables
Appendix:5 Pages
Abstract

Accurate action inference is critical for vision-based robotic manipulation. Existing approaches typically follow either a Vision-to-Action (V-A) paradigm, predicting actions directly from visual inputs, or a Vision-to-3D-to-Action (V-3D-A) paradigm, leveraging intermediate 3D representations. However, these methods often struggle with action inaccuracies due to the complexity and dynamic nature of manipulation scenes. In this paper, we propose a V-4D-A framework that enables direct action reasoning from motion-aware 4D representations via a Gaussian Action Field (GAF). GAF extends 3D Gaussian Splatting (3DGS) by incorporating learnable motion attributes, allowing simultaneous modeling of dynamic scenes and manipulation actions. To learn time-varying scene geometry and action-aware robot motion, GAF supports three key query types: reconstruction of the current scene, prediction of future frames, and estimation of initial action via robot motion. Furthermore, the high-quality current and future frames generated by GAF facilitate manipulation action refinement through a GAF-guided diffusion model. Extensive experiments demonstrate significant improvements, with GAF achieving +11.5385 dB PSNR and -0.5574 LPIPS improvements in reconstruction quality, while boosting the average success rate in robotic manipulation tasks by 10.33% over state-of-the-art methods. Project page:this http URL

View on arXiv
@article{chai2025_2506.14135,
  title={ GAF: Gaussian Action Field as a Dvnamic World Model for Robotic Mlanipulation },
  author={ Ying Chai and Litao Deng and Ruizhi Shao and Jiajun Zhang and Liangjun Xing and Hongwen Zhang and Yebin Liu },
  journal={arXiv preprint arXiv:2506.14135},
  year={ 2025 }
}
Comments on this paper