ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.02477
  4. Cited By
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision

Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision

3 April 2025
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
Dong An
Changwei Wang
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
ArXivPDFHTML

Papers citing "Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision"

1 / 1 papers shown
Title
VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation
VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation
Chaofan Zhang
Peng Hao
Xiaoge Cao
Xiaoshuai Hao
Shaowei Cui
Shuo Wang
32
0
0
14 May 2025
1