ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.20225
  4. Cited By
A Multi-Modal Foundation Model to Assist People with Blindness and Low
  Vision in Environmental Interaction

A Multi-Modal Foundation Model to Assist People with Blindness and Low Vision in Environmental Interaction

31 October 2023
Yu Hao
Fan Yang
Hao Huang
Shuaihang Yuan
Sundeep Rangan
John-Ross Rizzo
Yao Wang
Yi Fang
ArXivPDFHTML

Papers citing "A Multi-Modal Foundation Model to Assist People with Blindness and Low Vision in Environmental Interaction"

1 / 1 papers shown
Title
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
320
4,261
0
30 Jan 2023
1