BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual
  Questions

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

Papers citing "BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions"

35 / 35 papers shown
Title