Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.15080
Cited By
v1
v2 (latest)
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
24 May 2023
Geewook Kim
Hodong Lee
D. Kim
Haeji Jung
S. Park
Yoon Kim
Sangdoo Yun
Taeho Kil
Bado Lee
Seunghyun Park
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (46★)
Papers citing
"Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models"
4 / 4 papers shown
Title
TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains
Yoonsik Kim
Moonbin Yim
Ka Yeon Song
LMTD
113
23
0
30 Apr 2024
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming-Hsuan Yang
Fahad Shahbaz Khan
VLM
148
128
0
25 Jul 2023
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding
Yanzhe Zhang
Ruiyi Zhang
Jiuxiang Gu
Yufan Zhou
Nedim Lipka
Diyi Yang
Tongfei Sun
VLM
MLLM
106
238
0
29 Jun 2023
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
237
613
0
10 Mar 2020
1