Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.09432
Cited By
Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection
13 October 2023
Davide Napolitano
Lorenzo Vaiani
Luca Cagliero
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection"
2 / 2 papers shown
Title
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee
Mandar Joshi
Iulia Turc
Hexiang Hu
Fangyu Liu
Julian Martin Eisenschlos
Urvashi Khandelwal
Peter Shaw
Ming-Wei Chang
Kristina Toutanova
CLIP
VLM
169
263
0
07 Oct 2022
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
153
498
0
29 Dec 2020
1