Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.09288
Cited By
Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
14 March 2024
Zhixuan Shen
Haonan Luo
Sijia Li
Tianrui Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering"
1 / 1 papers shown
Title
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
145
498
0
29 Dec 2020
1