Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.14252
Cited By
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding
21 March 2024
Masato Fujitake
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding"
3 / 3 papers shown
Title
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee
Mandar Joshi
Iulia Turc
Hexiang Hu
Fangyu Liu
Julian Martin Eisenschlos
Urvashi Khandelwal
Peter Shaw
Ming-Wei Chang
Kristina Toutanova
CLIP
VLM
163
263
0
07 Oct 2022
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
153
498
0
29 Dec 2020
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
134
355
0
27 May 2019
1