Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.08836
Cited By
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
18 April 2021
Yiheng Xu
Tengchao Lv
Lei Cui
Guoxin Wang
Yijuan Lu
D. Florêncio
Cha Zhang
Furu Wei
MLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding"
27 / 77 papers shown
Title
Manifestations of Xenophobia in AI Systems
Nenad Tomašev
J. L. Maynard
Iason Gabriel
24
9
0
15 Dec 2022
Unifying Vision, Text, and Layout for Universal Document Processing
Zineng Tang
Ziyi Yang
Guoxin Wang
Yuwei Fang
Yang Liu
Chenguang Zhu
Michael Zeng
Chao-Yue Zhang
Joey Tianyi Zhou
VLM
32
105
0
05 Dec 2022
QueryForm: A Simple Zero-shot Form Entity Query Framework
Zifeng Wang
Zizhao Zhang
Jacob Devlin
Chen-Yu Lee
Guolong Su
Hao Zhang
Jennifer Dy
Vincent Perot
Tomas Pfister
19
7
0
14 Nov 2022
Unimodal and Multimodal Representation Training for Relation Extraction
Ciaran Cooney
Rachel Heyburn
Liam Maddigan
Mairead O'Cuinn
Chloe Thompson
Joana Cavadas
30
2
0
11 Nov 2022
On Web-based Visual Corpus Construction for Visual Document Understanding
Donghyun Kim
Teakgyu Hong
Moonbin Yim
Yoonsik Kim
Geewook Kim
34
3
0
07 Nov 2022
ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select
Yuchen Zhuang
Yinghao Li
Jerry Junyang Cheung
Yue Yu
Yingjun Mou
Xinyu Chen
Le Song
Chao Zhang
21
19
0
26 Oct 2022
PP-StructureV2: A Stronger Document Analysis System
Chenxia Li
Ruoyu Guo
Jun Zhou
Mengtao An
Yuning Du
Lingfeng Zhu
Yi Liu
Xiaoguang Hu
Dianhai Yu
51
22
0
11 Oct 2022
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections
R. Arroyo
J. Yebes
E. Martínez
Hector Corrales
Javier Lorenzo
33
1
0
07 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
50
13
0
06 Oct 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
30
11
0
18 Sep 2022
Understanding Long Documents with Different Position-Aware Attentions
Hai Pham
Guoxin Wang
Yijuan Lu
D. Florêncio
Changrong Zhang
16
9
0
17 Aug 2022
Sequence-aware multimodal page classification of Brazilian legal documents
Pedro Henrique Luz de Araujo
Ana Paula G. S. de Almeida
Fabricio Ataides Braz
Nilton Correia da Silva
Flávio de Barros Vidal
Teofilo de Campos
6
7
0
02 Jul 2022
Business Document Information Extraction: Towards Practical Benchmarks
Matyás Skalický
Stepán Simsa
Michal Uřičář
Milan Šulc
25
9
0
20 Jun 2022
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu
Chao Wang
Wenqiang Lei
Ziyang Liu
Tat-Seng Chua
30
2
0
14 Jun 2022
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents
Hervé Déjean
S. Clinchant
Jean-Luc Meunier
22
4
0
09 May 2022
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Yupan Huang
Tengchao Lv
Lei Cui
Yutong Lu
Furu Wei
27
432
0
18 Apr 2022
Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework
Zilong Wang
Jingbo Shang
28
10
0
30 Mar 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
37
76
0
14 Mar 2022
DiT: Self-supervised Pre-training for Document Image Transformer
Junlong Li
Yiheng Xu
Tengchao Lv
Lei Cui
Chaoxi Zhang
Furu Wei
ViT
VLM
35
159
0
04 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
33
138
0
28 Feb 2022
Combining Deep Learning and Reasoning for Address Detection in Unstructured Text Documents
Matthias Engelbach
Dennis Klau
Jens Drawehn
Maximilien Kintz
9
2
0
07 Feb 2022
OCR-free Document Understanding Transformer
Geewook Kim
Teakgyu Hong
Moonbin Yim
Jeongyeon Nam
Jinyoung Park
Jinyeong Yim
Wonseok Hwang
Sangdoo Yun
Dongyoon Han
Seunghyun Park
ViT
52
262
0
30 Nov 2021
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
24
69
0
16 Nov 2021
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding
Junlong Li
Yiheng Xu
Lei Cui
Furu Wei
VLM
3DGS
31
59
0
16 Oct 2021
A Span Extraction Approach for Information Extraction on Visually-Rich Documents
Tuan-Anh Dang Nguyen
Hieu M. Vu
Nguyen Hong Son
Minh-Tien Nguyen
19
6
0
02 Jun 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
153
498
0
29 Dec 2020
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
140
355
0
27 May 2019
Previous
1
2