Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.02356
Cited By
Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout
4 March 2020
Filip Graliñski
Tomasz Stanislawek
Anna Wróblewska
Dawid Lipiñski
Agnieszka Kaliska
Paulina Rosalska
Bartosz Topolski
P. Biecek
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout"
29 / 29 papers shown
Title
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription
Benjamin Gutteridge
Matthew Thomas Jackson
Toni Kukurin
Xiaowen Dong
34
0
0
27 Feb 2025
The future of document indexing: GPT and Donut revolutionize table of content processing
Degaga Wolde Feyisa
Haylemicheal Berihun
Amanuel Zewdu
Mahsa Najimoghadam
Marzieh Zare
29
0
0
12 Mar 2024
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
Hongshen Xu
Lu Chen
Zihan Zhao
Da Ma
Ruisheng Cao
Zichen Zhu
Kai Yu
37
2
0
28 Feb 2024
Classification of Visualization Types and Perspectives in Patents
J. Ghauri
Eric Müller-Budack
Ralph Ewerth
27
2
0
19 Jul 2023
Estimating Post-OCR Denoising Complexity on Numerical Texts
Arthur Hemmer
Jérôme Brachat
Mickael Coustaty
J. Ogier
14
3
0
03 Jul 2023
Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents
Catherine Chen
Zejiang Shen
Dan Klein
Gabriel Stanovsky
Doug Downey
Kyle Lo
19
2
0
01 Jun 2023
RE
2
^2
2
: Region-Aware Relation Extraction from Visually Rich Documents
Pritika Ramu
Sijia Wang
Lalla Mouatadid
Joy Rimchala
Lifu Huang
30
0
0
24 May 2023
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization
Laura Nguyen
Thomas Scialom
Benjamin Piwowarski
Jacopo Staiano
27
7
0
26 Jan 2023
Unimodal and Multimodal Representation Training for Relation Extraction
Ciaran Cooney
Rachel Heyburn
Liam Maddigan
Mairead O'Cuinn
Chloe Thompson
Joana Cavadas
28
2
0
11 Nov 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
Qiming Peng
Yinxu Pan
Wenjin Wang
Bin Luo
Zhenyu Zhang
...
Shi Feng
Yu Sun
Hao Tian
Hua-Hong Wu
Haifeng Wang
13
83
0
12 Oct 2022
Understanding Long Documents with Different Position-Aware Attentions
Hai Pham
Guoxin Wang
Yijuan Lu
D. Florêncio
Changrong Zhang
11
9
0
17 Aug 2022
Towards Complex Document Understanding By Discrete Reasoning
Fengbin Zhu
Wenqiang Lei
Fuli Feng
Chao Wang
Haozhou Zhang
Tat-Seng Chua
31
42
0
25 Jul 2022
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Zhanzhan Cheng
Peng Zhang
Can Li
Qiao Liang
Yunlu Xu
Pengfei Li
Shiliang Pu
Yi Niu
Fei Wu
18
10
0
14 Jul 2022
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu
Chao Wang
Wenqiang Lei
Ziyang Liu
Tat-Seng Chua
22
2
0
14 Jun 2022
Detection Masking for Improved OCR on Noisy Documents
Daniel Rotman
Ophir Azulai
Inbar Shapira
Yevgeny Burshtein
Udi Barzelay
38
4
0
17 May 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Chen-Yu Lee
Chun-Liang Li
Timothy Dozat
Vincent Perot
Guolong Su
Nan Hua
Joshua Ainslie
Renshen Wang
Yasuhisa Fujii
Tomas Pfister
17
77
0
16 Mar 2022
Information Extraction from Visually Rich Documents with Font Style Embeddings
Ismail Oussaid
William Vanhuffel
Pirashanth Ratnamogan
Mhamed Hajaiej
Alexis Mathey
Thomas Gilles
19
1
0
07 Nov 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents
Yue Zhang
Bo-Wen Zhang
Rui Wang
Junjie Cao
Chen Li
Zuyi Bao
40
32
0
19 Oct 2021
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding
Junlong Li
Yiheng Xu
Lei Cui
Furu Wei
VLM
3DGS
23
59
0
16 Oct 2021
Form2Seq : A Framework for Higher-Order Form Structure Extraction
Milan Aggarwal
Hiresh Gupta
Mausoom Sarkar
Balaji Krishnamurthy
3DV
6
22
0
09 Jul 2021
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
29
270
0
22 Jun 2021
Doc2Dict: Information Extraction as Text Generation
Benjamin Townsend
Eamon Ito-Fisher
Lily Zhang
Madison May
28
7
0
16 May 2021
DocReader: Bounding-Box Free Training of a Document Information Extraction Model
S. Klaiman
Marius Lehne
13
6
0
10 May 2021
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
Yiheng Xu
Tengchao Lv
Lei Cui
Guoxin Wang
Yijuan Lu
D. Florêncio
Cha Zhang
Furu Wei
MLLM
VLM
29
127
0
18 Apr 2021
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction
Freddy Chongtat Chua
Nigel P. Duffy
40
7
0
10 Mar 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafal Powalski
Łukasz Borchmann
Dawid Jurkiewicz
Tomasz Dwojak
Michal Pietruszka
Gabriela Pałka
ViT
30
157
0
18 Feb 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
150
498
0
29 Dec 2020
LAMBERT: Layout-Aware (Language) Modeling for information extraction
Lukasz Garncarek
Rafal Powalski
Tomasz Stanislawek
Bartosz Topolski
Piotr Halama
M. Turski
Filip Graliñski
8
87
0
19 Feb 2020
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
134
355
0
27 May 2019
1