Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout

4 March 2020

Papers citing "Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout"


Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription Benjamin Gutteridge Matthew Thomas Jackson Toni Kukurin Xiaowen Dong 34 0 0 27 Feb 2025
The future of document indexing: GPT and Donut revolutionize table of content processing Degaga Wolde Feyisa Haylemicheal Berihun Amanuel Zewdu Mahsa Najimoghadam Marzieh Zare 29 0 0 12 Mar 2024
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding Hongshen Xu Lu Chen Zihan Zhao Da Ma Ruisheng Cao Zichen Zhu Kai Yu 37 2 0 28 Feb 2024
Classification of Visualization Types and Perspectives in Patents J. Ghauri Eric Müller-Budack Ralph Ewerth 27 2 0 19 Jul 2023
Estimating Post-OCR Denoising Complexity on Numerical Texts Arthur Hemmer Jérôme Brachat Mickael Coustaty J. Ogier 14 3 0 03 Jul 2023
Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents Catherine Chen Zejiang Shen Dan Klein Gabriel Stanovsky Doug Downey Kyle Lo 19 2 0 01 Jun 2023
RE$^2$: Region-Aware Relation Extraction from Visually Rich Documents Pritika Ramu Sijia Wang Lalla Mouatadid Joy Rimchala Lifu Huang 30 0 0 24 May 2023
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization Laura Nguyen Thomas Scialom Benjamin Piwowarski Jacopo Staiano 27 7 0 26 Jan 2023
Unimodal and Multimodal Representation Training for Relation Extraction Ciaran Cooney Rachel Heyburn Liam Maddigan Mairead O'Cuinn Chloe Thompson Joana Cavadas 28 2 0 11 Nov 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding Qiming Peng Yinxu Pan Wenjin Wang Bin Luo Zhenyu Zhang ... Shi Feng Yu Sun Hao Tian Hua-Hong Wu Haifeng Wang 13 83 0 12 Oct 2022
Understanding Long Documents with Different Position-Aware Attentions Hai Pham Guoxin Wang Yijuan Lu D. Florêncio Changrong Zhang 11 9 0 17 Aug 2022
Towards Complex Document Understanding By Discrete Reasoning Fengbin Zhu Wenqiang Lei Fuli Feng Chao Wang Haozhou Zhang Tat-Seng Chua 31 42 0 25 Jul 2022
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents Zhanzhan Cheng Peng Zhang Can Li Qiao Liang Yunlu Xu Pengfei Li Shiliang Pu Yi Niu Fei Wu 18 10 0 14 Jul 2022
RDU: A Region-based Approach to Form-style Document Understanding Fengbin Zhu Chao Wang Wenqiang Lei Ziyang Liu Tat-Seng Chua 22 2 0 14 Jun 2022
Detection Masking for Improved OCR on Noisy Documents Daniel Rotman Ophir Azulai Inbar Shapira Yevgeny Burshtein Udi Barzelay 38 4 0 17 May 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction Chen-Yu Lee Chun-Liang Li Timothy Dozat Vincent Perot Guolong Su Nan Hua Joshua Ainslie Renshen Wang Yasuhisa Fujii Tomas Pfister 17 77 0 16 Mar 2022
Information Extraction from Visually Rich Documents with Font Style Embeddings Ismail Oussaid William Vanhuffel Pirashanth Ratnamogan Mhamed Hajaiej Alexis Mathey Thomas Gilles 19 1 0 07 Nov 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents Yue Zhang Bo-Wen Zhang Rui Wang Junjie Cao Chen Li Zuyi Bao 40 32 0 19 Oct 2021
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding Junlong Li Yiheng Xu Lei Cui Furu Wei VLM 3DGS 23 59 0 16 Oct 2021
Form2Seq : A Framework for Higher-Order Form Structure Extraction Milan Aggarwal Hiresh Gupta Mausoom Sarkar Balaji Krishnamurthy 3DV 6 22 0 09 Jul 2021
DocFormer: End-to-End Transformer for Document Understanding Srikar Appalaraju Bhavan A. Jasani Bhargava Urala Kota Yusheng Xie R. Manmatha ViT 29 270 0 22 Jun 2021
Doc2Dict: Information Extraction as Text Generation Benjamin Townsend Eamon Ito-Fisher Lily Zhang Madison May 28 7 0 16 May 2021
DocReader: Bounding-Box Free Training of a Document Information Extraction Model S. Klaiman Marius Lehne 13 6 0 10 May 2021
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding Yiheng Xu Tengchao Lv Lei Cui Guoxin Wang Yijuan Lu D. Florêncio Cha Zhang Furu Wei MLLM VLM 29 127 0 18 Apr 2021
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction Freddy Chongtat Chua Nigel P. Duffy 40 7 0 10 Mar 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer Rafal Powalski Łukasz Borchmann Dawid Jurkiewicz Tomasz Dwojak Michal Pietruszka Gabriela Pałka ViT 30 157 0 18 Feb 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding Yang Xu Yiheng Xu Tengchao Lv Lei Cui Furu Wei ... D. Florêncio Cha Zhang Wanxiang Che Min Zhang Lidong Zhou ViT MLLM 150 498 0 29 Dec 2020
LAMBERT: Layout-Aware (Language) Modeling for information extraction Lukasz Garncarek Rafal Powalski Tomasz Stanislawek Bartosz Topolski Piotr Halama M. Turski Filip Graliński 8 87 0 19 Feb 2020
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents Guillaume Jaume H. K. Ekenel Jean-Philippe Thiran 134 355 0 27 May 2019