Chargrid: Towards Understanding 2D Documents

24 September 2018

Jean Baptiste Faddoul

ArXiv PDF HTML

Papers citing "Chargrid: Towards Understanding 2D Documents"

32 / 32 papers shown

Title
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network Panfeng Cao Jian Wu 25 9 0 02 Oct 2024
DocMamba: Efficient Document Pre-training with State Space Model Pengfei Hu Zhenrong Zhang Jiefeng Ma Shuhang Liu Jun Du Jianshu Zhang Mamba 37 1 0 18 Sep 2024
Noise-Aware Training of Layout-Aware Language Models Ritesh Sarkhel Xiaoqi Ren Lauro Beltrao Costa Guolong Su Vincent Perot Yanan Xie Emmanouil Koukoumidis Arnab Nandi VLM 42 0 0 30 Mar 2024
Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution Jianfeng Kuang Wei Hua Dingkang Liang Mingkun Yang Deqiang Jiang Bo Ren Xiang Bai 27 39 0 12 May 2023
Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding Bhanu Prakash Voutharoja Lizhen Qu Fatemeh Shiri 22 1 0 08 May 2023
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents M. Dhouib G. Bettaieb A. Shabou 17 20 0 24 Apr 2023
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding Qiming Peng Yinxu Pan Wenjin Wang Bin Luo Zhenyu Zhang ... Shi Feng Yu Sun Hao Tian Hua-Hong Wu Haifeng Wang 8 83 0 12 Oct 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding Wenjin Wang Zhengjie Huang Bin Luo Qianglong Chen Qiming Peng ... Weichong Yin Shi Feng Yu Sun Dianhai Yu Yin Zhang ViT 27 11 0 18 Sep 2022
Knowing Where and What: Unified Word Block Pretraining for Document Understanding Song Tao Zijian Wang Tiantian Fan Canjie Luo Can Huang SSL 27 2 0 28 Jul 2022
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents Hervé Déjean S. Clinchant Jean-Luc Meunier 15 4 0 09 May 2022
DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition Denis Coquenet Clément Chatelain Thierry Paquet 24 57 0 23 Mar 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding Zhangxuan Gu Changhua Meng Ke Wang Jun Lan Weiqiang Wang Ming Gu Liqing Zhang 29 76 0 14 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding Jiapeng Wang Lianwen Jin Kai Ding VLM 17 138 0 28 Feb 2022
Data-Efficient Information Extraction from Form-Like Documents Beliz Gunel Navneet Potti Sandeep Tata James Bradley Wendt Marc Najork Jing Xie 24 2 0 07 Jan 2022
LAME: Layout Aware Metadata Extraction Approach for Research Articles Jonghyun Choi Hyesoo Kong Hwamook Yoon Heung-Seon Oh Yuchul Jung 31 3 0 23 Dec 2021
Document AI: Benchmarks, Models and Applications Lei Cui Yiheng Xu Tengchao Lv Furu Wei VLM 21 69 0 16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style Embeddings Ismail Oussaid William Vanhuffel Pirashanth Ratnamogan Mhamed Hajaiej Alexis Mathey Thomas Gilles 16 1 0 07 Nov 2021
Skim-Attention: Learning to Focus via Document Layout Laura Nguyen Thomas Scialom Jacopo Staiano Benjamin Piwowarski 11 9 0 02 Sep 2021
Using Neighborhood Context to Improve Information Extraction from Visual Documents Captured on Mobile Phones Kalpa Gunaratna Vijay Srinivasan Sandeep Nama Hongxia Jin 16 5 0 23 Aug 2021
DocFormer: End-to-End Transformer for Document Understanding Srikar Appalaraju Bhavan A. Jasani Bhargava Urala Kota Yusheng Xie R. Manmatha ViT 27 270 0 22 Jun 2021
Key Information Extraction From Documents: Evaluation And Generator Oliver Bensch Mirela C. Popa Constantin Spille 11 13 0 09 Jun 2021
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net Tuan-Anh Dang Nguyen Dat Nguyen Thanh 11 16 0 02 Jun 2021
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction Freddy Chongtat Chua Nigel P. Duffy 32 7 0 10 Mar 2021
Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution Jiapeng Wang Chongyu Liu Lianwen Jin Guozhi Tang Jiaxin Zhang Shuaitao Zhang Qianying Wang Y. Wu Mingxiang Cai 18 82 0 24 Jan 2021
VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach Mohamed Kerroumi Othmane Sayem A. Shabou 13 21 0 05 Oct 2020
Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning Subhojeet Pramanik Shashank Mujumdar Hima Patel 11 31 0 30 Sep 2020
ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages Colin Lockard Prashant Shiralkar Xin Luna Dong Hannaneh Hajishirzi 10 53 0 14 May 2020
Text Recognition in the Wild: A Survey Xiaoxue Chen Lianwen Jin Yuanzhi Zhu Canjie Luo Tianwei Wang 3DV 23 102 0 07 May 2020
Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout Filip Graliñski Tomasz Stanislawek Anna Wróblewska Dawid Lipiñski Agnieszka Kaliska Paulina Rosalska Bartosz Topolski P. Biecek 23 40 0 04 Mar 2020
Extracting Tables from Documents using Conditional Generative Adversarial Networks and Genetic Algorithms N. Vine Matthew D. Zeigenfuse Mark Rowan GAN 13 12 0 03 Apr 2019
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation Adam Paszke Abhishek Chaurasia Sangpil Kim Eugenio Culurciello SSeg 224 2,056 0 07 Jun 2016
Convolutional Neural Networks for Sentence Classification Yoon Kim AILaw VLM 255 13,364 0 25 Aug 2014