Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.08799
Cited By
Chargrid: Towards Understanding 2D Documents
24 September 2018
Anoop R. Katti
C. Reisswig
Cordula Guder
Sebastian Brarda
S. Bickel
Johannes Höhne
Jean Baptiste Faddoul
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Chargrid: Towards Understanding 2D Documents"
32 / 32 papers shown
Title
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network
Panfeng Cao
Jian Wu
25
9
0
02 Oct 2024
DocMamba: Efficient Document Pre-training with State Space Model
Pengfei Hu
Zhenrong Zhang
Jiefeng Ma
Shuhang Liu
Jun Du
Jianshu Zhang
Mamba
37
1
0
18 Sep 2024
Noise-Aware Training of Layout-Aware Language Models
Ritesh Sarkhel
Xiaoqi Ren
Lauro Beltrao Costa
Guolong Su
Vincent Perot
Yanan Xie
Emmanouil Koukoumidis
Arnab Nandi
VLM
42
0
0
30 Mar 2024
Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Jianfeng Kuang
Wei Hua
Dingkang Liang
Mingkun Yang
Deqiang Jiang
Bo Ren
Xiang Bai
27
39
0
12 May 2023
Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding
Bhanu Prakash Voutharoja
Lizhen Qu
Fatemeh Shiri
22
1
0
08 May 2023
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
M. Dhouib
G. Bettaieb
A. Shabou
17
20
0
24 Apr 2023
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
Qiming Peng
Yinxu Pan
Wenjin Wang
Bin Luo
Zhenyu Zhang
...
Shi Feng
Yu Sun
Hao Tian
Hua-Hong Wu
Haifeng Wang
8
83
0
12 Oct 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
27
11
0
18 Sep 2022
Knowing Where and What: Unified Word Block Pretraining for Document Understanding
Song Tao
Zijian Wang
Tiantian Fan
Canjie Luo
Can Huang
SSL
27
2
0
28 Jul 2022
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents
Hervé Déjean
S. Clinchant
Jean-Luc Meunier
15
4
0
09 May 2022
DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition
Denis Coquenet
Clément Chatelain
Thierry Paquet
24
57
0
23 Mar 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
29
76
0
14 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
17
138
0
28 Feb 2022
Data-Efficient Information Extraction from Form-Like Documents
Beliz Gunel
Navneet Potti
Sandeep Tata
James Bradley Wendt
Marc Najork
Jing Xie
24
2
0
07 Jan 2022
LAME: Layout Aware Metadata Extraction Approach for Research Articles
Jonghyun Choi
Hyesoo Kong
Hwamook Yoon
Heung-Seon Oh
Yuchul Jung
31
3
0
23 Dec 2021
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
21
69
0
16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style Embeddings
Ismail Oussaid
William Vanhuffel
Pirashanth Ratnamogan
Mhamed Hajaiej
Alexis Mathey
Thomas Gilles
16
1
0
07 Nov 2021
Skim-Attention: Learning to Focus via Document Layout
Laura Nguyen
Thomas Scialom
Jacopo Staiano
Benjamin Piwowarski
11
9
0
02 Sep 2021
Using Neighborhood Context to Improve Information Extraction from Visual Documents Captured on Mobile Phones
Kalpa Gunaratna
Vijay Srinivasan
Sandeep Nama
Hongxia Jin
16
5
0
23 Aug 2021
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
27
270
0
22 Jun 2021
Key Information Extraction From Documents: Evaluation And Generator
Oliver Bensch
Mirela C. Popa
Constantin Spille
11
13
0
09 Jun 2021
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net
Tuan-Anh Dang Nguyen
Dat Nguyen Thanh
11
16
0
02 Jun 2021
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction
Freddy Chongtat Chua
Nigel P. Duffy
32
7
0
10 Mar 2021
Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Jiapeng Wang
Chongyu Liu
Lianwen Jin
Guozhi Tang
Jiaxin Zhang
Shuaitao Zhang
Qianying Wang
Y. Wu
Mingxiang Cai
18
82
0
24 Jan 2021
VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach
Mohamed Kerroumi
Othmane Sayem
A. Shabou
13
21
0
05 Oct 2020
Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning
Subhojeet Pramanik
Shashank Mujumdar
Hima Patel
11
31
0
30 Sep 2020
ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages
Colin Lockard
Prashant Shiralkar
Xin Luna Dong
Hannaneh Hajishirzi
10
53
0
14 May 2020
Text Recognition in the Wild: A Survey
Xiaoxue Chen
Lianwen Jin
Yuanzhi Zhu
Canjie Luo
Tianwei Wang
3DV
23
102
0
07 May 2020
Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout
Filip Graliñski
Tomasz Stanislawek
Anna Wróblewska
Dawid Lipiñski
Agnieszka Kaliska
Paulina Rosalska
Bartosz Topolski
P. Biecek
23
40
0
04 Mar 2020
Extracting Tables from Documents using Conditional Generative Adversarial Networks and Genetic Algorithms
N. Vine
Matthew D. Zeigenfuse
Mark Rowan
GAN
13
12
0
03 Apr 2019
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
224
2,056
0
07 Jun 2016
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
255
13,364
0
25 Aug 2014
1