Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.07957
Cited By
A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
17 April 2023
Kai Hu
Zhuoyuan Wu
Zhuoyao Zhong
Weihong Lin
Lei-huan Sun
Qiang Huo
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images"
26 / 26 papers shown
Title
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents
Ayesha Amjad
Saurav Sthapit
Tahir Qasim Syed
43
0
0
16 May 2025
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
57
79
0
14 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
51
140
0
28 Feb 2022
Value Retrieval with Arbitrary Queries for Form-like Documents
M. Gao
Le Xue
Chetan Ramaiah
Chen Xing
Ran Xu
Caiming Xiong
123
6
0
15 Dec 2021
Text Classification Models for Form Entity Linking
M. Villota
C. Domínguez
Jónathan Heras
Eloy J. Mata
Vico Pascual
MedIm
36
2
0
14 Dec 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents
Yue Zhang
Bo Zhang
Rui Wang
Junjie Cao
Chen Li
Zuyi Bao
64
32
0
19 Oct 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents
Teakgyu Hong
Donghyun Kim
Mingi Ji
Wonseok Hwang
Daehyun Nam
Sungrae Park
VLM
54
153
0
10 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
61
116
0
06 Aug 2021
End-to-End Hierarchical Relation Extraction for Generic Form Understanding
Tuan-Anh Dang Nguyen
Duc Thanh Hoang
Q. Tran
Chih-Wei Pan
T. Nguyen
53
10
0
02 Jun 2021
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net
Tuan-Anh Dang Nguyen
Dat Nguyen Thanh
30
16
0
02 Jun 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
50
39
0
25 May 2021
Visual FUDGE: Form Understanding via Dynamic Graph Editing
Brian L. Davis
B. Morse
Brian L. Price
Chris Tensmeyer
Curtis Wigington
AI4CE
41
20
0
17 May 2021
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Zheng Huang
Kai Chen
Jianhua He
X. Bai
Dimosthenis Karatzas
Shijian Lu
C. V. Jawahar
45
311
0
18 Mar 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafal Powalski
Łukasz Borchmann
Dawid Jurkiewicz
Tomasz Dwojak
Michal Pietruszka
Gabriela Pałka
ViT
53
157
0
18 Feb 2021
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding
Zilong Wang
Mingjie Zhan
Xuebo Liu
Ding Liang
45
38
0
15 Oct 2020
DocVQA: A Dataset for VQA on Document Images
Minesh Mathew
Dimosthenis Karatzas
C. V. Jawahar
103
700
0
01 Jul 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
314
12,906
0
26 May 2020
Spatial Dependency Parsing for Semi-Structured Document Information Extraction
Wonseok Hwang
Jinyeong Yim
Seunghyun Park
Sohee Yang
Minjoon Seo
61
94
0
01 May 2020
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks
Wenwen Yu
Ning Lu
Xianbiao Qi
Ping Gong
Rong Xiao
49
136
0
16 Apr 2020
LAMBERT: Layout-Aware (Language) Modeling for information extraction
Lukasz Garncarek
Rafal Powalski
Tomasz Stanislawek
Bartosz Topolski
Piotr Halama
M. Turski
Filip Graliñski
35
87
0
19 Feb 2020
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Yiheng Xu
Minghao Li
Lei Cui
Shaohan Huang
Furu Wei
Ming Zhou
113
694
0
31 Dec 2019
Deep Visual Template-Free Form Parsing
Brian L. Davis
B. Morse
Scott D. Cohen
Brian L. Price
Chris Tensmeyer
50
42
0
05 Sep 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
151
363
0
27 May 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.1K
93,936
0
11 Oct 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
506
129,831
0
12 Jun 2017
Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval
Adam W. Harley
Alex Ufkes
Konstantinos G. Derpanis
80
393
0
25 Feb 2015
1