arXiv: 2304.07957

A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images

17 April 2023
Kai Hu
Zhuoyuan Wu
Zhuoyao Zhong
Weihong Lin
Lei-huan Sun
Qiang Huo

Papers citing "A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images"

26 / 26 papers shown
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents
Ayesha Amjad
Saurav Sthapit
Tahir Qasim Syed
43
0
0
16 May 2025
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
57
79
0
14 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
51
140
0
28 Feb 2022
Value Retrieval with Arbitrary Queries for Form-like Documents
M. Gao
Le Xue
Chetan Ramaiah
Chen Xing
Ran Xu
Caiming Xiong
123
6
0
15 Dec 2021
Text Classification Models for Form Entity Linking
M. Villota
C. Domínguez
Jónathan Heras
Eloy J. Mata
Vico Pascual
MedIm
36
2
0
14 Dec 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents
Yue Zhang
Bo Zhang
Rui Wang
Junjie Cao
Chen Li
Zuyi Bao
64
32
0
19 Oct 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents
Teakgyu Hong
Donghyun Kim
Mingi Ji
Wonseok Hwang
Daehyun Nam
Sungrae Park
VLM
54
153
0
10 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
61
116
0
06 Aug 2021
End-to-End Hierarchical Relation Extraction for Generic Form Understanding
Tuan-Anh Dang Nguyen
Duc Thanh Hoang
Q. Tran
Chih-Wei Pan
T. Nguyen
53
10
0
02 Jun 2021
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net
Tuan-Anh Dang Nguyen
Dat Nguyen Thanh
30
16
0
02 Jun 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
50
39
0
25 May 2021
Visual FUDGE: Form Understanding via Dynamic Graph Editing
Brian L. Davis
B. Morse
Brian L. Price
Chris Tensmeyer
Curtis Wigington
AI4CE
41
20
0
17 May 2021
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Zheng Huang
Kai Chen
Jianhua He
X. Bai
Dimosthenis Karatzas
Shijian Lu
C. V. Jawahar
45
311
0
18 Mar 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafal Powalski
Łukasz Borchmann
Dawid Jurkiewicz
Tomasz Dwojak
Michal Pietruszka
Gabriela Pałka
ViT
53
157
0
18 Feb 2021
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding
Zilong Wang
Mingjie Zhan
Xuebo Liu
Ding Liang
45
38
0
15 Oct 2020
DocVQA: A Dataset for VQA on Document Images
Minesh Mathew
Dimosthenis Karatzas
C. V. Jawahar
103
700
0
01 Jul 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
314
12,906
0
26 May 2020
Spatial Dependency Parsing for Semi-Structured Document Information Extraction
Wonseok Hwang
Jinyeong Yim
Seunghyun Park
Sohee Yang
Minjoon Seo
61
94
0
01 May 2020
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks
Wenwen Yu
Ning Lu
Xianbiao Qi
Ping Gong
Rong Xiao
49
136
0
16 Apr 2020
LAMBERT: Layout-Aware (Language) Modeling for information extraction
Lukasz Garncarek
Rafal Powalski
Tomasz Stanislawek
Bartosz Topolski
Piotr Halama
M. Turski
Filip Graliński
35
87
0
19 Feb 2020
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Yiheng Xu
Minghao Li
Lei Cui
Shaohan Huang
Furu Wei
Ming Zhou
113
694
0
31 Dec 2019
Deep Visual Template-Free Form Parsing
Brian L. Davis
B. Morse
Scott D. Cohen
Brian L. Price
Chris Tensmeyer
50
42
0
05 Sep 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
151
363
0
27 May 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.1K
93,936
0
11 Oct 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
506
129,831
0
12 Jun 2017
Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval
Adam W. Harley
Alex Ufkes
Konstantinos G. Derpanis
80
393
0
25 Feb 2015