A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images

17 April 2023

Papers citing "A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images"

An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents Ayesha Amjad Saurav Sthapit Tahir Qasim Syed 43 0 0 16 May 2025
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding Zhangxuan Gu Changhua Meng Ke Wang Jun Lan Weiqiang Wang Ming Gu Liqing Zhang 57 79 0 14 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding Jiapeng Wang Lianwen Jin Kai Ding VLM 51 140 0 28 Feb 2022
Value Retrieval with Arbitrary Queries for Form-like Documents M. Gao Le Xue Chetan Ramaiah Chen Xing Ran Xu Caiming Xiong 123 6 0 15 Dec 2021
Text Classification Models for Form Entity Linking M. Villota C. Domínguez Jónathan Heras Eloy J. Mata Vico Pascual MedIm 36 2 0 14 Dec 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents Yue Zhang Bo Zhang Rui Wang Junjie Cao Chen Li Zuyi Bao 64 32 0 19 Oct 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Teakgyu Hong Donghyun Kim Mingi Ji Wonseok Hwang Daehyun Nam Sungrae Park VLM 54 153 0 10 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers Yulin Li Yuxi Qian Yuchen Yu Xiameng Qin Chengquan Zhang Yan Liu Kun Yao Junyu Han Jingtuo Liu Errui Ding 61 116 0 06 Aug 2021
End-to-End Hierarchical Relation Extraction for Generic Form Understanding Tuan-Anh Dang Nguyen Duc Thanh Hoang Q. Tran Chih-Wei Pan T. Nguyen 53 10 0 02 Jun 2021
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net Tuan-Anh Dang Nguyen Dat Nguyen Thanh 30 16 0 02 Jun 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents Weihong Lin Qifang Gao Lei-huan Sun Zhuoyao Zhong Kaiqin Hu Qin Ren Qiang Huo 50 39 0 25 May 2021
Visual FUDGE: Form Understanding via Dynamic Graph Editing Brian L. Davis B. Morse Brian L. Price Chris Tensmeyer Curtis Wigington AI4CE 41 20 0 17 May 2021
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction Zheng Huang Kai Chen Jianhua He X. Bai Dimosthenis Karatzas Shijian Lu C. V. Jawahar 45 311 0 18 Mar 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer Rafal Powalski Łukasz Borchmann Dawid Jurkiewicz Tomasz Dwojak Michal Pietruszka Gabriela Pałka ViT 53 157 0 18 Feb 2021
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding Zilong Wang Mingjie Zhan Xuebo Liu Ding Liang 45 38 0 15 Oct 2020
DocVQA: A Dataset for VQA on Document Images Minesh Mathew Dimosthenis Karatzas C. V. Jawahar 103 700 0 01 Jul 2020
End-to-End Object Detection with Transformers Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov Sergey Zagoruyko ViT 3DV PINN 314 12,906 0 26 May 2020
Spatial Dependency Parsing for Semi-Structured Document Information Extraction Wonseok Hwang Jinyeong Yim Seunghyun Park Sohee Yang Minjoon Seo 61 94 0 01 May 2020
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks Wenwen Yu Ning Lu Xianbiao Qi Ping Gong Rong Xiao 49 136 0 16 Apr 2020
LAMBERT: Layout-Aware (Language) Modeling for information extraction Lukasz Garncarek Rafal Powalski Tomasz Stanislawek Bartosz Topolski Piotr Halama M. Turski Filip Graliñski 35 87 0 19 Feb 2020
LayoutLM: Pre-training of Text and Layout for Document Image Understanding Yiheng Xu Minghao Li Lei Cui Shaohan Huang Furu Wei Ming Zhou 113 694 0 31 Dec 2019
Deep Visual Template-Free Form Parsing Brian L. Davis B. Morse Scott D. Cohen Brian L. Price Chris Tensmeyer 50 42 0 05 Sep 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents Guillaume Jaume H. K. Ekenel Jean-Philippe Thiran 151 363 0 27 May 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.1K 93,936 0 11 Oct 2018
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 506 129,831 0 12 Jun 2017
Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval Adam W. Harley Alex Ufkes Konstantinos G. Derpanis 80 393 0 25 Feb 2015