ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.13538
  4. Cited By
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents

FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents

27 May 2019
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
ArXivPDFHTML

Papers citing "FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents"

50 / 62 papers shown
Title
AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine
AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine
Carlo Siebenschuh
Kyle Hippe
Ozan Gokdemir
Alexander Brace
A. Khan
...
V. Vishwanath
R. Stevens
Arvind Ramanathan
Ian Foster
Robert Underwood
MoE
44
0
0
23 Apr 2025
KIEval: Evaluation Metric for Document Key Information Extraction
KIEval: Evaluation Metric for Document Key Information Extraction
Minsoo Khang
Sang Chul Jung
Sungrae Park
Teakgyu Hong
47
0
0
07 Mar 2025
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
Jonathan Bourne
75
0
0
24 Feb 2025
Spatial Information Integration in Small Language Models for Document Layout Generation and Classification
Spatial Information Integration in Small Language Models for Document Layout Generation and Classification
Pablo Melendez
Clemens Havas
34
0
0
09 Jan 2025
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
Gen Luo
Xue Yang
Wenhan Dou
Zhaokai Wang
Jifeng Dai
Jifeng Dai
Yu Qiao
Xizhou Zhu
VLM
MLLM
62
25
0
10 Oct 2024
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised
  Network
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network
Panfeng Cao
Jian Wu
23
9
0
02 Oct 2024
DocMamba: Efficient Document Pre-training with State Space Model
DocMamba: Efficient Document Pre-training with State Space Model
Pengfei Hu
Zhenrong Zhang
Jiefeng Ma
Shuhang Liu
Jun Du
Jianshu Zhang
Mamba
37
1
0
18 Sep 2024
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
Ofir Abramovich
Niv Nayman
Sharon Fogel
I. Lavi
Ron Litman
Shahar Tsiper
Royee Tichauer
Srikar Appalaraju
Shai Mazor
R. Manmatha
VLM
33
3
0
17 Jul 2024
Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report
Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report
Franz Louis Cesista
VGen
42
6
0
17 Jun 2024
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
Jordy Van Landeghem
Subhajit Maity
Ayan Banerjee
Matthew Blaschko
Marie-Francine Moens
Josep Lladós
Sanket Biswas
41
2
0
12 Jun 2024
Reconstructing training data from document understanding models
Reconstructing training data from document understanding models
Jérémie Dentan
Arnaud Paran
A. Shabou
AAML
SyDa
38
1
0
05 Jun 2024
Lightweight Spatial Modeling for Combinatorial Information Extraction
  From Documents
Lightweight Spatial Modeling for Combinatorial Information Extraction From Documents
Yanfei Dong
Lambert Deng
Jiazheng Zhang
Xiaodong Yu
Ting Lin
Francesco Gelli
Soujanya Poria
W. Lee
30
0
0
08 May 2024
Noise-Aware Training of Layout-Aware Language Models
Noise-Aware Training of Layout-Aware Language Models
Ritesh Sarkhel
Xiaoqi Ren
Lauro Beltrao Costa
Guolong Su
Vincent Perot
Yanan Xie
Emmanouil Koukoumidis
Arnab Nandi
VLM
42
0
0
30 Mar 2024
TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing
TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing
Ran Zmigrod
Zhiqiang Ma
Armineh Nourbakhsh
Sameena Shah
24
4
0
07 Feb 2024
DocGraphLM: Documental Graph Language Model for Information Extraction
DocGraphLM: Documental Graph Language Model for Information Extraction
Dongsheng Wang
Zhiqiang Ma
Armineh Nourbakhsh
Kang Gu
Sameena Shah
29
8
0
05 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
33
6
0
29 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
27
2
0
12 Dec 2023
Towards Improving Document Understanding: An Exploration on
  Text-Grounding via MLLMs
Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs
Yonghui Wang
Wen-gang Zhou
Hao Feng
Keyi Zhou
Houqiang Li
52
18
0
22 Nov 2023
A Multi-Modal Multilingual Benchmark for Document Image Classification
A Multi-Modal Multilingual Benchmark for Document Image Classification
Yoshinari Fujinuma
Siddharth Varia
Nishant Sankaran
Srikar Appalaraju
Bonan Min
Yogarshi Vyas
VLM
18
4
0
25 Oct 2023
On Evaluation of Document Classification using RVL-CDIP
On Evaluation of Document Classification using RVL-CDIP
Stefan Larson
Gordon Lim
Kevin Leach
26
3
0
21 Jun 2023
Language Independent Neuro-Symbolic Semantic Parsing for Form
  Understanding
Language Independent Neuro-Symbolic Semantic Parsing for Form Understanding
Bhanu Prakash Voutharoja
Lizhen Qu
Fatemeh Shiri
22
1
0
08 May 2023
Modeling Entities as Semantic Points for Visual Information Extraction
  in the Wild
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Zhibo Yang
Rujiao Long
Pengfei Wang
Sibo Song
Humen Zhong
Wenqing Cheng
X. Bai
Cong Yao
29
19
0
23 Mar 2023
ShabbyPages: A Reproducible Document Denoising and Binarization Dataset
ShabbyPages: A Reproducible Document Denoising and Binarization Dataset
Alexander Groleau
Kok Wei Chee
Stefan Larson
Samay Maini
Jonathan Boarman
14
2
0
16 Mar 2023
Entry Separation using a Mixed Visual and Textual Language Model:
  Application to 19th century French Trade Directories
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
Bertrand Duménieu
Edwin Carlinet
N. Abadie
Joseph Chazalon
19
0
0
17 Feb 2023
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and
  Layout-Aware Summarization
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization
Laura Nguyen
Thomas Scialom
Benjamin Piwowarski
Jacopo Staiano
24
7
0
26 Jan 2023
Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image
  Models
Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models
Lei Wang
Jian He
Xingdong Xu
Ning Liu
Hui-juan Liu
31
2
0
27 Nov 2022
Text Detection Forgot About Document OCR
Text Detection Forgot About Document OCR
Krzysztof Olejniczak
Milan Šulc
32
9
0
14 Oct 2022
Key Information Extraction in Purchase Documents using Deep Learning and
  Rule-based Corrections
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections
R. Arroyo
J. Yebes
E. Martínez
Hector Corrales
Javier Lorenzo
31
1
0
07 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
48
13
0
06 Oct 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document
  Understanding
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
27
11
0
18 Sep 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
Yu-Chung Hsiao
Fedir Zubach
Maria Wang
Jindong Chen
Victor Carbune
Jason Lin
Maria Wang
Yun Zhu
Jindong Chen
RALM
154
25
0
16 Sep 2022
Augraphy: A Data Augmentation Library for Document Images
Augraphy: A Data Augmentation Library for Document Images
Alexander Groleau
Kok Wei Chee
Stefan Larson
Samay Maini
Jonathan Boarman
19
10
0
30 Aug 2022
Doc2Graph: a Task Agnostic Document Understanding Framework based on
  Graph Neural Networks
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks
Andrea Gemelli
Sanket Biswas
Enrico Civitelli
Josep Lladós
S. Marinai
13
15
0
23 Aug 2022
TaCo: Textual Attribute Recognition via Contrastive Learning
TaCo: Textual Attribute Recognition via Contrastive Learning
Chang Nie
Yiqing Hu
Yanqiu Qu
Hao Liu
Deqiang Jiang
Bo Ren
24
0
0
22 Aug 2022
Information Extraction from Scanned Invoice Images using Text Analysis
  and Layout Features
Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features
H. Ha
Ales Horak
15
14
0
08 Aug 2022
Knowing Where and What: Unified Word Block Pretraining for Document Understanding
Song Tao
Zijian Wang
Tiantian Fan
Canjie Luo
Can Huang
SSL
27
2
0
28 Jul 2022
Towards Complex Document Understanding By Discrete Reasoning
Towards Complex Document Understanding By Discrete Reasoning
Fengbin Zhu
Wenqiang Lei
Fuli Feng
Chao Wang
Haozhou Zhang
Tat-Seng Chua
31
42
0
25 Jul 2022
Test-Time Adaptation for Visual Document Understanding
Test-Time Adaptation for Visual Document Understanding
Sayna Ebrahimi
Sercan Ö. Arik
Tomas Pfister
OOD
31
6
0
15 Jun 2022
V-Doc : Visual questions answers with Documents
V-Doc : Visual questions answers with Documents
Yihao Ding
Zhe Huang
Runlin Wang
Yanhang Zhang
Xianru Chen
Yuzhong Ma
Hyunsuk Chung
S. Han
23
15
0
27 May 2022
Detection Masking for Improved OCR on Noisy Documents
Detection Masking for Improved OCR on Noisy Documents
Daniel Rotman
Ophir Azulai
Inbar Shapira
Yevgeny Burshtein
Udi Barzelay
38
4
0
17 May 2022
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for
  Documents
LayoutXLM vs. GNN: An Empirical Evaluation of Relation Extraction for Documents
Hervé Déjean
S. Clinchant
Jean-Luc Meunier
15
4
0
09 May 2022
Relational Representation Learning in Visually-Rich Documents
Relational Representation Learning in Visually-Rich Documents
Xin Li
Yan Zheng
Yiqing Hu
H. Cao
Yunfei Wu
Deqiang Jiang
Yinsong Liu
Bo Ren
18
12
0
05 May 2022
End-to-end Document Recognition and Understanding with Dessurt
End-to-end Document Recognition and Understanding with Dessurt
Brian L. Davis
B. Morse
Brian L. Price
Chris Tensmeyer
Curtis Wigington
Vlad I. Morariu
VLM
ViT
24
73
0
30 Mar 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich
  Document Understanding
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
29
76
0
14 Mar 2022
DiT: Self-supervised Pre-training for Document Image Transformer
DiT: Self-supervised Pre-training for Document Image Transformer
Junlong Li
Yiheng Xu
Tengchao Lv
Lei Cui
Chaoxi Zhang
Furu Wei
ViT
VLM
35
159
0
04 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for
  Structured Document Understanding
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
17
138
0
28 Feb 2022
DocSegTr: An Instance-Level End-to-End Document Image Segmentation
  Transformer
DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
ViT
14
23
0
27 Jan 2022
Value Retrieval with Arbitrary Queries for Form-like Documents
Value Retrieval with Arbitrary Queries for Form-like Documents
M. Gao
Le Xue
Chetan Ramaiah
Chen Xing
Ran Xu
Caiming Xiong
13
6
0
15 Dec 2021
Text Classification Models for Form Entity Linking
Text Classification Models for Form Entity Linking
M. Villota
C. Domínguez
Jónathan Heras
Eloy J. Mata
Vico Pascual
MedIm
13
2
0
14 Dec 2021
Document AI: Benchmarks, Models and Applications
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
18
69
0
16 Nov 2021
12
Next