ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction


18 March 2021
Zheng Huang
Kai Chen
Jianhua He
X. Bai
Dimosthenis Karatzas
Shijian Lu
C. V. Jawahar

Papers citing "ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction"

50 / 186 papers shown
Long-Range Transformer Architectures for Document Understanding
Thibault Douzon
S. Duffner
Christophe Garcia
Jérémy Espinas
VLM
34
2
0
11 Sep 2023
Improving Information Extraction on Business Documents with Specific Pre-Training Tasks
Thibault Douzon
S. Duffner
Christophe Garcia
Jérémy Espinas
27
6
0
11 Sep 2023
ImageBind-LLM: Multi-modality Instruction Tuning
Jiaming Han
Renrui Zhang
Wenqi Shao
Peng Gao
Peng Xu
...
Yafei Wen
Xiaoxin Chen
Xiangyu Yue
Hongsheng Li
Yu Qiao
MLLM
54
117
0
07 Sep 2023
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
H. Cao
Changcun Bao
Chaohu Liu
Huang-wei Chen
Kun Yin
Hao Liu
Yinsong Liu
Deqiang Jiang
Xing Sun
28
13
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
66
35
0
30 Aug 2023
Universal Graph Continual Learning
Thanh Duc Hoang
Do Viet Tung
Duy-Hung Nguyen
Bao-Sinh Nguyen
Huy Hoang Nguyen
Hung Le
CLL
31
3
0
27 Aug 2023
Beyond Document Page Classification: Design, Datasets, and Challenges
Jordy Van Landeghem
Sanket Biswas
Matthew B. Blaschko
Marie-Francine Moens
45
6
0
24 Aug 2023
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Wenbo Hu
Y. Xu
Yuante Li
W. Li
Zhengzhang Chen
Zhuowen Tu
MLLM
VLM
30
123
0
19 Aug 2023
Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Wenqi Shao
Yutao Hu
Peng Gao
Meng Lei
Kaipeng Zhang
...
Peng Xu
Siyuan Huang
Hongsheng Li
Yu Qiao
Ping Luo
VLM
MLLM
37
2
0
07 Aug 2023
Workshop on Document Intelligence Understanding
S. Han
Yihao Ding
Siwen Luo
J. Poon
HeeGuen Yoon
Zhe Huang
P. Duuring
E. Holden
29
0
0
31 Jul 2023
MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary
Beiya Dai
Xingbiao Li
Qunyi Xie
Yulin Li
Xiameng Qin
Chengquan Zhang
Kun Yao
Junyu Han
39
1
0
24 Jul 2023
Line Graphics Digitization: A Step Towards Full Automation
Omar Moured
Jiaming Zhang
Alina Roitberg
Thorsten Schwarz
Rainer Stiefelhagen
16
4
0
05 Jul 2023
Estimating Post-OCR Denoising Complexity on Numerical Texts
Arthur Hemmer
Jérôme Brachat
Mickael Coustaty
J. Ogier
22
3
0
03 Jul 2023
Document Image Cleaning using Budget-Aware Black-Box Approximation
Ganesh Tata
Katyani Singh
E. V. Oeveren
Nilanjan Ray
AAML
23
0
0
22 Jun 2023
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Peng Xu
Wenqi Shao
Kaipeng Zhang
Peng Gao
Shuo Liu
Meng Lei
Fanqing Meng
Siyuan Huang
Yu Qiao
Ping Luo
ELM
MLLM
41
159
0
15 Jun 2023
DocumentNet: Bridging the Data Gap in Document Pre-Training
Lijun Yu
Jin Miao
Xiaoyu Sun
Jiayi Chen
Alexander G. Hauptmann
H. Dai
Wei Wei
24
3
0
15 Jun 2023
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
43
1
0
06 Jun 2023
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Wenwen Yu
Chengquan Zhang
H. Cao
Wei Hua
Bohan Li
...
Hao Fei
Dimosthenis Karatzas
Xingchao Sun
Jingdong Wang
Xiang Bai
36
11
0
05 Jun 2023
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding
Yi Tu
Ya Guo
Huan Chen
Jinyang Tang
31
15
0
30 May 2023
RE$^2$: Region-Aware Relation Extraction from Visually Rich Documents
Pritika Ramu
Sijia Wang
Lalla Mouatadid
Joy Rimchala
Lifu Huang
38
0
0
24 May 2023
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Xing Zheng
Yaohui Li
Changhua Meng
Huijia Zhu
Weiqiang Wang
DiffM
38
34
0
18 May 2023
Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Jianfeng Kuang
Wei Hua
Dingkang Liang
Mingkun Yang
Deqiang Jiang
Bo Ren
Xiang Bai
29
39
0
12 May 2023
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Chen-Yu Lee
Chun-Liang Li
Hao Zhang
Timothy Dozat
Vincent Perot
...
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Nan Hua
Tomas Pfister
SSL
48
19
0
04 May 2023
Information Redundancy and Biases in Public Document Information Extraction Benchmarks
S. Laatiri
Pirashanth Ratnamogan
Joel Tang
Laurent Lam
William Vanhuffel
Fabien Caspani
33
1
0
28 Apr 2023
Information Extraction from Documents: Question Answering vs Token Classification in real-world setups
Laurent Lam
Pirashanth Ratnamogan
Joel Tang
William Vanhuffel
Fabien Caspani
29
0
0
21 Apr 2023
A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Kai Hu
Zhuoyuan Wu
Zhuoyao Zhong
Weihong Lin
Lei-huan Sun
Qiang Huo
28
11
0
17 Apr 2023
PDFVQA: A New Dataset for Real-World VQA on PDF Documents
Yihao Ding
Siwen Luo
Hyunsuk Chung
S. Han
33
17
0
13 Apr 2023
Efficient OCR for Building a Diverse Digital History
Jacob Carlson
Tom Bryan
Melissa Dell
43
11
0
05 Apr 2023
Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models
Yi-Hsien Liu
Tianle Han
Siyuan Ma
Jia-Yu Zhang
Yuanyu Yang
...
Xiang Li
Ning Qiang
Dingang Shen
Tianming Liu
Bao Ge
ALM
ELM
AI4CE
LM&MA
LLMAG
45
464
0
04 Apr 2023
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Zhibo Yang
Rujiao Long
Pengfei Wang
Sibo Song
Humen Zhong
Wenqing Cheng
X. Bai
Cong Yao
41
22
0
23 Mar 2023
Automatic Detection and Rectification of Paper Receipts on Smartphones
Edward Whittaker
Masashi Tanaka
I. Kitagishi
9
0
0
10 Mar 2023
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
Jiabang He
Lei Wang
Yingpeng Hu
Ning Liu
Hui-juan Liu
Xingdong Xu
Hengtao Shen
MLLM
11
46
0
09 Mar 2023
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
Bertrand Duménieu
Edwin Carlinet
N. Abadie
Joseph Chazalon
31
0
0
17 Feb 2023
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
42
26
0
01 Feb 2023
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization
Laura Nguyen
Thomas Scialom
Benjamin Piwowarski
Jacopo Staiano
29
7
0
26 Jan 2023
An Augmentation Strategy for Visually Rich Documents
Jing Xie
James Bradley Wendt
Yichao Zhou
Seth Ebner
Sandeep Tata
21
0
0
20 Dec 2022
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding
Haoli Bai
Zhiguang Liu
Xiaojun Meng
Wentao Li
Shuangning Liu
...
Liangwei Wang
Lu Hou
Jiansheng Wei
Xin Jiang
Qun Liu
ViT
37
13
0
19 Dec 2022
VRDU: A Benchmark for Visually-rich Document Understanding
Zilong Wang
Yichao Zhou
Wei Wei
Chen-Yu Lee
Sandeep Tata
32
15
0
15 Nov 2022
DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-Loop
Neelesh K Shukla
Msp Raja
Raghu Katikeri
Amit Vaid
23
1
0
09 Nov 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
Qiming Peng
Yinxu Pan
Wenjin Wang
Bin Luo
Zhenyu Zhang
...
Shi Feng
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
15
83
0
12 Oct 2022
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections
R. Arroyo
J. Yebes
E. Martínez
Hector Corrales
Javier Lorenzo
37
1
0
07 Oct 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
Yu-Chung Hsiao
Fedir Zubach
Victor Carbune
Jason Lin
Maria Wang
Yun Zhu
Jindong Chen
RALM
160
26
0
16 Sep 2022
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks
Andrea Gemelli
Sanket Biswas
Enrico Civitelli
Josep Lladós
S. Marinai
27
16
0
23 Aug 2022
Towards Complex Document Understanding By Discrete Reasoning
Fengbin Zhu
Wenqiang Lei
Fuli Feng
Chao Wang
Haozhou Zhang
Tat-Seng Chua
33
43
0
25 Jul 2022
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Zhanzhan Cheng
Peng Zhang
Can Li
Qiao Liang
Yunlu Xu
Pengfei Li
Shiliang Pu
Yi Niu
Fei Wu
23
10
0
14 Jul 2022
Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration
Zhenyu Zhang
Yu Bowen
Haiyang Yu
Tingwen Liu
Cheng Fu
Jingyang Li
Chengguang Tang
Jian Sun
Yongbin Li
44
6
0
14 Jul 2022
GMN: Generative Multi-modal Network for Practical Document Information Extraction
H. Cao
Jiefeng Ma
Antai Guo
Yiqing Hu
Hao Liu
Deqiang Jiang
Yinsong Liu
Bo Ren
26
8
0
11 Jul 2022
BusiNet -- a Light and Fast Text Detection Network for Business Documents
O. Naparstek
Ophir Azulai
Daniel Rotman
Yevgeny Burshtein
Peter W. J. Staar
Udi Barzelay
27
2
0
04 Jul 2022
Business Document Information Extraction: Towards Practical Benchmarks
Matyáš Skalický
Štěpán Šimsa
Michal Uřičář
Milan Šulc
33
9
0
20 Jun 2022
Test-Time Adaptation for Visual Document Understanding
Sayna Ebrahimi
Sercan O. Arik
Tomas Pfister
OOD
38
6
0
15 Jun 2022