ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.11279
  4. Cited By
Graph Convolution for Multimodal Information Extraction from Visually
  Rich Documents

Graph Convolution for Multimodal Information Extraction from Visually Rich Documents

27 March 2019
Xiaojing Liu
Feiyu Gao
Qiong Zhang
Huasha Zhao
ArXivPDFHTML

Papers citing "Graph Convolution for Multimodal Information Extraction from Visually Rich Documents"

32 / 32 papers shown
Title
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
Xiaojun Bi
Shuo Li
ziqi wang
Fuwen Luo
Weizheng Qiao
Lu Han
Ziwei Sun
Peng Li
Yang Liu
132
0
0
05 Mar 2025
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised
  Network
GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network
Panfeng Cao
Jian Wu
28
9
0
02 Oct 2024
Noise-Aware Training of Layout-Aware Language Models
Noise-Aware Training of Layout-Aware Language Models
Ritesh Sarkhel
Xiaoqi Ren
Lauro Beltrao Costa
Guolong Su
Vincent Perot
Yanan Xie
Emmanouil Koukoumidis
Arnab Nandi
VLM
46
0
0
30 Mar 2024
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing
  Learning Efficiency
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency
Azhar Shaikh
Michael Cochez
Denis Diachkov
Michiel de Rijcke
Sahar Yousefi
25
0
0
09 Nov 2023
A Graphical Approach to Document Layout Analysis
A Graphical Approach to Document Layout Analysis
Jilin Wang
Michael Krumdick
Baojia Tong
Hamima Halim
M. Sokolov
Vadym Barda
Delphine Vendryes
Christy Tanner
21
8
0
03 Aug 2023
Visual Information Extraction in the Wild: Practical Dataset and
  End-to-end Solution
Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Jianfeng Kuang
Wei Hua
Dingkang Liang
Mingkun Yang
Deqiang Jiang
Bo Ren
Xiang Bai
27
39
0
12 May 2023
DocParser: End-to-end OCR-free Information Extraction from Visually Rich
  Documents
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
M. Dhouib
G. Bettaieb
A. Shabou
17
20
0
24 Apr 2023
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for
  Document Information Extraction
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction
Jiabang He
Lei Wang
Yingpeng Hu
Ning Liu
Hui-juan Liu
Xingdong Xu
Hengtao Shen
MLLM
6
47
0
09 Mar 2023
Entry Separation using a Mixed Visual and Textual Language Model:
  Application to 19th century French Trade Directories
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
Bertrand Duménieu
Edwin Carlinet
N. Abadie
Joseph Chazalon
27
0
0
17 Feb 2023
Unimodal and Multimodal Representation Training for Relation Extraction
Unimodal and Multimodal Representation Training for Relation Extraction
Ciaran Cooney
Rachel Heyburn
Liam Maddigan
Mairead O'Cuinn
Chloe Thompson
Joana Cavadas
30
2
0
11 Nov 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document
  Understanding
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
30
11
0
18 Sep 2022
Information Extraction from Scanned Invoice Images using Text Analysis
  and Layout Features
Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features
H. Ha
Ales Horak
23
14
0
08 Aug 2022
Relational Representation Learning in Visually-Rich Documents
Relational Representation Learning in Visually-Rich Documents
Xin Li
Yan Zheng
Yiqing Hu
H. Cao
Yunfei Wu
Deqiang Jiang
Yinsong Liu
Bo Ren
18
12
0
05 May 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich
  Document Understanding
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
37
76
0
14 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for
  Structured Document Understanding
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
33
138
0
28 Feb 2022
Document AI: Benchmarks, Models and Applications
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
24
69
0
16 Nov 2021
Information Extraction from Visually Rich Documents with Font Style
  Embeddings
Information Extraction from Visually Rich Documents with Font Style Embeddings
Ismail Oussaid
William Vanhuffel
Pirashanth Ratnamogan
Mhamed Hajaiej
Alexis Mathey
Thomas Gilles
19
1
0
07 Nov 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich
  Documents
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents
Yue Zhang
Bo-Wen Zhang
Rui Wang
Junjie Cao
Chen Li
Zuyi Bao
40
32
0
19 Oct 2021
Using Neighborhood Context to Improve Information Extraction from Visual
  Documents Captured on Mobile Phones
Using Neighborhood Context to Improve Information Extraction from Visual Documents Captured on Mobile Phones
Kalpa Gunaratna
Vijay Srinivasan
Sandeep Nama
Hongxia Jin
27
5
0
23 Aug 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for
  Better Key Information Extraction from Documents
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents
Teakgyu Hong
Donghyun Kim
Mingi Ji
Wonseok Hwang
Daehyun Nam
Sungrae Park
VLM
34
150
0
10 Aug 2021
Multi-Modal Association based Grouping for Form Structure Extraction
Multi-Modal Association based Grouping for Form Structure Extraction
Milan Aggarwal
Mausoom Sarkar
Hiresh Gupta
Balaji Krishnamurthy
17
10
0
09 Jul 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for
  Key Information Extraction from Documents
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
31
37
0
25 May 2021
VSR: A Unified Framework for Document Layout Analysis combining Vision,
  Semantics and Relations
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations
Peng Zhang
Can Li
Liang Qiao
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Fei Wu
31
57
0
13 May 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout
  Transformer
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafal Powalski
Łukasz Borchmann
Dawid Jurkiewicz
Tomasz Dwojak
Michal Pietruszka
Gabriela Pałka
ViT
36
157
0
18 Feb 2021
Towards Robust Visual Information Extraction in Real World: New Dataset
  and Novel Solution
Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Jiapeng Wang
Chongyu Liu
Lianwen Jin
Guozhi Tang
Jiaxin Zhang
Shuaitao Zhang
Qianying Wang
Y. Wu
Mingxiang Cai
28
82
0
24 Jan 2021
Learning Graph Normalization for Graph Neural Networks
Learning Graph Normalization for Graph Neural Networks
Yihao Chen
Xin Tang
Xianbiao Qi
Chun-Guang Li
Rong Xiao
AI4CE
14
50
0
24 Sep 2020
ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured
  Webpages
ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages
Colin Lockard
Prashant Shiralkar
Xin Luna Dong
Hannaneh Hajishirzi
18
53
0
14 May 2020
Text Recognition in the Wild: A Survey
Text Recognition in the Wild: A Survey
Xiaoxue Chen
Lianwen Jin
Yuanzhi Zhu
Canjie Luo
Tianwei Wang
3DV
27
102
0
07 May 2020
Kleister: A novel task for Information Extraction involving Long
  Documents with Complex Layout
Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout
Filip Graliñski
Tomasz Stanislawek
Anna Wróblewska
Dawid Lipiñski
Agnieszka Kaliska
Paulina Rosalska
Bartosz Topolski
P. Biecek
28
40
0
04 Mar 2020
LAMBERT: Layout-Aware (Language) Modeling for information extraction
LAMBERT: Layout-Aware (Language) Modeling for information extraction
Lukasz Garncarek
Rafal Powalski
Tomasz Stanislawek
Bartosz Topolski
Piotr Halama
M. Turski
Filip Graliñski
8
87
0
19 Feb 2020
Image Generation from Scene Graphs
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
223
815
0
04 Apr 2018
Convolutional Neural Networks for Sentence Classification
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
255
13,364
0
25 Aug 2014
1