2205.02411
Relational Representation Learning in Visually-Rich Documents
5 May 2022
Xin Li
Yan Zheng
Yiqing Hu
H. Cao
Yunfei Wu
Deqiang Jiang
Yinsong Liu
Bo Ren
Papers citing
"Relational Representation Learning in Visually-Rich Documents"
36 / 36 papers shown
Title
Unified Pretraining Framework for Document Understanding
Jiuxiang Gu
Jason Kuen
Vlad I. Morariu
Handong Zhao
Nikolaos Barmpalios
R. Jain
A. Nenkova
Tong Sun
54
96
0
22 Apr 2022
CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Licheng Yu
Jun Chen
Animesh Sinha
Mengjiao MJ Wang
Hugo Chen
Tamara L. Berg
Ning Zhang
VLM
51
39
0
15 Feb 2022
Neural Collaborative Graph Machines for Table Structure Recognition
Hao Liu
Xin Li
Bin Liu
Deqiang Jiang
Yinsong Liu
Bo Ren
LMTD
47
32
0
26 Nov 2021
Parsing Table Structures in the Wild
Rujiao Long
Wen Wang
Nan Xue
Feiyu Gao
Zhibo Yang
Yongpan Wang
Gui-Song Xia
LMTD
43
50
0
06 Sep 2021
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang
Yiheng Xu
Lei Cui
Jingbo Shang
Furu Wei
44
75
0
26 Aug 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents
Teakgyu Hong
Donghyun Kim
Mingi Ji
Wonseok Hwang
Daehyun Nam
Sungrae Park
VLM
49
153
0
10 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
56
116
0
06 Aug 2021
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Junnan Li
Ramprasaath R. Selvaraju
Akhilesh Deepak Gotmare
Shafiq Joty
Caiming Xiong
Steven Hoi
FaML
142
1,915
0
16 Jul 2021
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
57
274
0
22 Jun 2021
Document-level Relation Extraction as Semantic Segmentation
Ningyu Zhang
Xiang Chen
Xin Xie
Shumin Deng
Chuanqi Tan
Mosha Chen
Fei Huang
Luo Si
Huajun Chen
ViT
49
183
0
07 Jun 2021
StructuralLM: Structural Pre-training for Form Understanding
Chenliang Li
Bin Bi
Ming Yan
Wei Wang
Songfang Huang
Fei Huang
Luo Si
LMTD
AI4CE
57
132
0
24 May 2021
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding
Te-Lin Wu
Cheng-rong Li
Mingyang Zhang
Tao Chen
Spurthi Amba Hombaiah
Michael Bendersky
33
14
0
16 Apr 2021
An Empirical Study of Training Self-Supervised Vision Transformers
Xinlei Chen
Saining Xie
Kaiming He
ViT
101
1,837
0
05 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
548
28,659
0
26 Feb 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
165
507
0
29 Dec 2020
Table Structure Recognition using Top-Down and Bottom-Up Cues
S. Raja
Ajoy Mondal
C. V. Jawahar
LMTD
34
77
0
09 Oct 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
247
6,718
0
13 Jun 2020
TRIE: End-to-End Text Reading and Information Extraction for Document Understanding
Peng Zhang
Yunlu Xu
Zhanzhan Cheng
Shiliang Pu
Jing Lu
Liang Qiao
Yi Niu
Fei Wu
SyDa
48
102
0
27 May 2020
Robust Layout-aware IE for Visually Rich Documents with Pre-trained Language Models
Mengxi Wei
Yifan He
Qiong Zhang
VLM
22
39
0
22 May 2020
Spatial Dependency Parsing for Semi-Structured Document Information Extraction
Wonseok Hwang
Jinyeong Yim
Seunghyun Park
Sohee Yang
Minjoon Seo
51
94
0
01 May 2020
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
408
3,397
0
09 Mar 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
158
18,523
0
13 Feb 2020
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Yiheng Xu
Minghao Li
Lei Cui
Shaohan Huang
Furu Wei
Ming Zhou
92
694
0
31 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
83
11,959
0
13 Nov 2019
PubLayNet: largest dataset ever for document layout analysis
Xu Zhong
Jianbin Tang
Antonio Jimeno Yepes
24
454
0
16 Aug 2019
Complicated Table Structure Recognition
Zewen Chi
Heyan Huang
Heng-Da Xu
Houjin Yu
Wanxuan Yin
Xian-Ling Mao
LMTD
37
108
0
13 Aug 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
151
363
0
27 May 2019
Graph Convolution for Multimodal Information Extraction from Visually Rich Documents
Xiaojing Liu
Feiyu Gao
Qiong Zhang
Huasha Zhao
53
183
0
27 Mar 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
114
3,707
0
09 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
815
93,936
0
11 Oct 2018
Reinforcement Learning for Relation Classification from Noisy Data
Jun Feng
Minlie Huang
Li Zhao
Yang Yang
Xiaoyan Zhu
NoLa
40
340
0
24 Aug 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
324
129,831
0
12 Jun 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
253
27,018
0
20 Mar 2017
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
389
21,951
0
09 Dec 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
785
6,768
0
26 Sep 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
1.1K
192,638
0
10 Dec 2015