Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.12029
Cited By
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
24 May 2022
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marccal Rusinol
O. R. Terrades
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification"
5 / 5 papers shown
Title
On Evaluation of Document Classification using RVL-CDIP
Stefan Larson
Gordon Lim
Kevin Leach
36
3
0
21 Jun 2023
RegCLR: A Self-Supervised Framework for Tabular Representation Learning in the Wild
Weiyao Wang
Byung-Hak Kim
Varun Ganapathi
SSL
LMTD
27
1
0
02 Nov 2022
Evaluating Out-of-Distribution Performance on Document Image Classifiers
Stefan Larson
Gordon Lim
Yutong Ai
David Kuang
Kevin Leach
OODD
OOD
37
18
0
14 Oct 2022
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
153
498
0
29 Dec 2020
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
Madhav Agarwal
Ajoy Mondal
C. V. Jawahar
45
62
0
25 Aug 2020
1