Going Full-TILT Boogie on Document Understanding with Text-Image-Layout
Transformer

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

18 February 2021

Łukasz Borchmann

Dawid Jurkiewicz

Michal Pietruszka

Gabriela Pałka

Papers citing "Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer"

17 / 117 papers shown

Title
DiT: Self-supervised Pre-training for Document Image Transformer Junlong Li Yiheng Xu Tengchao Lv Lei Cui Chaoxi Zhang Furu Wei ViT VLM 35 159 0 04 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding Jiapeng Wang Lianwen Jin Kai Ding VLM 30 138 0 28 Feb 2022
OCR-IDL: OCR Annotations for Industry Document Library Dataset Ali Furkan Biten Rubèn Pérez Tito Lluís Gómez Ernest Valveny Dimosthenis Karatzas 25 26 0 25 Feb 2022
Recognition-free Question Answering on Handwritten Document Collections Oliver Tüselmann Friedrich Müller Fabian Wolf G. Fink RALM 8 4 0 12 Feb 2022
WebFormer: The Web-page Transformer for Structure Information Extraction Qifan Wang Yi Fang Anirudh Ravula Fuli Feng Xiaojun Quan Dongfang Liu ViT 141 65 0 01 Feb 2022
OCR-free Document Understanding Transformer Geewook Kim Teakgyu Hong Moonbin Yim Jeongyeon Nam Jinyoung Park Jinyeong Yim Wonseok Hwang Sangdoo Yun Dongyoon Han Seunghyun Park ViT 50 262 0 30 Nov 2021
Document AI: Benchmarks, Models and Applications Lei Cui Yiheng Xu Tengchao Lv Furu Wei VLM 21 69 0 16 Nov 2021
ICDAR 2021 Competition on Document VisualQuestion Answering Rubèn Pérez Tito Minesh Mathew C. V. Jawahar Ernest Valveny Dimosthenis Karatzas 35 23 0 10 Nov 2021
Information Extraction from Visually Rich Documents with Font Style Embeddings Ismail Oussaid William Vanhuffel Pirashanth Ratnamogan Mhamed Hajaiej Alexis Mathey Thomas Gilles 19 1 0 07 Nov 2021
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding Junlong Li Yiheng Xu Lei Cui Furu Wei VLM 3DGS 28 59 0 16 Oct 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Teakgyu Hong Donghyun Kim Mingi Ji Wonseok Hwang Daehyun Nam Sungrae Park VLM 34 150 0 10 Aug 2021
DocFormer: End-to-End Transformer for Document Understanding Srikar Appalaraju Bhavan A. Jasani Bhargava Urala Kota Yusheng Xie R. Manmatha ViT 32 270 0 22 Jun 2021
InfographicVQA Minesh Mathew Viraj Bagal Rubèn Pérez Tito Dimosthenis Karatzas Ernest Valveny C. V. Jawahar 22 203 0 26 Apr 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding Yang Xu Yiheng Xu Tengchao Lv Lei Cui Furu Wei ... D. Florêncio Cha Zhang Wanxiang Che Min Zhang Lidong Zhou ViT MLLM 153 498 0 29 Dec 2020
From Dataset Recycling to Multi-Property Extraction and Beyond Tomasz Dwojak Michal Pietruszka Łukasz Borchmann Jakub Chlkedowski Filip Graliñski 42 5 0 06 Nov 2020
Spatial Dependency Parsing for Semi-Structured Document Information Extraction Wonseok Hwang Jinyeong Yim Seunghyun Park Sohee Yang Minjoon Seo 34 92 0 01 May 2020
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents Guillaume Jaume H. K. Ekenel Jean-Philippe Thiran 134 355 0 27 May 2019