2006.00923
Multimodal grid features and cell pointers for Scene Text Visual Question Answering
1 June 2020
Lluís Gómez
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Marçal Rusiñol
Ernest Valveny
Dimosthenis Karatzas
Papers citing "Multimodal grid features and cell pointers for Scene Text Visual Question Answering" (2 of 2 shown)
OCR-IDL: OCR Annotations for Industry Document Library Dataset
Ali Furkan Biten
Rubèn Pérez Tito
Lluís Gómez
Ernest Valveny
Dimosthenis Karatzas
25 Feb 2022
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten
Ron Litman
Yusheng Xie
Srikar Appalaraju
R. Manmatha
23 Dec 2021