1910.05085
Rosetta: Large scale system for text detection and recognition in images
11 October 2019
Fedor Borisyuk
Albert Gordo
V. Sivakumar
Papers citing
"Rosetta: Large scale system for text detection and recognition in images"
37 / 37 papers shown
Title
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
41
6
0
29 Dec 2023
Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs
Jiarui Zhang
Mahyar Khayatkhoei
P. Chhikara
Filip Ilievski
37
2
0
24 Oct 2023
Making the V in Text-VQA Matter
Shamanthak Hegde
Soumya Jahagirdar
Shankar Gangisetty
CoGe
37
4
0
01 Aug 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
28
25
0
23 May 2023
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature
Ana Claudia Akemi Matsuki de Faria
Felype de Castro Bastos
Jose Victor Nogueira Alves da Silva
Vitor Lopes Fabris
Valeska Uchôa
Décio Gonçalves de Aguiar Neto
C. F. G. Santos
30
23
0
18 May 2023
Weakly-supervised Fingerspelling Recognition in British Sign Language Videos
Prajwal K R
Hannah Bull
Liliane Momeni
Samuel Albanie
Gül Varol
Andrew Zisserman
29
14
0
16 Nov 2022
PromptCap: Prompt-Guided Task-Aware Image Captioning
Yushi Hu
Hang Hua
Zhengyuan Yang
Weijia Shi
Noah A. Smith
Jiebo Luo
51
101
0
15 Nov 2022
Scene Text Recognition with Semantics
Joshua Cesare Placidi
Yishu Miao
Zixu Wang
Lucia Specia
23
1
0
19 Oct 2022
DM²S²: Deep Multi-Modal Sequence Sets with Hierarchical Modality Attention
Shunsuke Kitada
Yuki Iwazaki
Riku Togashi
Hitoshi Iyatomi
21
1
0
07 Sep 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
28
169
0
14 Jul 2022
SVTR: Scene Text Recognition with a Single Visual Model
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
24
170
0
30 Apr 2022
On the Cross-dataset Generalization in License Plate Recognition
Rayson Laroca
Everton Vilhena Cardoso
D. Lucio
Valter Estevam
David Menotti
31
42
0
02 Jan 2022
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning
Wenqiao Zhang
Haochen Shi
Jiannan Guo
Shengyu Zhang
Qingpeng Cai
Juncheng Li
Sihui Luo
Yueting Zhuang
DiffM
26
46
0
13 Dec 2021
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages
Shota Orihashi
Yoshihiro Yamazaki
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Ryo Masumura
40
1
0
24 Nov 2021
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models
J. Yoon
H. Kim
Hyeon Seung Lee
Sunghwan Ahn
N. Kim
38
1
0
05 Nov 2021
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Xiaopeng Lu
Zhenhua Fan
Yansen Wang
Jean Oh
Carolyn Rose
27
27
0
20 Aug 2021
Data Augmentation for Scene Text Recognition
Rowel Atienza
38
19
0
16 Aug 2021
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Zhanghui Kuang
Hongbin Sun
Zhizhong Li
Xiaoyu Yue
T. Lin
...
Tong Gao
Wenwei Zhang
Kai-xiang Chen
Wayne Zhang
Dahua Lin
VLM
26
71
0
14 Aug 2021
Vision Transformer for Fast and Efficient Scene Text Recognition
Rowel Atienza
ViT
25
144
0
18 May 2021
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Amanpreet Singh
Guan Pang
Mandy Toh
Jing Huang
Wojciech Galuba
Tal Hassner
19
164
0
12 May 2021
Towards Accurate Text-based Image Captioning with Content Diversity Exploration
Guanghui Xu
Shuaicheng Niu
Mingkui Tan
Yucheng Luo
Qing Du
Qi Wu
DiffM
22
56
0
23 Apr 2021
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
Joakim Bruslund Haurum
T. Moeslund
20
60
0
19 Mar 2021
Revisiting Classification Perspective on Scene Text Recognition
Hongxiang Cai
Jun Sun
Yichao Xiong
24
10
0
22 Feb 2021
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues
A. Alam
I. Ullah
Young-Koo Lee
42
22
0
16 Nov 2020
Towards Image-based Automatic Meter Reading in Unconstrained Scenarios: A Robust and Efficient Approach
Rayson Laroca
Alessandra B. Araujo
L. A. Zanlorensi
E. Almeida
David Menotti
15
35
0
21 Sep 2020
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
27
25
0
21 Sep 2020
Text Detection and Recognition in the Wild: A Review
Z. Raisi
Mohamed A. Naiel
Paul Fieguth
Steven Wardell
John S. Zelek
37
35
0
08 Jun 2020
SCATTER: Selective Context Attentional Scene Text Recognizer
Ron Litman
Oron Anschel
Shahar Tsiper
R. Litman
Shai Mazor
R. Manmatha
21
132
0
25 Mar 2020
TextCaps: a Dataset for Image Captioning with Reading Comprehension
Oleksii Sidorov
Ronghang Hu
Marcus Rohrbach
Amanpreet Singh
25
390
0
24 Mar 2020
Deep Multi-Modal Sets
A. Reiter
Menglin Jia
Pu Yang
Ser-Nam Lim
BDL
25
4
0
03 Mar 2020
Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Ernest Valveny
C. V. Jawahar
Dimosthenis Karatzas
36
343
0
31 May 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
143
356
0
27 May 2019
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis
Jeonghun Baek
Geewook Kim
Junyeop Lee
Sungrae Park
Dongyoon Han
Sangdoo Yun
Seong Joon Oh
Hwalsuk Lee
349
475
0
03 Apr 2019
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Jongsoo Park
Maxim Naumov
Protonu Basu
Summer Deng
Aravind Kalaiah
...
Lin Qiao
Vijay Rao
Nadav Rotem
S. Yoo
M. Smelyanskiy
FedML
GNN
BDL
20
186
0
24 Nov 2018
Scene Text Detection and Recognition: The Deep Learning Era
Shangbang Long
Xin He
Cong Yao
VLM
44
389
0
10 Nov 2018
Scene Text Recognition with Sliding Convolutional Character Models
Fei Yin
Yi-Chao Wu
Xu-Yao Zhang
Cheng-Lin Liu
VLM
3DV
58
77
0
06 Sep 2017
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukáš Neumann
Jiří Matas
Serge J. Belongie
188
515
0
26 Jan 2016