Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.04396
Cited By
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention
10 October 2019
Junyeop Lee
Sungrae Park
Jeonghun Baek
Seong Joon Oh
Seonghyeon Kim
Hwalsuk Lee
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention"
50 / 56 papers shown
Title
Text-Aware Image Restoration with Diffusion Models
Jaewon Min
J. Kim
Paul Hyunbin Cho
J. Lee
Jihye Park
Minkyu Park
S. Kim
Hyunhee Park
Seungryong Kim
48
0
0
11 Jun 2025
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
89
0
0
11 May 2025
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
210
3
0
03 Jan 2025
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
Chong Chen
98
0
0
18 Nov 2024
Improving Handwritten Text Recognition via 3D Attention and Multi-Scale Training
Zi-Rui Wang
66
0
0
24 Oct 2024
Zero-Shot Paragraph-level Handwriting Imitation with Latent Diffusion Models
Martin Mayr
Marcel Dreier
Florian Kordon
Mathias Seuret
Jochen Zöllner
Fei Wu
Andreas Maier
Vincent Christlein
DiffM
128
1
0
01 Sep 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
69
0
0
04 Aug 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
173
2
0
29 Jul 2024
Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering
Hiba Maryam
Ling Fu
Jiajun Song
Tajrian Abm Shafayet
Qidi Luo
Xiang Bai
Yuliang Liu
50
0
0
21 May 2024
The First Swahili Language Scene Text Detection and Recognition Dataset
Fadila Wendigoundi Douamba
Jianjun Song
Ling Fu
Yuliang Liu
Xiang Bai
54
0
0
19 May 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
91
5
0
24 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
105
12
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
103
3
0
18 Jan 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
62
4
0
18 Jan 2024
Watermark Text Pattern Spotting in Document Images
Mateusz Krubiński
Stefan Matcovici
Diana Grigore
Daniel Voinea
A. Popa
WaLM
57
2
0
10 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
93
6
0
29 Dec 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
104
25
0
22 Nov 2023
Relational Contrastive Learning for Scene Text Recognition
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
69
11
0
01 Aug 2023
A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Saleh Momeni
B. BabaAli
146
16
0
27 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
50
9
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
104
7
0
23 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
102
41
0
17 Jul 2023
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
75
1
0
06 Jun 2023
MSdocTr-Lite: A Lite Transformer for Full Page Multi-script Handwriting Recognition
M. Dhiaf
Ahmed Cheikh Rouhou
Yousri Kessentini
Sinda Ben Salem
67
13
0
24 Mar 2023
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
Gurkan Soykan
Deniz Yuret
T. M. Sezgin
96
5
0
27 Dec 2022
Pure Transformer with Integrated Experts for Scene Text Recognition
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
102
18
0
09 Nov 2022
Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan
Wei Shen
Xuehang Yang
Qi Feng
Zekun Jiang
Xiaokang Yang
147
25
0
01 Nov 2022
Levenshtein OCR
Cheng Da
Peng Wang
Cong Yao
ViT
129
32
0
08 Sep 2022
Multi-Granularity Prediction for Scene Text Recognition
Peng Wang
Cheng Da
Cong Yao
136
48
0
08 Sep 2022
Scene Text Recognition with Single-Point Decoding Network
Lei Chen
Haibo Qin
Shi-Xue Zhang
Chun Yang
Xucheng Yin
58
1
0
05 Sep 2022
Vision-Language Adaptive Mutual Decoder for OOV-STR
Jinshui Hu
Chenyu Liu
Qiandong Yan
Xuyang Zhu
Jiajia Wu
Feng Yu
Bing Yin
VLM
98
1
0
02 Sep 2022
Character decomposition to resolve class imbalance problem in Hangul OCR
Geonuk Kim
Jaemin Son
Kanghyu Lee
Jaesik Min
47
2
0
12 Aug 2022
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Xudong Xie
Ling Fu
Zhifei Zhang
Zhaowen Wang
X. Bai
ViT
110
46
0
31 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
108
173
0
14 Jul 2022
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
80
15
0
11 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
117
60
0
01 Jul 2022
An Evaluation of OCR on Egocentric Data
Valentin Popescu
Dima Damen
Toby Perrett
EgoV
56
0
0
11 Jun 2022
Text Detection & Recognition in the Wild for Robot Localization
Z. Raisi
John S. Zelek
68
0
0
17 May 2022
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
88
19
0
08 May 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
Xiaojie Chu
Yongtao Wang
58
2
0
06 Apr 2022
Unitail: Detecting, Reading, and Matching in Retail Scene
Fangyi Chen
Han Zhang
Zaiwang Li
Jiachen Dou
Shentong Mo
Hao Chen
Yongxin Zhang
Uzair Ahmed
Chenchen Zhu
Marios Savvides
99
9
0
01 Apr 2022
Lane detection with Position Embedding
Jun Xie
Jiacheng Han
Dezhen Qi
F. Chen
Kaer Huang
Jia Shuai
61
5
0
23 Mar 2022
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition
Wondimu Dikubab
Dingkang Liang
Minghui Liao
Xiang Bai
30
2
0
23 Mar 2022
Towards Open-Set Text Recognition via Label-to-Prototype Learning
Chang-rui Liu
Chun Yang
Haibo Qin
Xiaobin Zhu
Cheng-Lin Liu
Xu-Cheng Yin
VLM
49
34
0
10 Mar 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
114
25
0
07 Mar 2022
Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents
Ahmed Cheikh Rouhoua
M. Dhiaf
Yousri Kessentini
Sinda Ben Salem
76
49
0
08 Dec 2021
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Byeonghu Na
Yoonsik Kim
Sungrae Park
87
54
0
30 Nov 2021
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Yuefeng Tao
Zhiwei Jia
Runze Ma
Shugong Xu
ViT
45
6
0
16 Nov 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
246
375
0
21 Sep 2021
Data Augmentation for Scene Text Recognition
Rowel Atienza
67
19
0
16 Aug 2021
1
2
Next