1811.00751
Cited By
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition
2 November 2018
Hui Li
Peng Wang
Chunhua Shen
Guyu Zhang
Papers citing
"Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition"
50 / 154 papers shown
Title
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
31
0
0
07 May 2025
From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction
V. Golovkin
Nikolay Nemtsev
Vasyl Shandyba
Oleg Udin
Nikita Kasatkin
Pavel Kononov
Anton Afanasiev
Sergey Ulasen
Andrei Boiarov
31
0
0
08 Apr 2025
A Lightweight Multi-Module Fusion Approach for Korean Character Recognition
Inho Jake Park
Jaehoon Jay Jeong
Ho-Sang Jo
38
0
0
08 Apr 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
59
0
0
24 Mar 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
48
0
0
10 Feb 2025
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
75
3
0
03 Jan 2025
DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Ser-Nam Lim
R. Ramnath
70
1
0
29 Nov 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu-Gang Jiang
90
1
0
24 Nov 2024
SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition
Jingyang Zhang
Chang-rui Liu
Chun Yang
31
2
0
10 Nov 2024
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials
Alexander Naumann
Felix Hertlein
Jacqueline Höllig
Lucas Cazzonelli
Steffen Thoma
31
0
0
27 Sep 2024
Decoder Pre-Training with only Text for Scene Text Recognition
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
43
0
0
11 Aug 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
36
0
0
04 Aug 2024
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction
Liang Zhao
Qing Guo
Xiaoguang Li
Song Wang
DiffM
44
0
0
23 Jul 2024
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
63
2
0
17 Jul 2024
Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Daiqing Wu
Dongbao Yang
Huawen Shen
Can Ma
Yu Zhou
45
4
0
09 Jul 2024
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Jiefeng Ma
Yan Wang
Chenyu Liu
Jun Du
Yu Hu
Zhenrong Zhang
Pengfei Hu
Qing Wang
Jianshu Zhang
36
0
0
13 Jun 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
48
0
0
29 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
40
0
0
15 May 2024
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap
Vladimir Somers
Victor Joos
A. Cioppa
Silvio Giancola
Seyed Abolfazl Ghasemzadeh
...
S. Kasaei
Guohao Li
Alexandre Alahi
Marc Van Droogenbroeck
Christophe De Vleeschouwer
34
23
0
17 Apr 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
34
0
0
01 Apr 2024
Global License Plate Dataset
Siddharth Agrawal
32
1
0
22 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
29
3
0
15 Mar 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
49
1
0
27 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
80
10
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
37
2
0
18 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
34
6
0
08 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
41
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Test-Time Augmentation for 3D Point Cloud Classification and Segmentation
Tuan-Anh Vu
Srinjay Sarkar
Zhiyuan Zhang
Binh-Son Hua
Sai-Kit Yeung
3DPC
37
1
0
22 Nov 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
34
23
0
22 Nov 2023
Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Y. Sun
Anna M. Kruspe
L. Meng
Y. Tian
E. J. Hoffmann
S. Auer
X. X. Zhu
33
1
0
14 Sep 2023
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
VLM
18
17
0
03 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
30
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
56
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
45
15
0
24 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
36
11
0
17 Aug 2023
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
37
7
0
23 Jul 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Han Guo
Tao Dai
G. Meng
Shutao Xia
26
11
0
19 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
33
39
0
17 Jul 2023
Writer adaptation for offline text recognition: An exploration of neural network-based methods
Tobias van der Werff
Maruf A. Dhali
Lambert Schomaker
47
0
0
11 Jul 2023
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
32
1
0
06 Jun 2023
Perception and Semantic Aware Regularization for Sequential Confidence Calibration
Zhenghua Peng
Yuanmao Luo
Tianshui Chen
Keke Xu
Shuangping Huang
AI4TS
30
2
0
31 May 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
37
7
0
25 May 2023
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
Tianlun Zheng
Zhineng Chen
Jinfeng Bai
Hongtao Xie
Yu-Gang Jiang
27
18
0
09 May 2023
Scene Text Recognition with Image-Text Matching-guided Dictionary
Jiajun Wei
Hongjian Zhan
X. Tu
Yue Lu
Umapada Pal
VLM
17
0
0
08 May 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
24
76
0
11 Apr 2023
Weakly-Supervised Text Instance Segmentation
Xinyan Zu
Haiyang Yu
Bin Li
Xiangyang Xue
ISeg
52
6
0
20 Mar 2023
Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition
Xueming Yan
Zhihang Fang
Yaochu Jin
ViT
33
1
0
28 Feb 2023
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
Gurkan Soykan
Deniz Yuret
T. M. Sezgin
27
3
0
27 Dec 2022