Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition

2 November 2018
Hui Li
Peng Wang
Chunhua Shen
Guyu Zhang

Papers citing "Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition"

50 / 154 papers shown
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
31
0
0
07 May 2025
From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction
V. Golovkin
Nikolay Nemtsev
Vasyl Shandyba
Oleg Udin
Nikita Kasatkin
Pavel Kononov
Anton Afanasiev
Sergey Ulasen
Andrei Boiarov
31
0
0
08 Apr 2025
A Lightweight Multi-Module Fusion Approach for Korean Character Recognition
Inho Jake Park
Jaehoon Jay Jeong
Ho-Sang Jo
38
0
0
08 Apr 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
59
0
0
24 Mar 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
48
0
0
10 Feb 2025
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
75
3
0
03 Jan 2025
DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Ser-Nam Lim
R. Ramnath
70
1
0
29 Nov 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu-Gang Jiang
90
1
0
24 Nov 2024
SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition
Jingyang Zhang
Chang-rui Liu
Chun Yang
31
2
0
10 Nov 2024
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials
Alexander Naumann
Felix Hertlein
Jacqueline Höllig
Lucas Cazzonelli
Steffen Thoma
31
0
0
27 Sep 2024
Decoder Pre-Training with only Text for Scene Text Recognition
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
43
0
0
11 Aug 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
36
0
0
04 Aug 2024
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction
Liang Zhao
Qing Guo
Xiaoguang Li
Song Wang
DiffM
44
0
0
23 Jul 2024
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
63
2
0
17 Jul 2024
Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Daiqing Wu
Dongbao Yang
Huawen Shen
Can Ma
Yu Zhou
45
4
0
09 Jul 2024
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Jiefeng Ma
Yan Wang
Chenyu Liu
Jun Du
Yu Hu
Zhenrong Zhang
Pengfei Hu
Qing Wang
Jianshu Zhang
36
0
0
13 Jun 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
48
0
0
29 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
40
0
0
15 May 2024
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap
Vladimir Somers
Victor Joos
A. Cioppa
Silvio Giancola
Seyed Abolfazl Ghasemzadeh
...
S. Kasaei
Guohao Li
Alexandre Alahi
Marc Van Droogenbroeck
Christophe De Vleeschouwer
34
23
0
17 Apr 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
34
0
0
01 Apr 2024
Global License Plate Dataset
Siddharth Agrawal
32
1
0
22 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
29
3
0
15 Mar 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
49
1
0
27 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
80
10
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
37
2
0
18 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
34
6
0
08 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
41
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Test-Time Augmentation for 3D Point Cloud Classification and Segmentation
Tuan-Anh Vu
Srinjay Sarkar
Zhiyuan Zhang
Binh-Son Hua
Sai-Kit Yeung
3DPC
37
1
0
22 Nov 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
34
23
0
22 Nov 2023
Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Y. Sun
Anna M. Kruspe
L. Meng
Y. Tian
E. J. Hoffmann
S. Auer
X. X. Zhu
33
1
0
14 Sep 2023
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
VLM
18
17
0
03 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
30
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
56
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
45
15
0
24 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
36
11
0
17 Aug 2023
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
37
7
0
23 Jul 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Han Guo
Tao Dai
G. Meng
Shutao Xia
26
11
0
19 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
33
39
0
17 Jul 2023
Writer adaptation for offline text recognition: An exploration of neural network-based methods
Tobias van der Werff
Maruf A. Dhali
Lambert Schomaker
47
0
0
11 Jul 2023
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
32
1
0
06 Jun 2023
Perception and Semantic Aware Regularization for Sequential Confidence Calibration
Zhenghua Peng
Yuanmao Luo
Tianshui Chen
Keke Xu
Shuangping Huang
AI4TS
30
2
0
31 May 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
37
7
0
25 May 2023
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
Tianlun Zheng
Zhineng Chen
Jinfeng Bai
Hongtao Xie
Yu-Gang Jiang
27
18
0
09 May 2023
Scene Text Recognition with Image-Text Matching-guided Dictionary
Jiajun Wei
Hongjian Zhan
X. Tu
Yue Lu
Umapada Pal
VLM
17
0
0
08 May 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
24
76
0
11 Apr 2023
Weakly-Supervised Text Instance Segmentation
Xinyan Zu
Haiyang Yu
Bin Li
Xiangyang Xue
ISeg
52
6
0
20 Mar 2023
Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition
Xueming Yan
Zhihang Fang
Yaochu Jin
ViT
33
1
0
28 Feb 2023
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
Gurkan Soykan
Deniz Yuret
T. M. Sezgin
27
3
0
27 Dec 2022