ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.12422
  4. Cited By
TextScanner: Reading Characters in Order for Robust Scene Text
  Recognition

TextScanner: Reading Characters in Order for Robust Scene Text Recognition

28 December 2019
Zhaoyi Wan
Minghang He
Haoran Chen
X. Bai
Cong Yao
ArXivPDFHTML

Papers citing "TextScanner: Reading Characters in Order for Robust Scene Text Recognition"

50 / 68 papers shown
Title
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
26
0
0
07 May 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Yifei Zhang
Chang-Shu Liu
Jin Wei
Xiaomeng Yang
Yu Zhou
Can Ma
Xiangyang Ji
65
2
0
24 Mar 2025
MX-Font++: Mixture of Heterogeneous Aggregation Experts for Few-shot Font Generation
Weihang Wang
Duolin Sun
Jielei Zhang
Longwen Gao
66
0
0
04 Mar 2025
Instruction-Guided Scene Text Recognition
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
75
3
0
03 Jan 2025
Disentanglement and Compositionality of Letter Identity and Letter
  Position in Variational Auto-Encoder Vision Models
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRL
CoGe
68
0
0
11 Dec 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
23
1
0
18 Sep 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph
  Perception Model
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
45
0
0
29 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive
  Permutation for Scene Text Recognition
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
35
0
0
15 May 2024
JSTR: Judgment Improves Scene Text Recognition
JSTR: Judgment Improves Scene Text Recognition
Masato Fujitake
38
1
0
09 Apr 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and
  Margin Loss
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
38
0
0
12 Mar 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
30
4
0
18 Jan 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing
  Fast&Focused-Net with Volume-wise Dot Product Layer
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
30
2
0
18 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
33
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text
  Recognizer
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
34
20
0
22 Nov 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
  Recognition
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
47
15
0
08 Oct 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
22
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
46
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
P. Wang
Cheng Da
Qi Zheng
Cong Yao
32
15
0
24 Aug 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text
  Recognition
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
P. Wang
Cong Yao
15
8
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
31
7
0
23 Jul 2023
DiffusionSTR: Diffusion Model for Scene Text Recognition
DiffusionSTR: Diffusion Model for Scene Text Recognition
Masato Fujitake
DiffM
15
6
0
29 Jun 2023
Conditional Text Image Generation with Diffusion Models
Conditional Text Image Generation with Diffusion Models
Yuanzhi Zhu
Zhaohai Li
Tianwei Wang
Mengchao He
Cong Yao
VLM
DiffM
62
46
0
19 Jun 2023
Looking and Listening: Audio Guided Text Recognition
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
25
1
0
06 Jun 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
24
7
0
25 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained
  Vision-Language Model
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
23
25
0
23 May 2023
Linguistic More: Taking a Further Step toward Efficient and Accurate
  Scene Text Recognition
Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition
Boqiang Zhang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Yongdong Zhang
32
20
0
09 May 2023
Scene Text Recognition with Image-Text Matching-guided Dictionary
Scene Text Recognition with Image-Text Matching-guided Dictionary
Jiajun Wei
Hongjian Zhan
X. Tu
Yue Lu
Umapada Pal
VLM
17
0
0
08 May 2023
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for
  Scene Text Spotting
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
Shancheng Fang
Zhendong Mao
Hongtao Xie
Yuxin Wang
C. Yan
Yongdong Zhang
24
53
0
19 Nov 2022
Portmanteauing Features for Scene Text Recognition
Portmanteauing Features for Scene Text Recognition
Yew Lee Tan
Ernest Yu Kai Chew
A. Kong
Jung-jae Kim
J. Lim
36
0
0
09 Nov 2022
Pure Transformer with Integrated Experts for Scene Text Recognition
Pure Transformer with Integrated Experts for Scene Text Recognition
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
22
16
0
09 Nov 2022
Self-supervised Character-to-Character Distillation for Text Recognition
Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan
Wei Shen
Xuehang Yang
Qi Feng
Zekun Jiang
Xiaokang Yang
31
26
0
01 Nov 2022
Searching a High-Performance Feature Extractor for Text Recognition
  Network
Searching a High-Performance Feature Extractor for Text Recognition Network
Hui Zhang
Quanming Yao
James T. Kwok
X. Bai
28
7
0
27 Sep 2022
Levenshtein OCR
Levenshtein OCR
Cheng Da
P. Wang
Cong Yao
ViT
73
32
0
08 Sep 2022
Multi-Granularity Prediction for Scene Text Recognition
Multi-Granularity Prediction for Scene Text Recognition
P. Wang
Cheng Da
Cong Yao
66
48
0
08 Sep 2022
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text
  Recognition
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Xudong Xie
Ling Fu
Zhifei Zhang
Zhaowen Wang
X. Bai
ViT
34
45
0
31 Jul 2022
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily
  Oriented Scene Text Recognition
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
Dajian Zhong
Shujing Lyu
P. Shivakumara
Bing Yin
Jiajia Wu
Umapada Pal
Yue Lu
26
20
0
21 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
26
169
0
14 Jul 2022
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text
  Spotting
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting
Ying Chen
Liang Qiao1
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Xi Li
19
2
0
14 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for
  Self-Supervised Text Recognition
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
33
55
0
01 Jul 2022
Towards Optimizing OCR for Accessibility
Towards Optimizing OCR for Accessibility
Peya Mowar
T. Ganu
Saikat Guha
14
1
0
21 Jun 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
24
42
0
01 Jun 2022
Multimodal Semi-Supervised Learning for Text Recognition
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
24
19
0
08 May 2022
Pushing the Performance Limit of Scene Text Recognizer without Human
  Annotation
Pushing the Performance Limit of Scene Text Recognizer without Human Annotation
Caiyuan Zheng
Hui Li
Seon-Min Rhee
Seungju Han
Jae-Joon Han
Peng Wang
30
12
0
16 Apr 2022
Training Protocol Matters: Towards Accurate Scene Text Recognition via
  Training Protocol Searching
Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching
Xiaojie Chu
Yongtao Wang
Chunhua Shen
Jingdong Chen
Wei Chu
24
0
0
13 Mar 2022
Towards Open-Set Text Recognition via Label-to-Prototype Learning
Towards Open-Set Text Recognition via Label-to-Prototype Learning
Chang-rui Liu
Chun Yang
Haibo Qin
Xiaobin Zhu
Cheng-Lin Liu
Xu-Cheng Yin
VLM
11
34
0
10 Mar 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
29
25
0
07 Mar 2022
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an
  Empirical Study
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
19
55
0
30 Dec 2021
Visual Semantics Allow for Textual Reasoning Better in Scene Text
  Recognition
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Y. He
Chen Chen
Jing Zhang
Juhua Liu
Fengxiang He
Chaoyue Wang
Bo Du
34
55
0
24 Dec 2021
12
Next