ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.06495
  4. Cited By
Read Like Humans: Autonomous, Bidirectional and Iterative Language
  Modeling for Scene Text Recognition

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

11 March 2021
Shancheng Fang
Hongtao Xie
Yuxin Wang
Zhendong Mao
Yongdong Zhang
ArXivPDFHTML

Papers citing "Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition"

50 / 59 papers shown
Title
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
NCAP: Scene Text Image Super-Resolution with Non-CAtegorical Prior
NCAP: Scene Text Image Super-Resolution with Non-CAtegorical Prior
Dongwoo Park
Suk Pil Ko
183
0
0
01 Apr 2025
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
Han Wang
Kai Hu
Liangcai Gao
176
0
0
20 Mar 2025
Historic Scripts to Modern Vision: A Novel Dataset and A VLM Framework for Transliteration of Modi Script to Devanagari
Historic Scripts to Modern Vision: A Novel Dataset and A VLM Framework for Transliteration of Modi Script to Devanagari
Harshal Kausadikar
Tanvi Kale
Onkar Susladkar
Sparsh Mittal
60
0
0
17 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Enhong Chen
3DV
76
3
0
11 Mar 2025
Instruction-Guided Scene Text Recognition
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
75
3
0
03 Jan 2025
Situational Scene Graph for Structured Human-centric Situation Understanding
Situational Scene Graph for Structured Human-centric Situation Understanding
Chinthani Sugandhika
Chen Li
Deepu Rajan
Basura Fernando
194
1
0
30 Oct 2024
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Weichao Zeng
Yan Shu
Zhenhang Li
Dongbao Yang
Yu Zhou
DiffM
24
7
0
14 Oct 2024
General Detection-based Text Line Recognition
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
181
0
0
25 Sep 2024
Scene-Text Grounding for Text-Based Video Question Answering
Scene-Text Grounding for Text-Based Video Question Answering
Sheng Zhou
Junbin Xiao
Xun Yang
Peipei Song
Dan Guo
Angela Yao
Meng Wang
Tat-Seng Chua
142
1
0
22 Sep 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
60
2
0
27 Aug 2024
Out of Length Text Recognition with Sub-String Matching
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
63
2
0
17 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for
  Scene Text Recognition
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
47
1
0
08 Jul 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
72
2
0
19 Apr 2024
Efficient scene text image super-resolution with semantic guidance
Efficient scene text image super-resolution with semantic guidance
LeoWu TomyEnrique
Xiangcheng Du
Kangliang Liu
Han Yuan
Zhao Zhou
Cheng Jin
VLM
31
2
0
20 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with
  Pre-trained Language Model
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
29
3
0
15 Mar 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing
  Fast&Focused-Net with Volume-wise Dot Product Layer
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
30
2
0
18 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
77
1
0
15 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
41
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion
  Models
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
33
10
0
16 Nov 2023
Scene Text Recognition Models Explainability Using Local Features
Scene Text Recognition Models Explainability Using Local Features
M. Ty
Rowel Atienza
39
1
0
14 Oct 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
51
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
40
15
0
24 Aug 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location
  Enhancement
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Han Guo
Tao Dai
G. MEng
Shutao Xia
26
11
0
19 Jul 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained
  Vision-Language Model
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
23
25
0
23 May 2023
Deep Image Compression Using Scene Text Quality Assessment
Deep Image Compression Using Scene Text Quality Assessment
Shohei Uchigasaki
Tomo Miyazaki
S. Omachi
24
8
0
19 May 2023
TextDiffuser: Diffusion Models as Text Painters
TextDiffuser: Diffusion Models as Text Painters
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
48
113
0
18 May 2023
Scene Text Recognition with Image-Text Matching-guided Dictionary
Scene Text Recognition with Image-Text Matching-guided Dictionary
Jiajun Wei
Hongjian Zhan
X. Tu
Yue Lu
Umapada Pal
VLM
17
0
0
08 May 2023
ICDAR 2023 Video Text Reading Competition for Dense and Small Text
ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Weijia Wu
Yuzhong Zhao
Zhuangzi Li
Jiahong Li
Mike Zheng Shou
Umapada Pal
Dimosthenis Karatzas
Xiang Bai
28
6
0
10 Apr 2023
Improving Scene Text Image Super-resolution via Dual Prior Modulation
  Network
Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
Shipeng Zhu
Zuoyan Zhao
Pengfei Fang
H. Xue
SupR
DiffM
50
24
0
21 Feb 2023
Transferring General Multimodal Pretrained Models to Text Recognition
Transferring General Multimodal Pretrained Models to Text Recognition
Junyang Lin
Xuancheng Ren
Yichang Zhang
Gao Liu
Peng Wang
An Yang
Chang Zhou
34
4
0
19 Dec 2022
Indian Commercial Truck License Plate Detection and Recognition for
  Weighbridge Automation
Indian Commercial Truck License Plate Detection and Recognition for Weighbridge Automation
Siddharth Agrawal
Keyur D. Joshi
35
4
0
23 Nov 2022
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for
  Scene Text Spotting
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
Shancheng Fang
Zhendong Mao
Hongtao Xie
Yuxin Wang
C. Yan
Yongdong Zhang
32
53
0
19 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Zhang
35
3
0
09 Nov 2022
Out-of-Vocabulary Challenge Report
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
14
16
0
14 Sep 2022
Scene Text Recognition with Single-Point Decoding Network
Scene Text Recognition with Single-Point Decoding Network
Lei Chen
Haibo Qin
Shi-Xue Zhang
Chun Yang
Xucheng Yin
26
1
0
05 Sep 2022
Vision-Language Adaptive Mutual Decoder for OOV-STR
Vision-Language Adaptive Mutual Decoder for OOV-STR
Jinshui Hu
Chenyu Liu
Qiandong Yan
Xuyang Zhu
Jiajia Wu
Feng Yu
Bing Yin
VLM
32
0
0
02 Sep 2022
1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene
  Text Understanding: End-to-End Recognition of Out of Vocabulary Words
1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words
Zhangzi Zhu
Chuhui Xue
Yu Hao
Wenqing Zhang
Song Bai
56
0
0
01 Sep 2022
Character decomposition to resolve class imbalance problem in Hangul OCR
Character decomposition to resolve class imbalance problem in Hangul OCR
Geonuk Kim
Jaemin Son
Kanghyu Lee
Jaesik Min
23
2
0
12 Aug 2022
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text
  Recognition
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition
Xudong Xie
Ling Fu
Zhifei Zhang
Zhaowen Wang
X. Bai
ViT
34
45
0
31 Jul 2022
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily
  Oriented Scene Text Recognition
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
Dajian Zhong
Shujing Lyu
P. Shivakumara
Bing Yin
Jiajia Wu
Umapada Pal
Yue Lu
32
20
0
21 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
26
169
0
14 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for
  Self-Supervised Text Recognition
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
33
55
0
01 Jul 2022
An Evaluation of OCR on Egocentric Data
An Evaluation of OCR on Egocentric Data
Valentin Popescu
Dima Damen
Toby Perrett
EgoV
30
0
0
11 Jun 2022
GIT: A Generative Image-to-text Transformer for Vision and Language
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang
Zhengyuan Yang
Xiaowei Hu
Linjie Li
Kevin Qinghong Lin
Zhe Gan
Zicheng Liu
Ce Liu
Lijuan Wang
VLM
59
529
0
27 May 2022
Multimodal Semi-Supervised Learning for Text Recognition
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
24
19
0
08 May 2022
SVTR: Scene Text Recognition with a Single Visual Model
SVTR: Scene Text Recognition with a Single Visual Model
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
22
170
0
30 Apr 2022
C3-STISR: Scene Text Image Super-resolution with Triple Clues
C3-STISR: Scene Text Image Super-resolution with Triple Clues
Minyi Zhao
Miaosen Wang
Fan Bai
Bingjia Li
Jie Wang
Shuigeng Zhou
22
32
0
29 Apr 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
Xiaojie Chu
Yongtao Wang
27
2
0
06 Apr 2022
12
Next