ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.11656
  4. Cited By
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer

18 September 2024
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
ArXiv (abs)PDFHTML

Papers citing "VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer"

24 / 24 papers shown
Title
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
105
173
0
14 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for
  Self-Supervised Text Recognition
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
98
59
0
01 Jul 2022
TrOCR: Transformer-based Optical Character Recognition with Pre-trained
  Models
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
246
372
0
21 Sep 2021
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text
  Recognition
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition
Zhi Qiao
Yu Zhou
Jin Wei
Wei Wang
Yuanqing Zhang
Ning Jiang
Hongbin Wang
Weiping Wang
60
70
0
09 Sep 2021
From Two to One: A New Scene Text Recognizer with Visual Language
  Modeling Network
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
VLM
89
154
0
22 Aug 2021
Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Ilya Krylov
S. Nosov
V. Sovrasov
VLM
68
54
0
23 Jun 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
682
41,483
0
22 Oct 2020
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks
Deli Yu
Xuan Li
Chengquan Zhang
Junyu Han
Jingtuo Liu
Errui Ding
95
287
0
27 Mar 2020
GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text
  Recognition
GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition
Wenyang Hu
Xiaocong Cai
Jun Hou
Shuai Yi
Zhiping Lin
3DV
59
130
0
04 Feb 2020
TextScanner: Reading Characters in Order for Robust Scene Text
  Recognition
TextScanner: Reading Characters in Order for Robust Scene Text Recognition
Zhaoyi Wan
Minghang He
Haoran Chen
X. Bai
Cong Yao
67
139
0
28 Dec 2019
Decoupled Attention Network for Text Recognition
Decoupled Attention Network for Text Recognition
Tianwei Wang
Yuanzhi Zhu
Lianwen Jin
Canjie Luo
Xiaoxue Chen
Y. Wu
Qianying Wang
Mingxiang Cai
203
255
0
21 Dec 2019
ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Xi Liu
Rui Zhang
Yongsheng Zhou
Qianyi Jiang
Qi Song
...
X. Bai
Baoguang Shi
Dimosthenis Karatzas
Shijian Lu
C. V. Jawahar
3DV
62
160
0
20 Dec 2019
RandAugment: Practical automated data augmentation with a reduced search
  space
RandAugment: Practical automated data augmentation with a reduced search space
E. D. Cubuk
Barret Zoph
Jonathon Shlens
Quoc V. Le
MQ
260
3,505
0
30 Sep 2019
ICDAR 2019 Competition on Large-scale Street View Text with Partial
  Labeling -- RRC-LSVT
ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling -- RRC-LSVT
Yipeng Sun
Zihan Ni
Chee-Kheng Chng
Yuliang Liu
Canjie Luo
...
Errui Ding
Jingtuo Liu
Dimosthenis Karatzas
Chee Seng Chan
Lianwen Jin
3DV
104
158
0
17 Sep 2019
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)
Chee-Kheng Chng
Yuliang Liu
Yipeng Sun
Chun Chet Ng
Canjie Luo
...
Errui Ding
Jingtuo Liu
Dimosthenis Karatzas
Chee Seng Chan
Lianwen Jin
3DV
95
215
0
16 Sep 2019
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection
  and Recognition -- RRC-MLT-2019
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019
Nibal Nayef
Yash J. Patel
M. Busta
Pinaki Nath Chowdhury
Dimosthenis Karatzas
...
Jirí Matas
Umapada Pal
J. Burie
Cheng-Lin Liu
J. Ogier
3DV
86
251
0
01 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
236
8,451
0
19 Jun 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,229
0
11 Oct 2018
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting
  Text with Arbitrary Shapes
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
Pengyuan Lyu
Minghui Liao
Cong Yao
Wenhao Wu
X. Bai
101
599
0
06 Jul 2018
Synthetic Data for Text Localisation in Natural Images
Synthetic Data for Text Localisation in Natural Images
Ankush Gupta
Andrea Vedaldi
Andrew Zisserman
153
1,430
0
22 Apr 2016
Robust Scene Text Recognition with Automatic Rectification
Robust Scene Text Recognition with Automatic Rectification
Baoguang Shi
Xinggang Wang
Pengyuan Lyu
Cong Yao
X. Bai
3DV
92
587
0
12 Mar 2016
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,364
0
22 Dec 2014
Reading Text in the Wild with Convolutional Neural Networks
Reading Text in the Wild with Convolutional Neural Networks
Max Jaderberg
Karen Simonyan
Andrea Vedaldi
Andrew Zisserman
127
1,166
0
04 Dec 2014
Synthetic Data and Artificial Neural Networks for Natural Scene Text
  Recognition
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
Max Jaderberg
Karen Simonyan
Andrea Vedaldi
Andrew Zisserman
159
935
0
09 Jun 2014
1