1507.05717
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
VLM
Papers citing
"An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"
50 / 646 papers shown
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim
Seung Shin
Yoonsik Kim
Han-Cheol Cho
Taeho Kil
Jaeheung Surh
Seunghyun Park
Bado Lee
Youngmin Baek
22
8
0
10 Mar 2022
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
Mohamed Ali Souibgui
Sanket Biswas
Andrés Mafla
Ali Furkan Biten
Alicia Fornés
Yousri Kessentini
Josep Lladós
Lluís Gómez
Dimosthenis Karatzas
21
23
0
09 Mar 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
34
25
0
07 Mar 2022
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Ye Yuan
Xiao-Chang Liu
Wondimu Dikubab
Hui Liu
Zhilong Ji
Zhongqin Wu
X. Bai
40
58
0
03 Mar 2022
SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition
Yen-Cheng Chang
Yi-Chang Chen
Yu-Chuan Chang
Yi-Ren Yeh
28
7
0
24 Feb 2022
Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition
Felix Ott
David Rügamer
Lucas Heublein
Bernd Bischl
Christopher Mutschler
56
9
0
16 Feb 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
18
53
0
11 Feb 2022
AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks
Dmitrijs Kass
Ekta Vats
HAI
40
28
0
23 Jan 2022
Region-based Layout Analysis of Music Score Images
Francisco J. Castellanos
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
16
8
0
11 Jan 2022
Towards Boosting the Accuracy of Non-Latin Scene Text Recognition
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
27
5
0
10 Jan 2022
Transfer Learning for Scene Text Recognition in Indian Languages
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
VLM
25
12
0
10 Jan 2022
Image-based Automatic Dial Meter Reading in Unconstrained Scenarios
Gabriel Salomon
Rayson Laroca
David Menotti
18
18
0
08 Jan 2022
On the Cross-dataset Generalization in License Plate Recognition
Rayson Laroca
Everton Vilhena Cardoso
D. Lucio
Valter Estevam
David Menotti
24
42
0
02 Jan 2022
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
Bao Hieu Tran
Le Thanh
Huu Manh Nguyen
Duc Anh Le
T. Nguyen
Phi Le Nguyen
14
1
0
01 Jan 2022
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
19
55
0
30 Dec 2021
Contrastive Learning of Semantic and Visual Representations for Text Tracking
Zhuang Li
Weijia Wu
Mike Zheng Shou
Jiahong Li
Size Li
Zhongyuan Wang
Hong Zhou
34
10
0
30 Dec 2021
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Y. He
Chen Chen
Jing Zhang
Juhua Liu
Fengxiang He
Chaoyue Wang
Bo Du
34
55
0
24 Dec 2021
Image-free multi-character recognition
Huayi Wang
Chunli Zhu
Liheng Bian
11
6
0
20 Dec 2021
Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution
Jingye Chen
Haiyang Yu
Jianqi Ma
Bin Li
Xiangyang Xue
DiffM
25
47
0
13 Dec 2021
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
Weijia Wu
Yuanqiang Cai
Debing Zhang
Sibo Wang
Zhuang Li
Jiahong Li
Yejun Tang
Hong Zhou
33
29
0
09 Dec 2021
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
26
8
0
02 Dec 2021
OCR-free Document Understanding Transformer
Geewook Kim
Teakgyu Hong
Moonbin Yim
Jeongyeon Nam
Jinyoung Park
Jinyeong Yim
Wonseok Hwang
Sangdoo Yun
Dongyoon Han
Seunghyun Park
ViT
58
263
0
30 Nov 2021
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Byeonghu Na
Yoonsik Kim
Sungrae Park
46
54
0
30 Nov 2021
Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition
Changxu Cheng
Bohan Li
Qi Zheng
Yongpan Wang
Wenyu Liu
21
2
0
24 Nov 2021
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages
Shota Orihashi
Yoshihiro Yamazaki
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Ryo Masumura
33
1
0
24 Nov 2021
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
Tianlun Zheng
Zhineng Chen
Shancheng Fang
Hongtao Xie
Yu-Gang Jiang
36
51
0
22 Nov 2021
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Yuefeng Tao
Zhiwei Jia
Runze Ma
Shugong Xu
ViT
19
6
0
16 Nov 2021
Improving Structured Text Recognition with Regular Expression Biasing
Baoguang Shi
W. Cheng
Yijuan Lu
Cha Zhang
D. Florêncio
14
2
0
10 Nov 2021
Video Text Tracking With a Spatio-Temporal Complementary Model
Yuzhe Gao
Xing Li
Jiajian Zhang
Yu Zhou
Dian Jin
Jing Wang
Shenggao Zhu
Xiang Bai
21
17
0
09 Nov 2021
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models
J. Yoon
H. Kim
Hyeon Seung Lee
Sunghwan Ahn
N. Kim
36
1
0
05 Nov 2021
Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos Understanding
Avijit Shah
Topojoy Biswas
Sathish Ramadoss
Deven Santosh Shah
23
4
0
31 Oct 2021
TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation
Wei Wang
Yu Zhou
Jiahao Lv
Dayan Wu
Guoqing Zhao
Ning Jiang
Weiping Wang
46
33
0
25 Oct 2021
Ultra Light OCR Competition Technical Report
Shuhan Zhang
S. Moussa
Ziad El-Khatib
A. B. Mnaouer
3DV
34
0
0
25 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation
Jingyu Zhao
Yanwen Fang
Guodong Li
19
23
0
22 Oct 2021
Accurate Fine-grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation
Penghai Zhao
Weilan Wang
Zhengqi Cai
Guowei Zhang
Yuqi Lu
22
7
0
15 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
25
217
0
07 Oct 2021
Asking questions on handwritten document collections
Minesh Mathew
Lluís Gómez
Dimosthenis Karatzas
C. V. Jawahar
RALM
31
11
0
02 Oct 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
98
343
0
21 Sep 2021
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition
Zhi Qiao
Yu Zhou
Jin Wei
Wei Wang
Yuanqing Zhang
Ning Jiang
Hongbin Wang
Weiping Wang
22
70
0
09 Sep 2021
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
Yuning Du
Chenxia Li
Ruoyu Guo
Cheng Cui
Weiwei Liu
...
Yehua Yang
Qiwen Liu
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
19
66
0
07 Sep 2021
Meta Self-Learning for Multi-Source Domain Adaptation: A Benchmark
Shuhao Qiu
Chuang Zhu
Wenli Zhou
VLM
OOD
24
8
0
24 Aug 2021
EKTVQA: Generalized use of External Knowledge to empower Scene Text in Text-VQA
Arka Ujjal Dey
Ernest Valveny
Gaurav Harit
6
3
0
22 Aug 2021
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
VLM
58
152
0
22 Aug 2021
Data Augmentation for Scene Text Recognition
Rowel Atienza
31
19
0
16 Aug 2021
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Zhanghui Kuang
Hongbin Sun
Zhizhong Li
Xiaoyu Yue
T. Lin
...
Tong Gao
Wenwei Zhang
Kai-xiang Chen
Wayne Zhang
Dahua Lin
VLM
26
71
0
14 Aug 2021
IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text Recognition
Zhiwei Jia
Shugong Xu
Shiyi Mu
Y. Tao
Shan Cao
Zhiyong Chen
13
3
0
13 Aug 2021
VTLayout: Fusion of Visual and Text Features for Document Layout Analysis
Shoubin Li
Xuyan Ma
Shuaiqun Pan
Jun Hu
Lin Shi
Qing Wang
20
9
0
12 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
27
113
0
06 Aug 2021
Why You Should Try the Real Data for the Scene Text Recognition
V. Loginov
22
6
0
29 Jul 2021
Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition
A. Bhunia
Aneeshan Sain
Amandeep Kumar
S. Ghose
Pinaki Nath Chowdhury
Yi-Zhe Song
29
56
0
26 Jul 2021