Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.05717
Cited By
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"
50 / 646 papers shown
Title
A3S: Adversarial learning of semantic representations for Scene-Text Spotting
Masato Fujitake
35
8
0
21 Feb 2023
Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
Shipeng Zhu
Zuoyan Zhao
Pengfei Fang
H. Xue
SupR
DiffM
50
24
0
21 Feb 2023
Fine-tuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition
Jan Kohút
Michal Hradiš
62
7
0
13 Feb 2023
Towards Writing Style Adaptation in Handwriting Recognition
Jan Kohút
Michal Hradiš
M. Kišš
AI4TS
78
4
0
13 Feb 2023
DocILE Benchmark for Document Information Localization and Extraction
vStvepán vSimsa
Milan vSulc
Michal Uvrivcávr
Yash J. Patel
Ahmed Hamdi
...
Matyávs Skalický
Jivrí Matas
Antoine Doucet
Mickael Coustaty
Dimosthenis Karatzas
24
33
0
11 Feb 2023
Geometric Perception based Efficient Text Recognition
P.N.Deelaka
D.R.Jayakodi
D.Y.Silva
21
3
0
08 Feb 2023
Benchmarking Probabilistic Deep Learning Methods for License Plate Recognition
Franziska Schirrmacher
Benedikt Lorch
Anatol Maier
Christian Riess
UQCV
30
4
0
02 Feb 2023
Recurrent Generic Contour-based Instance Segmentation with Progressive Learning
Hao Feng
Keyi Zhou
Wen-gang Zhou
Yufei Yin
Jiajun Deng
Qi Sun
Houqiang Li
SSeg
ISeg
45
12
0
21 Jan 2023
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling
Yilin Wen
Biao Luo
Yuqian Zhao
15
1
0
06 Jan 2023
SPTS v2: Single-Point Scene Text Spotting
Yuliang Liu
Jiaxin Zhang
Dezhi Peng
Mingxin Huang
Xinyu Wang
...
Can Huang
Dahua Lin
Chunhua Shen
Xiang Bai
Lianwen Jin
VLM
34
49
0
04 Jan 2023
From Single-Visit to Multi-Visit Image-Based Models: Single-Visit Models are Enough to Predict Obstructive Hydronephrosis
Stanley Bryan Z. Hua
M. Rickard
J. Weaver
Alice X. Xiang
Daniel Alvarez
...
K. Sheth
Gregory E. Tasian
A. Lorenzo
Anna Goldenberg
L. Erdman
11
0
0
27 Dec 2022
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
Gurkan Soykan
Deniz Yuret
T. M. Sezgin
25
3
0
27 Dec 2022
Transferring General Multimodal Pretrained Models to Text Recognition
Junyang Lin
Xuancheng Ren
Yichang Zhang
Gao Liu
Peng Wang
An Yang
Chang Zhou
34
4
0
19 Dec 2022
Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Hongkuan Zhang
Edward Whittaker
I. Kitagishi
18
2
0
11 Dec 2022
SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels
M. Kišš
Michal Hradiš
Karel Beneš
Petr Buchal
Michal Kula
57
4
0
05 Dec 2022
Proceedings of the 2nd International Workshop on Reading Music Systems
Jorge Calvo-Zaragoza
Alexander Pacha
14
2
0
01 Dec 2022
Proceedings of the 3rd International Workshop on Reading Music Systems
Jorge Calvo-Zaragoza
Alexander Pacha
15
0
0
01 Dec 2022
Impact of Automatic Image Classification and Blind Deconvolution in Improving Text Detection Performance of the CRAFT Algorithm
Clarisa V. Albarillo
P. Fernandez
32
1
0
29 Nov 2022
Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images
Shufang Li
Congxi Lu
Linkai Li
Haoshuai Zhou
21
0
0
25 Nov 2022
Look, Read and Ask: Learning to Ask Questions by Reading Text in Images
Soumya Jahagirdar
Shankar Gangisetty
Anand Mishra
19
4
0
23 Nov 2022
Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification
Nakul Sharma
A. S. Penamakuri
Anand Mishra
3DV
VLM
15
1
0
23 Nov 2022
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
Shancheng Fang
Zhendong Mao
Hongtao Xie
Yuxin Wang
C. Yan
Yongdong Zhang
32
53
0
19 Nov 2022
UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for Evaluating Unconstrained Handwriting Image Recognition in Vietnamese
Nghia Hieu Nguyen
Duong T.D. Vo
Kiet Van Nguyen
24
1
0
10 Nov 2022
Portmanteauing Features for Scene Text Recognition
Yew Lee Tan
Ernest Yu Kai Chew
A. Kong
Jung-jae Kim
J. Lim
41
0
0
09 Nov 2022
Pure Transformer with Integrated Experts for Scene Text Recognition
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
28
16
0
09 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Zhang
32
3
0
09 Nov 2022
Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan
Wei Shen
Xuehang Yang
Qi Feng
Zekun Jiang
Xiaokang Yang
45
26
0
01 Nov 2022
1st Place Solutions for UG2+ Challenge 2022 ATMOSPHERIC TURBULENCE MITIGATION
Zhuang Liu
Zhichao Zhao
Ye Yuan
Zhi Qiao
Jinfeng Bai
Zhilong Ji
27
0
0
30 Oct 2022
Complex Handwriting Trajectory Recovery: Evaluation Metrics and Algorithm
Zhounan Chen
Daihui Yang
Jinglin Liang
Xinwu Liu
Yuyi Wang
Zhenghua Peng
Shuangping Huang
34
7
0
28 Oct 2022
Scene Text Recognition with Semantics
Joshua Cesare Placidi
Yishu Miao
Zixu Wang
Lucia Specia
21
1
0
19 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
30
18
0
19 Oct 2022
COFAR: Commonsense and Factual Reasoning in Image Search
Prajwal Gatti
A. S. Penamakuri
Revant Teotia
Anand Mishra
Shubhashis Sengupta
Roshni Ramnani
ReLM
LRM
20
4
0
16 Oct 2022
MenuAI: Restaurant Food Recommendation System via a Transformer-based Deep Learning Model
Xinwei Ju
Frank P.-W. Lo
Jianing Qiu
Peilun Shi
Jiachuan Peng
Benny Lo
24
3
0
15 Oct 2022
Semi-supervised Body Parsing and Pose Estimation for Enhancing Infant General Movement Assessment
Haomiao Ni
Yuan Xue
Liya Ma
Qian Zhang
Xiaoye Li
Xiaolei Huang
MedIm
27
38
0
14 Oct 2022
Text Detection Forgot About Document OCR
Krzysztof Olejniczak
Milan Šulc
34
9
0
14 Oct 2022
Scene Text Image Super-Resolution via Content Perceptual Loss and Criss-Cross Transformer Blocks
Rui Qin
Bin Wang
Yu-Wing Tai
24
9
0
13 Oct 2022
Text detection and recognition based on a lensless imaging system
Yinger Zhang
Zhouyi Wu
Peiying Lin
Yuting Wu
Lusong Wei
Zhengjie Huang
J. Huangfu
3DV
8
3
0
09 Oct 2022
Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task
Cong Ma
Yaping Zhang
Mei Tu
Xu Han
Linghui Wu
Yang Zhao
Yu Zhou
60
24
0
08 Oct 2022
Reading Chinese in Natural Scenes with a Bag-of-Radicals Prior
Yongbin Liu
Liu Qingjie
Jiaxin Chen
Wang Yunhong
34
1
0
05 Oct 2022
How deep convolutional neural networks lose spatial information with training
Umberto M. Tomasini
Leonardo Petrini
Francesco Cagnetta
M. Wyart
41
9
0
04 Oct 2022
Long-Term Localization using Semantic Cues in Floor Plan Maps
Nicky Zimmerman
Tiziano Guadagnino
Xieyuanli Chen
Jens Behley
C. Stachniss
39
23
0
04 Oct 2022
Searching a High-Performance Feature Extractor for Text Recognition Network
Hui Zhang
Quanming Yao
James T. Kwok
X. Bai
28
7
0
27 Sep 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture
Ziqing Du
Kai Liu
Xucheng Wan
Huan Zhou
25
0
0
24 Sep 2022
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
14
16
0
14 Sep 2022
Lexicon and Attention based Handwritten Text Recognition System
Lalita Kumari
Sukhdeep Singh
Vvs Rathore
Anuj Sharma
HAI
40
4
0
11 Sep 2022
Levenshtein OCR
Cheng Da
Peng Wang
Cong Yao
ViT
76
32
0
08 Sep 2022
Multi-Granularity Prediction for Scene Text Recognition
Peng Wang
Cheng Da
Cong Yao
66
48
0
08 Sep 2022
DM
2
^2
2
S
2
^2
2
: Deep Multi-Modal Sequence Sets with Hierarchical Modality Attention
Shunsuke Kitada
Yuki Iwazaki
Riku Togashi
Hitoshi Iyatomi
21
1
0
07 Sep 2022
Scene Text Recognition with Single-Point Decoding Network
Lei Chen
Haibo Qin
Shi-Xue Zhang
Chun Yang
Xucheng Yin
26
1
0
05 Sep 2022
Vision-Language Adaptive Mutual Decoder for OOV-STR
Jinshui Hu
Chenyu Liu
Qiandong Yan
Xuyang Zhu
Jiajia Wu
Feng Yu
Bing Yin
VLM
26
0
0
02 Sep 2022
Previous
1
2
3
4
5
6
...
11
12
13
Next