Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.05717
Cited By
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"
50 / 645 papers shown
Title
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Cross-Lingual Learning in Multilingual Scene Text Recognition
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
21
0
0
17 Dec 2023
Diffusion-based Blind Text Image Super-Resolution
Yuzhe Zhang
Jiawei Zhang
Hao Li
Zhouxia Wang
Luwei Hou
Dongqing Zou
Liheng Bian
31
8
0
13 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
35
3
0
12 Dec 2023
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition
Fatemeh Asadi-zeydabadi
Ali Afkari-Fahandari
Amin Faraji
Elham Shabaninia
Hossein Nezamabadi-pour
21
2
0
02 Dec 2023
Towards Higher Ranks via Adversarial Weight Pruning
Yuchuan Tian
Hanting Chen
Tianyu Guo
Chao Xu
Yunhe Wang
32
2
0
29 Nov 2023
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
40
5
0
29 Nov 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
18
4
0
29 Nov 2023
STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers
Daqian Shao
Lukas Fesser
Marta Z. Kwiatkowska
33
0
0
28 Nov 2023
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks
Lucas Beerens
D. Higham
36
1
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
27
60
0
28 Nov 2023
Data Generation for Post-OCR correction of Cyrillic handwriting
Evgenii Davydkin
Aleksandr Markelov
Egor Iuldashev
Anton Dudkin
I. Krivorotov
44
3
0
27 Nov 2023
Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution
Yuxuan Zhou
Liangcai Gao
Zhi Tang
Baole Wei
DiffM
32
3
0
22 Nov 2023
Towards Detecting, Recognizing, and Parsing the Address Information from Bangla Signboard: A Deep Learning-based Approach
Hasan Murad
Mohammed Eunus Ali
21
0
0
22 Nov 2023
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Hao Feng
Qi Liu
Hao Liu
Wen-gang Zhou
Houqiang Li
Can Huang
VLM
25
60
0
20 Nov 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
30
10
0
16 Nov 2023
Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
M. Shahin
Julien Epps
Beena Ahmed
16
1
0
13 Nov 2023
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation
Yongxin Shi
Dezhi Peng
Wenhui Liao
Zening Lin
Xinhong Chen
Chongyu Liu
Yuyi Zhang
Lianwen Jin
MLLM
30
44
0
25 Oct 2023
Adversarial sample generation and training using geometric masks for accurate and resilient license plate character recognition
Bishal Shrestha
Griwan Khakurel
Kritika Simkhada
Badri Adhikari
AAML
27
0
0
25 Oct 2023
Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration
Ahmed Zidane
Ghada Soliman
18
0
0
21 Oct 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
31
8
0
16 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
52
15
0
08 Oct 2023
A Holistic Evaluation of Piano Sound Quality
Monan Zhou
Shangda Wu
Shaohua Ji
Zijin Li
Wei Li
26
0
0
07 Oct 2023
1D-CapsNet-LSTM: A Deep Learning-Based Model for Multi-Step Stock Index Forecasting
Cheng Zhang
N. N. Sjarif
Roslina Ibrahim
AIFin
AI4TS
23
7
0
03 Oct 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
27
11
0
16 Sep 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOT
ViT
39
6
0
09 Sep 2023
Leveraging Model Fusion for Improved License Plate Recognition
Rayson Laroca
L. A. Zanlorensi
Valter Estevam
Rodrigo Minetto
David Menotti
MoMe
29
7
0
08 Sep 2023
STEP -- Towards Structured Scene-Text Spotting
Sergi Garcia-Bordils
Dimosthenis Karatzas
Marccal Rusinol
29
2
0
05 Sep 2023
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
VLM
13
17
0
03 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
30
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
49
35
0
30 Aug 2023
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction
Yung-Hsin Chen
Yuli Zhou
24
2
0
29 Aug 2023
Vision Grid Transformer for Document Layout Analysis
Cheng Da
Chuwei Luo
Qi Zheng
Cong Yao
ViT
40
27
0
29 Aug 2023
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
Zinuo Li
Xuhang Chen
Chi-Man Pun
Xiaodong Cun
37
35
0
27 Aug 2023
Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions
Yibo Wang
Yunhu Ye
Yuanpeng Mao
Yanwei Yu
Yuanping Song
30
2
0
25 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
37
15
0
24 Aug 2023
Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition
Zhuang Liu
Ye Yuan
Zhilong Ji
Jingfeng Bai
X. Bai
27
5
0
21 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
28
11
0
17 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Yu Zhou
Kun Yao
Peng-Zhen Zhang
Hailun Lin
Weiping Wang
42
12
0
14 Aug 2023
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
Baolin Liu
Zongyuan Yang
Pengfei Wang
Yueze Wang
Ziqi Liu
Ziyi Song
Yan Liu
Yongping Xiong
34
7
0
13 Aug 2023
A Benchmark for Chinese-English Scene Text Image Super-resolution
Jianqi Ma
Zhetong Liang
Wangmeng Xiang
Xi Yang
Lei Zhang
22
8
0
07 Aug 2023
One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer
Han Guo
Tao Dai
Mingyan Zhu
G. MEng
Bin Chen
Zhi Wang
Shutao Xia
30
1
0
05 Aug 2023
CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Xin Liao
Si-ping Chen
Jiaxin Chen
Tianyi Wang
Xiehua Li
25
2
0
04 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
44
1
0
31 Jul 2023
A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Saleh Momeni
B. BabaAli
16
12
0
27 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
20
8
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
34
7
0
23 Jul 2023
Physics-Driven Turbulence Image Restoration with Stochastic Refinement
Ajay Jaiswal
Xingguang Zhang
Stanley H. Chan
Zhangyang Wang
29
21
0
20 Jul 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Han Guo
Tao Dai
G. MEng
Shutao Xia
26
11
0
19 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
28
39
0
17 Jul 2023
Previous
1
2
3
4
5
6
...
11
12
13
Next