ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.05717
  4. Cited By
An End-to-End Trainable Neural Network for Image-based Sequence
  Recognition and Its Application to Scene Text Recognition

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

21 July 2015
Baoguang Shi
X. Bai
Cong Yao
    VLM
ArXivPDFHTML

Papers citing "An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"

50 / 645 papers shown
Title
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Cross-Lingual Learning in Multilingual Scene Text Recognition
Cross-Lingual Learning in Multilingual Scene Text Recognition
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
21
0
0
17 Dec 2023
Diffusion-based Blind Text Image Super-Resolution
Diffusion-based Blind Text Image Super-Resolution
Yuzhe Zhang
Jiawei Zhang
Hao Li
Zhouxia Wang
Luwei Hou
Dongqing Zou
Liheng Bian
31
8
0
13 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
35
3
0
12 Dec 2023
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical
  Character Recognition
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition
Fatemeh Asadi-zeydabadi
Ali Afkari-Fahandari
Amin Faraji
Elham Shabaninia
Hossein Nezamabadi-pour
21
2
0
02 Dec 2023
Towards Higher Ranks via Adversarial Weight Pruning
Towards Higher Ranks via Adversarial Weight Pruning
Yuchuan Tian
Hanting Chen
Tianyu Guo
Chao Xu
Yunhe Wang
32
2
0
29 Nov 2023
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and
  Small Text
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
40
5
0
29 Nov 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text
  Image Super-Resolution
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
18
4
0
29 Nov 2023
STR-Cert: Robustness Certification for Deep Text Recognition on Deep
  Learning Pipelines and Vision Transformers
STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers
Daqian Shao
Lukas Fesser
Marta Z. Kwiatkowska
33
0
0
28 Nov 2023
Vulnerability Analysis of Transformer-based Optical Character
  Recognition to Adversarial Attacks
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks
Lucas Beerens
D. Higham
36
1
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text
  Rendering
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
27
60
0
28 Nov 2023
Data Generation for Post-OCR correction of Cyrillic handwriting
Data Generation for Post-OCR correction of Cyrillic handwriting
Evgenii Davydkin
Aleksandr Markelov
Egor Iuldashev
Anton Dudkin
I. Krivorotov
44
3
0
27 Nov 2023
Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution
Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution
Yuxuan Zhou
Liangcai Gao
Zhi Tang
Baole Wei
DiffM
32
3
0
22 Nov 2023
Towards Detecting, Recognizing, and Parsing the Address Information from
  Bangla Signboard: A Deep Learning-based Approach
Towards Detecting, Recognizing, and Parsing the Address Information from Bangla Signboard: A Deep Learning-based Approach
Hasan Murad
Mohammed Eunus Ali
21
0
0
22 Nov 2023
DocPedia: Unleashing the Power of Large Multimodal Model in the
  Frequency Domain for Versatile Document Understanding
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding
Hao Feng
Qi Liu
Hao Liu
Wen-gang Zhou
Houqiang Li
Can Huang
VLM
25
60
0
20 Nov 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion
  Models
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
30
10
0
16 Nov 2023
Phonological Level wav2vec2-based Mispronunciation Detection and
  Diagnosis Method
Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
M. Shahin
Julien Epps
Beena Ahmed
16
1
0
13 Nov 2023
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and
  In-depth Evaluation
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation
Yongxin Shi
Dezhi Peng
Wenhui Liao
Zening Lin
Xinhong Chen
Chongyu Liu
Yuyi Zhang
Lianwen Jin
MLLM
30
44
0
25 Oct 2023
Adversarial sample generation and training using geometric masks for
  accurate and resilient license plate character recognition
Adversarial sample generation and training using geometric masks for accurate and resilient license plate character recognition
Bishal Shrestha
Griwan Khakurel
Kritika Simkhada
Badri Adhikari
AAML
27
0
0
25 Oct 2023
Convolutional Bidirectional Variational Autoencoder for Image Domain
  Translation of Dotted Arabic Expiration
Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration
Ahmed Zidane
Ghada Soliman
18
0
0
21 Oct 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently
  Digitizing World Knowledge
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
31
8
0
16 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
  Recognition
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
52
15
0
08 Oct 2023
A Holistic Evaluation of Piano Sound Quality
A Holistic Evaluation of Piano Sound Quality
Monan Zhou
Shangda Wu
Shaohua Ji
Zijin Li
Wei Li
26
0
0
07 Oct 2023
1D-CapsNet-LSTM: A Deep Learning-Based Model for Multi-Step Stock Index
  Forecasting
1D-CapsNet-LSTM: A Deep Learning-Based Model for Multi-Step Stock Index Forecasting
Cheng Zhang
N. N. Sjarif
Roslina Ibrahim
AIFin
AI4TS
23
7
0
03 Oct 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text
  Image Super-Resolution
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
27
11
0
16 Sep 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOT
ViT
39
6
0
09 Sep 2023
Leveraging Model Fusion for Improved License Plate Recognition
Leveraging Model Fusion for Improved License Plate Recognition
Rayson Laroca
L. A. Zanlorensi
Valter Estevam
Rodrigo Minetto
David Menotti
MoMe
29
7
0
08 Sep 2023
STEP -- Towards Structured Scene-Text Spotting
STEP -- Towards Structured Scene-Text Spotting
Sergi Garcia-Bordils
Dimosthenis Karatzas
Marccal Rusinol
29
2
0
05 Sep 2023
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through
  Image-IDS Aligning
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
VLM
13
17
0
03 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
30
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
49
35
0
30 Aug 2023
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph
  Embedding for Improved Correction
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction
Yung-Hsin Chen
Yuli Zhou
24
2
0
29 Aug 2023
Vision Grid Transformer for Document Layout Analysis
Vision Grid Transformer for Document Layout Analysis
Cheng Da
Chuwei Luo
Qi Zheng
Cong Yao
ViT
40
27
0
29 Aug 2023
High-Resolution Document Shadow Removal via A Large-Scale Real-World
  Dataset and A Frequency-Aware Shadow Erasing Net
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
Zinuo Li
Xuhang Chen
Chi-Man Pun
Xiaodong Cun
37
35
0
27 Aug 2023
Self-supervised Scene Text Segmentation with Object-centric Layered
  Representations Augmented by Text Regions
Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions
Yibo Wang
Yunhu Ye
Yuanpeng Mao
Yanwei Yu
Yuanping Song
30
2
0
25 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
37
15
0
24 Aug 2023
Semantic Graph Representation Learning for Handwritten Mathematical
  Expression Recognition
Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition
Zhuang Liu
Ye Yuan
Zhilong Ji
Jingfeng Bai
X. Bai
27
5
0
21 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss
  for Text Recognition: A Simple Yet Effective Approach
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
28
11
0
17 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance
  Representation Learning
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Yu Zhou
Kun Yao
Peng-Zhen Zhang
Hailun Lin
Weiping Wang
42
12
0
14 Aug 2023
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
Baolin Liu
Zongyuan Yang
Pengfei Wang
Yueze Wang
Ziqi Liu
Ziyi Song
Yan Liu
Yongping Xiong
34
7
0
13 Aug 2023
A Benchmark for Chinese-English Scene Text Image Super-resolution
A Benchmark for Chinese-English Scene Text Image Super-resolution
Jianqi Ma
Zhetong Liang
Wangmeng Xiang
Xi Yang
Lei Zhang
22
8
0
07 Aug 2023
One-stage Low-resolution Text Recognition with High-resolution Knowledge
  Transfer
One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer
Han Guo
Tao Dai
Mingyan Zhu
G. MEng
Bin Chen
Zhi Wang
Shutao Xia
30
1
0
05 Aug 2023
CTP-Net: Character Texture Perception Network for Document Image Forgery
  Localization
CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Xin Liao
Si-ping Chen
Jiaxin Chen
Tianyi Wang
Xiehua Li
25
2
0
04 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image
  Super-Resolution
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
44
1
0
31 Jul 2023
A Transformer-based Approach for Arabic Offline Handwritten Text
  Recognition
A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Saleh Momeni
B. BabaAli
16
12
0
27 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text
  Recognition
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
20
8
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
34
7
0
23 Jul 2023
Physics-Driven Turbulence Image Restoration with Stochastic Refinement
Physics-Driven Turbulence Image Restoration with Stochastic Refinement
Ajay Jaiswal
Xingguang Zhang
Stanley H. Chan
Zhangyang Wang
29
21
0
20 Jul 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location
  Enhancement
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Han Guo
Tao Dai
G. MEng
Shutao Xia
26
11
0
19 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
28
39
0
17 Jul 2023
Previous
123456...111213
Next