ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.05717
  4. Cited By
An End-to-End Trainable Neural Network for Image-based Sequence
  Recognition and Its Application to Scene Text Recognition

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

21 July 2015
Baoguang Shi
X. Bai
Cong Yao
    VLM
ArXivPDFHTML

Papers citing "An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"

50 / 645 papers shown
Title
Image-to-LaTeX Converter for Mathematical Formulas and Text
Image-to-LaTeX Converter for Mathematical Formulas and Text
Daniil Gurgurov
Aleksey Morshnev
ViT
VLM
55
1
0
07 Aug 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
36
0
0
04 Aug 2024
Visual Text Generation in the Wild
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
53
10
0
19 Jul 2024
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting
  Recognition
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition
Gagan Bhatia
El Moatez Billah Nagoudi
Fakhraddin Alwajih
Muhammad Abdul-Mageed
34
3
0
18 Jul 2024
Back to Newton's Laws: Learning Vision-based Agile Flight via
  Differentiable Physics
Back to Newton's Laws: Learning Vision-based Agile Flight via Differentiable Physics
Yuang Zhang
Yu Hu
Yunlong Song
Danping Zou
Weiyao Lin
38
17
0
15 Jul 2024
Long-range Turbulence Mitigation: A Large-scale Dataset and A
  Coarse-to-fine Framework
Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework
Shengqi Xu
Run Sun
Yi Chang
Shuning Cao
Xueyao Xiao
Luxin Yan
26
3
0
11 Jul 2024
PosFormer: Recognizing Complex Handwritten Mathematical Expression with
  Position Forest Transformer
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer
Tongkun Guan
Chengyu Lin
Wei Shen
Xiaokang Yang
29
5
0
10 Jul 2024
Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation
Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation
Filipe Lauar
Valentin Laurent
37
0
0
09 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for
  Scene Text Recognition
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
47
1
0
08 Jul 2024
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data
Renqing Luo
Yuhan Xu
41
0
0
24 Jun 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Derek F. Wong
Xiaoshuai Sun
Rongrong Ji
VLM
45
1
0
17 Jun 2024
Classification of Non-native Handwritten Characters Using Convolutional
  Neural Network
Classification of Non-native Handwritten Characters Using Convolutional Neural Network
F. A. Mamun
S. Chowdhury
J. E. Giti
H. Sarker
44
1
0
06 Jun 2024
Improving Text Generation on Images with Synthetic Captions
Improving Text Generation on Images with Synthetic Captions
Jun Young Koh
Sang Hyun Park
Joy Song
DiffM
51
2
0
01 Jun 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph
  Perception Model
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
48
0
0
29 May 2024
Dataset and Benchmark for Urdu Natural Scenes Text Detection,
  Recognition and Visual Question Answering
Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering
Hiba Maryam
Ling Fu
Jiajun Song
Tajrian Abm Shafayet
Qidi Luo
Xiang Bai
Yuliang Liu
16
0
0
21 May 2024
CustomText: Customized Textual Image Generation using Diffusion Models
CustomText: Customized Textual Image Generation using Diffusion Models
Shubham Paliwal
Arushi Jain
Monika Sharma
Vikram Jamwal
L. Vig
43
0
0
21 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive
  Permutation for Scene Text Recognition
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
35
0
0
15 May 2024
Self-Supervised Pre-training with Symmetric Superimposition Modeling for
  Scene Text Recognition
Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
Zuan Gao
Yuxin Wang
Yadong Qu
Boqiang Zhang
Zixiao Wang
Jianjun Xu
Hongtao Xie
ViT
45
9
0
09 May 2024
Align, Minimize and Diversify: A Source-Free Unsupervised Domain
  Adaptation Method for Handwritten Text Recognition
Align, Minimize and Diversify: A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition
María Alfaro-Contreras
Jorge Calvo-Zaragoza
38
0
0
28 Apr 2024
GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text
  Recognition System
GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System
Lalita Kumari
Sukhdeep Singh
V. Rathore
Anuj Sharma
56
1
0
22 Apr 2024
A Dataset and Model for Realistic License Plate Deblurring
A Dataset and Model for Realistic License Plate Deblurring
Haoyan Gong
Yuzheng Feng
Zhenrong Zhang
Xianxu Hou
Jingxin Liu
Siqi Huang
Hongbin Liu
18
4
0
21 Apr 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
72
2
0
19 Apr 2024
JSTR: Judgment Improves Scene Text Recognition
JSTR: Judgment Improves Scene Text Recognition
Masato Fujitake
44
1
0
09 Apr 2024
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for
  Document Enhancement
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
Giordano Cicchetti
Danilo Comminiello
31
4
0
08 Apr 2024
LayoutLLM: Layout Instruction Tuning with Large Language Models for
  Document Understanding
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
Chuwei Luo
Yufan Shen
Zhaoqing Zhu
Qi Zheng
Zhi Yu
Cong Yao
31
38
0
08 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
35
3
0
06 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information
  Extraction and Table Recognition
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
51
26
0
28 Mar 2024
Global License Plate Dataset
Global License Plate Dataset
Siddharth Agrawal
32
1
0
22 Mar 2024
Practical End-to-End Optical Music Recognition for Pianoform Music
Practical End-to-End Optical Music Recognition for Pianoform Music
Jirí Mayer
Milan Straka
Jan Hajic
Pavel Pecina
27
2
0
20 Mar 2024
HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text
  Recognition
HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Yuyi Zhang
Yuanzhi Zhu
Dezhi Peng
Peirong Zhang
Zhenhua Yang
Zhibo Yang
Cong Yao
Lianwen Jin
16
4
0
20 Mar 2024
Efficient scene text image super-resolution with semantic guidance
Efficient scene text image super-resolution with semantic guidance
LeoWu TomyEnrique
Xiangcheng Du
Kangliang Liu
Han Yuan
Zhao Zhou
Cheng Jin
VLM
31
2
0
20 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in
  the Era of Large Foundation Models
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Kung-Hsiang Huang
Hou Pong Chan
Yi Ren Fung
Haoyi Qiu
Mingyang Zhou
Shafiq R. Joty
Shih-Fu Chang
Heng Ji
AI4TS
64
14
0
18 Mar 2024
OCR is All you need: Importing Multi-Modality into Image-based Defect
  Detection System
OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System
Chih-Chung Hsu
Chia-Ming Lee
Chun-Hung Sun
Kuang-Ming Wu
42
0
0
18 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with
  Pre-trained Language Model
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
29
3
0
15 Mar 2024
IndicSTR12: A Dataset for Indic Scene Text Recognition
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia
Ajoy Mondal
C. V. Jawahar
24
2
0
12 Mar 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and
  Margin Loss
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
41
0
0
12 Mar 2024
LOCR: Location-Guided Transformer for Optical Character Recognition
LOCR: Location-Guided Transformer for Optical Character Recognition
Yu Sun
Dongzhan Zhou
Chen Lin
Conghui He
Wanli Ouyang
Han-Sen Zhong
40
1
0
04 Mar 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
49
1
0
27 Feb 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text
  Recognition
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
32
5
0
24 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
78
10
0
21 Feb 2024
VATr++: Choose Your Words Wisely for Handwritten Text Generation
VATr++: Choose Your Words Wisely for Handwritten Text Generation
Bram Vanherle
Vittorio Pippi
S. Cascianelli
Nick Michiels
F. Reeth
Rita Cucchiara
19
3
0
16 Feb 2024
Sheet Music Transformer: End-To-End Optical Music Recognition Beyond
  Monophonic Transcription
Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
Thierry Paquet
40
9
0
12 Feb 2024
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual
  Text Processing
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu
Weichao Zeng
Zhenhang Li
Fangmin Zhao
Yu Zhou
32
3
0
05 Feb 2024
Text Image Inpainting via Global Structure-Guided Diffusion Models
Text Image Inpainting via Global Structure-Guided Diffusion Models
Shipeng Zhu
Pengfei Fang
Chenjie Zhu
Zuoyan Zhao
Qiang Xu
Hui Xue
DiffM
30
5
0
26 Jan 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text
  Recognition
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
37
2
0
18 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
77
1
0
15 Jan 2024
Spatio-Temporal Turbulence Mitigation: A Translational Perspective
Spatio-Temporal Turbulence Mitigation: A Translational Perspective
Xingguang Zhang
Nicholas Chimitt
Yiheng Chi
Zhiyuan Mao
Stanley H. Chan
52
9
0
08 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order
  Estimation and Dynamic Sampling
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
34
6
0
08 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
41
6
0
29 Dec 2023
Word length-aware text spotting: Enhancing detection and recognition in
  dense text image
Word length-aware text spotting: Enhancing detection and recognition in dense text image
Hao Wang
Huabing Zhou
Yanduo Zhang
Tao Lu
Jiayi Ma
35
1
0
25 Dec 2023
Previous
12345...111213
Next