TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

21 September 2021
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
    ViT

Papers citing "TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models"

45 / 45 papers shown
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
Tong Wang
Ting Liu
Xiaochao Qu
Chengjing Wu
Luoqi Liu
Xiaolin Hu
DiffM
55
0
0
08 May 2025
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
26
0
0
07 May 2025
GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling
Siqi Li
Yufan Shen
Xiangnan Chen
Jiayi Chen
Hengwei Ju
...
Licheng Wen
Botian Shi
Y. Liu
Xinyu Cai
Yu Qiao
VLM
ELM
89
0
0
30 Apr 2025
AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling Engine
Carlo Siebenschuh
Kyle Hippe
Ozan Gokdemir
Alexander Brace
A. Khan
...
V. Vishwanath
R. Stevens
Arvind Ramanathan
Ian Foster
Robert Underwood
MoE
44
0
0
23 Apr 2025
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
Cheng Yang
Yang Sui
Jinqi Xiao
Lingyi Huang
Yu Gong
...
Jinghua Yan
Y. Bai
P. Sadayappan
Xia Hu
Bo Yuan
VLM
53
0
0
24 Mar 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
53
0
0
07 Feb 2025
Towards Making Flowchart Images Machine Interpretable
S. Kamath S
Prajwal Gatti
Yogesh Kumar
Vikash Yadav
Anand Mishra
53
5
0
29 Jan 2025
Enhancing Complex Formula Recognition with Hierarchical Detail-Focused Network
Jiale Wang
Junhui Yu
Huanyong Liu
Chenanran Kong
AIMat
44
0
0
10 Jan 2025
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Wataru Shimoda
Naoto Inoue
Daichi Haraguchi
Hayato Mitani
S. Uchida
Kota Yamaguchi
DiffM
93
0
0
27 Nov 2024
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild
Xi Fang
Jiankun Wang
X. Cai
Shangqian Chen
Shuwen Yang
Lin Yao
Linfeng Zhang
Guolin Ke
50
1
0
17 Nov 2024
HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Adrian Chan
Anupam Mijar
Mehreen Saeed
Chau-Wai Wong
Akram Khater
36
0
0
03 Oct 2024
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
140
0
0
25 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
23
1
0
18 Sep 2024
BMI Prediction from Handwritten English Characters Using a Convolutional Neural Network
N. T. Diba
N. Akter
S. Chowdhury
J. E. Giti
36
0
0
04 Sep 2024
Self-Supervised Vision Transformers for Writer Retrieval
Tim Raven
Arthur Matei
Gernot A. Fink
ViT
20
0
0
01 Sep 2024
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation
Nauman Riaz
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
DiffM
35
0
0
22 Jul 2024
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports
Yanfu Yan
Nathan Cooper
Oscar Chaparro
Kevin Moran
Denys Poshyvanyk
43
5
0
11 Jul 2024
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation
Ahmad Mohammadshirazi
Ali Nosrati Firoozsalari
Mengxi Zhou
Dheeraj Kulshrestha
R. Ramnath
31
0
0
25 Jun 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Derek F. Wong
Xiaoshuai Sun
Rongrong Ji
VLM
40
1
0
17 Jun 2024
Classification of Non-native Handwritten Characters Using Convolutional Neural Network
F. A. Mamun
S. Chowdhury
J. E. Giti
H. Sarker
39
1
0
06 Jun 2024
Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library
Solène Tarride
Yoann Schneider
Marie Generali-Lince
Mélodie Boillet
Bastien Abadie
Christopher Kermorvant
28
3
0
29 Apr 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
64
2
0
19 Apr 2024
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Carlos Peñarrubia
Carlos Garrido-Munoz
J. J. Valero-Mas
Jorge Calvo-Zaragoza
37
1
0
17 Apr 2024
LOCR: Location-Guided Transformer for Optical Character Recognition
Yu Sun
Dongzhan Zhou
Chen Lin
Conghui He
Wanli Ouyang
Han-Sen Zhong
27
1
0
04 Mar 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
28
2
0
18 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
33
6
0
29 Dec 2023
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks
Lucas Beerens
D. Higham
30
1
0
28 Nov 2023
Automatic Report Generation for Histopathology images using pre-trained Vision Transformers
S. Sengupta
Donald E. Brown
VLM
MedIm
ViT
24
8
0
10 Nov 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
23
8
0
16 Oct 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
43
35
0
30 Aug 2023
Writer adaptation for offline text recognition: An exploration of neural network-based methods
Tobias van der Werff
Maruf A. Dhali
Lambert Schomaker
35
0
0
11 Jul 2023
Combining OCR Models for Reading Early Modern Printed Books
Mathias Seuret
Janne van der Loop
Nikolaus Weichselbaumer
Martin Mayr
J. Molnar
Tatjana Hass
Florian Kordon
Anguelos Nicolau
Vincent Christlein
21
2
0
11 May 2023
Efficient OCR for Building a Diverse Digital History
Jacob Carlson
Tom Bryan
Melissa Dell
23
11
0
05 Apr 2023
ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents
Sana Khamekhem Jemni
Sourour Ammar
Mohamed Ali Souibgui
Yousri Kessentini
A. Cheddad
15
3
0
06 Mar 2023
Fine-tuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition
Jan Kohút
Michal Hradiš
57
7
0
13 Feb 2023
Transferring General Multimodal Pretrained Models to Text Recognition
Junyang Lin
Xuancheng Ren
Yichang Zhang
Gao Liu
Peng Wang
An Yang
Chang Zhou
32
4
0
19 Dec 2022
Towards Robust Handwritten Text Recognition with On-the-fly User Participation
Ajoy Mondal
Rohit Saluja
C. V. Jawahar
HAI
16
0
0
17 Dec 2022
Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images
Shufang Li
Congxi Lu
Linkai Li
Haoshuai Zhou
13
0
0
25 Nov 2022
Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions
S. Cascianelli
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
19
22
0
17 Aug 2022
The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition
S. Cascianelli
Vittorio Pippi
Martin Maarand
Marcella Cornia
Lorenzo Baraldi
Christopher Kermorvant
Rita Cucchiara
19
7
0
16 Aug 2022
Easter2.0: Improving convolutional models for handwritten text recognition
Kartik Chaudhary
Raghav Bali
28
9
0
30 May 2022
Vision-Language Pre-Training for Boosting Scene Text Detectors
Sibo Song
Jianqiang Wan
Zhibo Yang
Jun Tang
Wenqing Cheng
Xiang Bai
Cong Yao
VLM
39
24
0
29 Apr 2022
DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition
Denis Coquenet
Clément Chatelain
Thierry Paquet
24
57
0
23 Mar 2022
Transformer-based HTR for Historical Documents
Phillip Benjamin Strobel
Simon Clematide
M. Volk
Tobias Hodel
15
10
0
21 Mar 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,777
0
24 Feb 2021