ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.15996
  4. Cited By
DTrOCR: Decoder-only Transformer for Optical Character Recognition

DTrOCR: Decoder-only Transformer for Optical Character Recognition

30 August 2023
Masato Fujitake
ArXivPDFHTML

Papers citing "DTrOCR: Decoder-only Transformer for Optical Character Recognition"

26 / 26 papers shown
Title
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
26
0
0
07 May 2025
Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition
Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition
Lei Kang
Xuanshuo Fu
Lluís Gómez
Alicia Fornés
Ernest Valveny
Dimosthenis Karatzas
MU
37
0
0
11 Apr 2025
A Lightweight Multi-Module Fusion Approach for Korean Character Recognition
A Lightweight Multi-Module Fusion Approach for Korean Character Recognition
Inho Jake Park
Jaehoon Jay Jeong
Ho-Sang Jo
33
0
0
08 Apr 2025
Leveraging Contrast Information for Efficient Document Shadow Removal
Leveraging Contrast Information for Efficient Document Shadow Removal
Y. Liu
Jiancheng Huang
Na Liu
Mingfu Yan
Yi Huang
Shifeng Chen
30
0
0
01 Apr 2025
InkFM: A Foundational Model for Full-Page Online Handwritten Note Understanding
InkFM: A Foundational Model for Full-Page Online Handwritten Note Understanding
Anastasiia Fadeeva
Vincent Coriou
Diego Antognini
C. Musat
Andrii Maksai
47
0
0
29 Mar 2025
Practical Fine-Tuning of Autoregressive Models on Limited Handwritten Texts
Practical Fine-Tuning of Autoregressive Models on Limited Handwritten Texts
Jan Kohút
Michal Hradiš
78
0
0
25 Mar 2025
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
Cheng Yang
Yang Sui
Jinqi Xiao
Lingyi Huang
Yu Gong
...
Jinghua Yan
Y. Bai
P. Sadayappan
Xia Hu
Bo Yuan
VLM
53
0
0
24 Mar 2025
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Andrea Maracani
Savas Ozkan
Sijun Cho
Hyowon Kim
Eunchung Noh
Jeongwon Min
Cho Jung Min
Dookun Park
Mete Ozay
38
0
0
20 Mar 2025
Handwritten Text Recognition: A Survey
Handwritten Text Recognition: A Survey
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
101
0
0
12 Feb 2025
Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval
Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval
Libo Wang
111
0
0
24 Jan 2025
DLaVA: Document Language and Vision Assistant for Answer Localization
  with Enhanced Interpretability and Trustworthiness
DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Ser-Nam Lim
R. Ramnath
70
1
0
29 Nov 2024
HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
HATFormer: Historic Handwritten Arabic Text Recognition with Transformers
Adrian Chan
Anupam Mijar
Mehreen Saeed
Chau-Wai Wong
Akram Khater
36
0
0
03 Oct 2024
JaPOC: Japanese Post-OCR Correction Benchmark using Vouchers
JaPOC: Japanese Post-OCR Correction Benchmark using Vouchers
Masato Fujitake
28
0
0
30 Sep 2024
General Detection-based Text Line Recognition
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
140
0
0
25 Sep 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
64
2
0
19 Apr 2024
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Carlos Peñarrubia
Carlos Garrido-Munoz
J. J. Valero-Mas
Jorge Calvo-Zaragoza
37
1
0
17 Apr 2024
JSTR: Judgment Improves Scene Text Recognition
JSTR: Judgment Improves Scene Text Recognition
Masato Fujitake
36
1
0
09 Apr 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with
  Pre-trained Language Model
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
27
3
0
15 Mar 2024
Segmentation-free Connectionist Temporal Classification loss based OCR
  Model for Text Captcha Classification
Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification
V. Khatavkar
M. Velankar
Sneha Petkar
19
5
0
08 Feb 2024
RL-LOGO: Deep Reinforcement Learning Localization for Logo Recognition
RL-LOGO: Deep Reinforcement Learning Localization for Logo Recognition
Masato Fujitake
24
3
0
28 Dec 2023
TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and
  Advanced Decoding Techniques
TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques
Amir Panahandeh
Hanie Asemi
Esmail Nourani
19
0
0
04 Dec 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
24
7
0
25 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained
  Vision-Language Model
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
21
25
0
23 May 2023
TrOCR: Transformer-based Optical Character Recognition with Pre-trained
  Models
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
98
340
0
21 Sep 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
253
1,986
0
31 Dec 2020
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in
  Natural Images
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
188
515
0
26 Jan 2016
1