DTrOCR: Decoder-only Transformer for Optical Character Recognition

30 August 2023

Papers citing "DTrOCR: Decoder-only Transformer for Optical Character Recognition"

26 / 26 papers shown

Title
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation Naphat Nithisopa Teerapong Panboonyuen ViT 26 0 0 07 May 2025
Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition Lei Kang Xuanshuo Fu Lluís Gómez Alicia Fornés Ernest Valveny Dimosthenis Karatzas MU 37 0 0 11 Apr 2025
A Lightweight Multi-Module Fusion Approach for Korean Character Recognition Inho Jake Park Jaehoon Jay Jeong Ho-Sang Jo 33 0 0 08 Apr 2025
Leveraging Contrast Information for Efficient Document Shadow Removal Y. Liu Jiancheng Huang Na Liu Mingfu Yan Yi Huang Shifeng Chen 30 0 0 01 Apr 2025
InkFM: A Foundational Model for Full-Page Online Handwritten Note Understanding Anastasiia Fadeeva Vincent Coriou Diego Antognini C. Musat Andrii Maksai 47 0 0 29 Mar 2025
Practical Fine-Tuning of Autoregressive Models on Limited Handwritten Texts Jan Kohút Michal Hradiš 78 0 0 25 Mar 2025
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model Cheng Yang Yang Sui Jinqi Xiao Lingyi Huang Yu Gong ... Jinghua Yan Y. Bai P. Sadayappan Xia Hu Bo Yuan VLM 53 0 0 24 Mar 2025
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation Andrea Maracani Savas Ozkan Sijun Cho Hyowon Kim Eunchung Noh Jeongwon Min Cho Jung Min Dookun Park Mete Ozay 38 0 0 20 Mar 2025
Handwritten Text Recognition: A Survey Carlos Garrido-Munoz Antonio Ríos-Vila Jorge Calvo-Zaragoza 101 0 0 12 Feb 2025
Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval Libo Wang 111 0 0 24 Jan 2025
DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness Ahmad Mohammadshirazi Pinaki Prasad Guha Neogi Ser-Nam Lim R. Ramnath 70 1 0 29 Nov 2024
HATFormer: Historic Handwritten Arabic Text Recognition with Transformers Adrian Chan Anupam Mijar Mehreen Saeed Chau-Wai Wong Akram Khater 36 0 0 03 Oct 2024
JaPOC: Japanese Post-OCR Correction Benchmark using Vouchers Masato Fujitake 28 0 0 30 Sep 2024
General Detection-based Text Line Recognition Raphael Baena Syrine Kalleli Mathieu Aubry 140 0 0 25 Sep 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer Da Chang Yu Li 64 2 0 19 Apr 2024
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition Carlos Peñarrubia Carlos Garrido-Munoz J. J. Valero-Mas Jorge Calvo-Zaragoza 37 1 0 17 Apr 2024
JSTR: Judgment Improves Scene Text Recognition Masato Fujitake 36 1 0 09 Apr 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model Jiahao Lyu Jin Wei Gangyan Zeng Zeng Li Enze Xie Wei Wang Yu Zhou VLM 27 3 0 15 Mar 2024
Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification V. Khatavkar M. Velankar Sneha Petkar 19 5 0 08 Feb 2024
RL-LOGO: Deep Reinforcement Learning Localization for Logo Recognition Masato Fujitake 24 3 0 28 Dec 2023
TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques Amir Panahandeh Hanie Asemi Esmail Nourani 19 0 0 04 Dec 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition Xiaomeng Yang Zhi Qiao Jin Wei Dongbao Yang Yu Zhou 24 7 0 25 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model Shuai Zhao Xiaohan Wang Linchao Zhu Yezhou Yang CLIP VLM 21 25 0 23 May 2023
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models Minghao Li Tengchao Lv Jingye Chen Lei Cui Yijuan Lu D. Florêncio Cha Zhang Zhoujun Li Furu Wei ViT 98 340 0 21 Sep 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling Leo Gao Stella Biderman Sid Black Laurence Golding Travis Hoppe ... Horace He Anish Thite Noa Nabeshima Shawn Presser Connor Leahy AIMat 253 1,986 0 31 Dec 2020
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images Andreas Veit Tomas Matera Lukás Neumann Jirí Matas Serge J. Belongie 188 515 0 26 Jan 2016