2307.04245
A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing
9 July 2023
Aishik Rakshit
Samyak Mehta
Anirban Dasgupta
Papers citing "A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing"
7 / 7 papers shown
Title
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
89
159
0
17 Jan 2022
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
246
372
0
21 Sep 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Linting Xue
Aditya Barua
Noah Constant
Rami Al-Rfou
Sharan Narang
Mihir Kale
Adam Roberts
Colin Raffel
104
506
0
28 May 2021
PP-OCR: A Practical Ultra Lightweight OCR System
Yuning Du
Chenxia Li
Ruoyu Guo
Xiaoting Yin
Weiwei Liu
...
Yifan Bai
Zilin Yu
Yehua Yang
Qingqing Dang
Hongya Wang
78
195
0
21 Sep 2020
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
266
10,861
0
29 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
490
20,342
0
23 Oct 2019
Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition
Chee-Kheng Chng
Chee Seng Chan
72
462
0
28 Oct 2017