arXiv:2109.05952
Cited By
Adapting the Tesseract Open-Source OCR Engine for Tamil and Sinhala Legacy Fonts and Creating a Parallel Corpus for Tamil-Sinhala-English
13 September 2021
Charangan Vasantharajan
Laksika Tharmalingam
Uthayasanker Thayasivam
VLM
Papers citing "Adapting the Tesseract Open-Source OCR Engine for Tamil and Sinhala Legacy Fonts and Creating a Parallel Corpus for Tamil-Sinhala-English"
3 / 3 papers shown
Title
Sinhala-English Word Embedding Alignment: Introducing Datasets and Benchmark for a Low Resource Language
Kasun Wickramasinghe
Nisansa de Silva
19
0
0
17 Nov 2023
Challenges of language technologies for the indigenous languages of the Americas
Manuel Mager
Ximena Gutierrez-Vasques
Gerardo E Sierra
Ivan Vladimir Meza Ruiz
VLM
194
88
0
12 Jun 2018
Enhanced Techniques for PDF Image Segmentation and Text Extraction
D. Sasirekha
Dr. E. Chandra
26
22
0
01 Oct 2012