ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.03127
  4. Cited By
ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical
  Handwritten Documents

ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents

6 March 2023
Sana Khamekhem Jemni
Sourour Ammar
Mohamed Ali Souibgui
Yousri Kessentini
A. Cheddad
ArXivPDFHTML

Papers citing "ST-KeyS: Self-Supervised Transformer for Keyword Spotting in Historical Handwritten Documents"

6 / 6 papers shown
Title
A Few Shot Multi-Representation Approach for N-gram Spotting in
  Historical Manuscripts
A Few Shot Multi-Representation Approach for N-gram Spotting in Historical Manuscripts
Giuseppe De Gregorio
Sanket Biswas
Mohamed Ali Souibgui
Asma Bensalah
Josep Lladós
Alicia Fornés
Angelo Marcelli
32
2
0
21 Sep 2022
Reading and Writing: Discriminative and Generative Modeling for
  Self-Supervised Text Recognition
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
33
55
0
01 Jul 2022
DocEnTr: An End-to-End Document Image Enhancement Transformer
DocEnTr: An End-to-End Document Image Enhancement Transformer
Mohamed Ali Souibgui
Sanket Biswas
Sana Khamekhem Jemni
Yousri Kessentini
Alicia Fornés
Josep Lladós
Umapada Pal
ViT
58
45
0
25 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained
  Models
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
98
343
0
21 Sep 2021
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document
  Understanding
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu
Yiheng Xu
Tengchao Lv
Lei Cui
Furu Wei
...
D. Florêncio
Cha Zhang
Wanxiang Che
Min Zhang
Lidong Zhou
ViT
MLLM
153
498
0
29 Dec 2020
1