arXiv:2202.05508
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer

11 February 2022
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
    ViT

Papers citing "Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer"

36 / 36 papers shown
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
Dongliang Luo
Hanshen Zhu
Ziyang Zhang
Dingkang Liang
Xudong Xie
Y. Liu
Xiang Bai
VLM
37
0
0
14 Apr 2025
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
Ritabrata Chakraborty
Shivakumara Palaiahnakote
Umapada Pal
Cheng-Lin Liu
VLM
47
0
0
19 Mar 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Y. Liu
Xiang Bai
46
1
0
22 Feb 2025
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction
Rujiao Long
Pengfei Wang
Zhibo Yang
Cong Yao
39
0
0
02 Nov 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
...
Ziqiang Cao
Larry Head
Yue Zhang
Jielei Zhang
Huyang Sun
DiffM
28
5
0
01 Aug 2024
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
Jingjing Wu
Zhengyao Fang
Pengyuan Lyu
Chengquan Zhang
Fanglin Chen
Guangming Lu
Wenjie Pei
50
2
0
28 Jul 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
32
0
0
30 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
35
3
0
06 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
43
26
0
28 Mar 2024
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Yuliang Liu
Biao Yang
Qiang Liu
Zhang Li
Zhiyin Ma
Shuo Zhang
Xiang Bai
MLLM
VLM
41
87
0
07 Mar 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
47
1
0
27 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
35
10
0
31 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
72
1
0
15 Jan 2024
GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
Haibin He
Maoyuan Ye
Jing Zhang
Juhua Liu
Dacheng Tao
VLM
41
3
0
13 Jan 2024
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling
Shi-Xue Zhang
Chun Yang
Xiaobin Zhu
Hongyang Zhou
Hongfa Wang
Xu-Cheng Yin
28
6
0
08 Jan 2024
Word length-aware text spotting: Enhancing detection and recognition in dense text image
Hao Wang
Huabing Zhou
Yanduo Zhang
Tao Lu
Jiayi Ma
30
1
0
25 Dec 2023
Progressive Evolution from Single-Point to Polygon for Scene Text
Linger Deng
Mingxin Huang
Xudong Xie
Yuliang Liu
Lianwen Jin
Xiang Bai
29
1
0
21 Dec 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
22
5
0
25 Oct 2023
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
Daehee Kim
Yoon Kim
Donghyun Kim
Yumin Lim
Geewook Kim
Taeho Kil
23
3
0
21 Sep 2023
STEP -- Towards Structured Scene-Text Spotting
Sergi Garcia-Bordils
Dimosthenis Karatzas
Marçal Rusiñol
24
2
0
05 Sep 2023
Turning a CLIP Model into a Scene Text Spotter
Wenwen Yu
Yuliang Liu
Xingkui Zhu
H. Cao
Xing Sun
Xiang Bai
VLM
CLIP
19
12
0
21 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
29
26
0
20 Aug 2023
Weakly supervised information extraction from inscrutable handwritten document images
S. Paul
Gagan Madan
Akankshya Mishra
N. Hegde
Pradeep Kumar
Gaurav Aggarwal
MedIm
19
3
0
12 Jun 2023
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
38
2
0
31 May 2023
ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
VLM
37
17
0
16 May 2023
Towards Unified Scene Text Spotting based on Sequence Generation
Taeho Kil
Seonghyeon Kim
Sukmin Seo
Yoon Kim
Daehee Kim
70
20
0
07 Apr 2023
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Aviad Aberdam
David Bensaid
Alona Golts
Roy Ganz
Oren Nuriel
Royee Tichauer
Shai Mazor
Ron Litman
VLM
CLIP
24
12
0
18 Jan 2023
Towards Models that Can See and Read
Roy Ganz
Oren Nuriel
Aviad Aberdam
Yair Kittenplon
Shai Mazor
Ron Litman
16
13
0
18 Jan 2023
SPTS v2: Single-Point Scene Text Spotting
Yuliang Liu
Jiaxin Zhang
Dezhi Peng
Mingxin Huang
Xinyu Wang
...
Can Huang
Dahua Lin
Chunhua Shen
Xiang Bai
Lianwen Jin
VLM
21
49
0
04 Jan 2023
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
36
70
0
19 Nov 2022
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
11
16
0
14 Sep 2022
Single Shot Self-Reliant Scene Text Spotter by Decoupled yet Collaborative Detection and Recognition
Jingjing Wu
Pengyuan Lyu
Guangming Lu
Chengquan Zhang
Wenjie Pei
15
3
0
15 Jul 2022
Text Detection & Recognition in the Wild for Robot Localization
Z. Raisi
John S. Zelek
16
0
0
17 May 2022
Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting
Chuhui Xue
Wenqing Zhang
Yu Hao
Shijian Lu
Philip H. S. Torr
Song Bai
VLM
32
31
0
08 Mar 2022
Convolutional Character Networks
Linjie Xing
Zhi Tian
Weilin Huang
Matthew R. Scott
52
157
0
17 Oct 2019
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukáš Neumann
Jiří Matas
Serge J. Belongie
185
515
0
26 Jan 2016