ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.17674
  4. Cited By
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

25 October 2023
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
ArXiv (abs)PDFHTML

Papers citing "Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis"

50 / 59 papers shown
Title
ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
VLM
106
20
0
16 May 2023
Towards Unified Scene Text Spotting based on Sequence Generation
Towards Unified Scene Text Spotting based on Sequence Generation
Taeho Kil
Seonghyeon Kim
Sukmin Seo
Yoon Kim
Daehee Kim
109
20
0
07 Apr 2023
Levenshtein OCR
Levenshtein OCR
Cheng Da
Peng Wang
Cong Yao
ViT
115
32
0
08 Sep 2022
Multi-Granularity Prediction for Scene Text Recognition
Multi-Granularity Prediction for Scene Text Recognition
Peng Wang
Cheng Da
Cong Yao
121
48
0
08 Sep 2022
GLASS: Global to Local Attention for Scene-Text Spotting
GLASS: Global to Local Attention for Scene-Text Spotting
Roi Ronen
Shahar Tsiper
Oron Anschel
I. Lavi
Amir Markovitz
R. Manmatha
50
44
0
05 Aug 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
102
173
0
14 Jul 2022
DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in
  Transformer
DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Bo Du
Dacheng Tao
ViT
80
76
0
10 Jul 2022
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
B. Pfitzmann
Christoph Auer
Michele Dolfi
A. Nassar
Peter W. J. Staar
69
88
0
02 Jun 2022
LayoutLMv3: Pre-training for Document AI with Unified Text and Image
  Masking
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Yupan Huang
Tengchao Lv
Lei Cui
Yutong Lu
Furu Wei
95
458
0
18 Apr 2022
Text Spotting Transformers
Text Spotting Transformers
Xiang Zhang
Yongwen Su
Subarna Tripathi
Zhuowen Tu
ViT
89
95
0
05 Apr 2022
Few Could Be Better Than All: Feature Sampling and Grouping for Scene
  Text Detection
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection
Jixin Tang
Wenqing Zhang
Hong-yi Liu
Mingkun Yang
Ziwei He
Guan-Nan Hu
Xiang Bai
ViT
49
67
0
29 Mar 2022
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
70
97
0
28 Mar 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document
  Information Extraction
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Chen-Yu Lee
Chun-Liang Li
Timothy Dozat
Vincent Perot
Guolong Su
Nan Hua
Joshua Ainslie
Renshen Wang
Yasuhisa Fujii
Tomas Pfister
73
79
0
16 Mar 2022
Arbitrary Shape Text Detection using Transformers
Arbitrary Shape Text Detection using Transformers
Z. Raisi
Georges Younes
John S. Zelek
ViT
69
13
0
22 Feb 2022
Real-Time Scene Text Detection with Differentiable Binarization and
  Adaptive Scale Fusion
Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
Minghui Liao
Zhisheng Zou
Zhaoyi Wan
Cong Yao
X. Bai
88
237
0
21 Feb 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
50
55
0
11 Feb 2022
SPTS: Single-Point Text Spotting
SPTS: Single-Point Text Spotting
Dezhi Peng
Xinyu Wang
Yuliang Liu
Jiaxin Zhang
Mingxin Huang
...
Jing Li
Dahua Lin
Chunhua Shen
Xiang Bai
Lianwen Jin
ViT
94
65
0
15 Dec 2021
Multi-modal Text Recognition Networks: Interactive Enhancements between
  Visual and Semantic Features
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Byeonghu Na
Yoonsik Kim
Sungrae Park
70
54
0
30 Nov 2021
SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text
  Recognition Models
SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text Recognition Models
Moonbin Yim
Yoonsik Kim
Han-Cheol Cho
Sungrae Park
45
52
0
20 Jul 2021
Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Ilya Krylov
S. Nosov
V. Sovrasov
VLM
68
54
0
23 Jun 2021
StructuralLM: Structural Pre-training for Form Understanding
StructuralLM: Structural Pre-training for Form Understanding
Chenliang Li
Bin Bi
Ming Yan
Wei Wang
Songfang Huang
Fei Huang
Luo Si
LMTDAI4CE
88
134
0
24 May 2021
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped
  scene text
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Amanpreet Singh
Guan Pang
Mandy Toh
Jing Huang
Wojciech Galuba
Tal Hassner
64
174
0
12 May 2021
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
Oren Nuriel
Sharon Fogel
Ron Litman
50
9
0
09 May 2021
ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text
  Spotting
ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting
Yuliang Liu
Chunhua Shen
Lianwen Jin
Tong He
Peng Chen
Chongyu Liu
Hao Chen
104
142
0
08 May 2021
Post-OCR Paragraph Recognition by Graph Convolutional Networks
Post-OCR Paragraph Recognition by Graph Convolutional Networks
Renshen Wang
Yasuhisa Fujii
Ashok Popat
GNN
73
20
0
29 Jan 2021
MANGO: A Mask Attention Guided One-Stage Scene Text Spotter
MANGO: A Mask Attention Guided One-Stage Scene Text Spotter
Liang Qiao
Ying-Cong Chen
Zhanzhan Cheng
Yunlu Xu
Yi Niu
Shiliang Pu
Leilei Gan
69
77
0
08 Dec 2020
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
128
531
0
01 Dec 2020
Character Region Attention For Text Spotting
Character Region Attention For Text Spotting
Youngmin Baek
Seung Shin
Jeonghun Baek
Sungrae Park
Junyeop Lee
Daehyun Nam
Hwalsuk Lee
82
72
0
19 Jul 2020
Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text
  Spotting
Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting
Minghui Liao
Guan Pang
Jing Huang
Tal Hassner
X. Bai
84
184
0
18 Jul 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT3DVPINN
434
13,108
0
26 May 2020
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal
  World
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
Shangbang Long
Cong Yao
117
67
0
24 Mar 2020
A New Perspective for Flexible Feature Gathering in Scene Text
  Recognition Via Character Anchor Pooling
A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling
Shangbang Long
Yushuo Guan
Kaigui Bian
Cong Yao
73
13
0
10 Feb 2020
Convolutional Character Networks
Convolutional Character Networks
Linjie Xing
Zhi Tian
Weilin Huang
Matthew R. Scott
118
159
0
17 Oct 2019
RandAugment: Practical automated data augmentation with a reduced search
  space
RandAugment: Practical automated data augmentation with a reduced search space
E. D. Cubuk
Barret Zoph
Jonathon Shlens
Quoc V. Le
MQ
258
3,502
0
30 Sep 2019
Rethinking Irregular Scene Text Recognition
Rethinking Irregular Scene Text Recognition
Shangbang Long
Yushuo Guan
Bingxuan Wang
Kaigui Bian
Cong Yao
63
8
0
30 Aug 2019
Towards Unconstrained End-to-End Text Spotting
Towards Unconstrained End-to-End Text Spotting
Siyang Qin
Alessandro Bissacco
Michalis Raptis
Yasuhisa Fujii
Y. Xiao
58
130
0
24 Aug 2019
PubLayNet: largest dataset ever for document layout analysis
PubLayNet: largest dataset ever for document layout analysis
Xu Zhong
Jianbin Tang
Antonio Jimeno Yepes
52
461
0
16 Aug 2019
Scene Text Visual Question Answering
Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Ernest Valveny
C. V. Jawahar
Dimosthenis Karatzas
111
360
0
31 May 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
168
370
0
27 May 2019
Towards VQA Models That Can Read
Towards VQA Models That Can Read
Amanpreet Singh
Vivek Natarajan
Meet Shah
Yu Jiang
Xinlei Chen
Dhruv Batra
Devi Parikh
Marcus Rohrbach
EgoV
111
1,253
0
18 Apr 2019
Character Region Awareness for Text Detection
Character Region Awareness for Text Detection
Youngmin Baek
Bado Lee
Dongyoon Han
Sangdoo Yun
Hwalsuk Lee
64
785
0
03 Apr 2019
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
154
4,180
0
25 Feb 2019
Scene Text Detection and Recognition: The Deep Learning Era
Scene Text Detection and Recognition: The Deep Learning Era
Shangbang Long
Xin He
Cong Yao
VLM
113
398
0
10 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,175
0
11 Oct 2018
Scene Text Recognition from Two-Dimensional Perspective
Scene Text Recognition from Two-Dimensional Perspective
Minghui Liao
Jian Zhang
Zhaoyi Wan
Fengming Xie
Jiajun Liang
Pengyuan Lyu
Cong Yao
X. Bai
3DV
79
234
0
18 Sep 2018
TextSnake: A Flexible Representation for Detecting Text of Arbitrary
  Shapes
TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes
Shangbang Long
Jiaqiang Ruan
Weinan Zhang
Xin He
Wenhao Wu
Cong Yao
78
514
0
04 Jul 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
204
19,333
0
13 Jan 2018
FOTS: Fast Oriented Text Spotting with a Unified Network
FOTS: Fast Oriented Text Spotting with a Unified Network
Xuebo Liu
Ding Liang
Shipeng Yan
Dagui Chen
Yu Qiao
Junjie Yan
ObjD
81
500
0
05 Jan 2018
SEE: Towards Semi-Supervised End-to-End Scene Text Recognition
SEE: Towards Semi-Supervised End-to-End Scene Text Recognition
Christian Bartz
Haojin Yang
Christoph Meinel
65
65
0
14 Dec 2017
Detecting Curve Text in the Wild: New Dataset and New Solution
Detecting Curve Text in the Wild: New Dataset and New Solution
Liu Yuliang
Jin Lianwen
Shuaitao Zhang
Sheng Zhang
78
254
0
06 Dec 2017
12
Next