Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.10304
Cited By
Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
21 February 2022
Minghui Liao
Zhisheng Zou
Zhaoyi Wan
Cong Yao
X. Bai
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion"
50 / 66 papers shown
Title
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
43
0
0
09 May 2025
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
Kun Liu
Qi Liu
Xinchen Liu
Jie Li
Yongdong Zhang
Jiebo Luo
Xiaodong He
Wu Liu
VGen
40
0
0
31 Mar 2025
MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion
Saron Samuel
Dan DeGenaro
Jimena Guallar-Blasco
Kate Sanders
Oluwaseun Eisape
...
David Etter
Efsun Kayi
Matthew Wiesner
Kenton W. Murray
Reno Kriz
85
0
0
26 Mar 2025
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
Han Wang
Kai Hu
Liangcai Gao
176
0
0
20 Mar 2025
CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model
Yuxuan Luo
Jiaqi Tang
Chenyi Huang
Feiyang Hao
Zhouhui Lian
VLM
61
0
0
13 Mar 2025
Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Pei Fu
Tongkun Guan
Zining Wang
Zhentao Guo
Chen Duan
...
Boming Chen
Jiayao Ma
Qianyi Jiang
Kai Zhou
Junfeng Luo
VLM
62
0
0
23 Feb 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Y. Liu
Xiang Bai
53
2
0
22 Feb 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
48
0
0
10 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
66
0
0
07 Feb 2025
Arabic Handwritten Document OCR Solution with Binarization and Adaptive Scale Fusion Detection
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Ahmed Fares
59
1
0
02 Dec 2024
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
43
0
0
05 Nov 2024
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
30
0
0
25 Sep 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
34
2
0
25 Sep 2024
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Haoran Wei
Chenglong Liu
Jinyue Chen
Jia Wang
Lingyu Kong
...
Liang Zhao
Jianjian Sun
Yuang Peng
Chunrui Han
Xiangyu Zhang
VLM
52
44
0
03 Sep 2024
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Peng Wang
Zhaohai Li
Jun Tang
Humen Zhong
Fei Huang
Zhibo Yang
Cong Yao
VLM
ObjD
40
2
0
27 Aug 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
53
10
0
19 Jul 2024
Detecting Omissions in Geographic Maps through Computer Vision
Phuc D. A. Nguyen
Anh Do
Minh Hoai
30
0
0
15 Jul 2024
Artistic-style text detector and a new Movie-Poster dataset
Aoxiang Ning
Yiting Wei
Minglong Xue
Senming Zhong
36
0
0
24 Jun 2024
SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Xingjian Hu
Baole Wei
Liangcai Gao
Jun Wang
41
0
0
17 Jun 2024
ArMeme: Propagandistic Content in Arabic Memes
Firoj Alam
A. Hasnat
Fatema Ahmed
Md. Arid Hasan
Maram Hasanain
56
7
0
06 Jun 2024
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
65
1
0
30 May 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
48
0
0
29 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
55
1
0
13 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
40
0
0
30 Apr 2024
Seeing Text in the Dark: Algorithm and Benchmark
Chengpei Xu
Hao Fu
Long Ma
Wenjing Jia
Chengqi Zhang
Feng Xia
Xiaoyu Ai
Binghao Li
Wenjie Zhang
40
13
0
13 Apr 2024
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
Chuwei Luo
Yufan Shen
Zhaoqing Zhu
Qi Zheng
Zhi Yu
Cong Yao
34
39
0
08 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
35
3
0
06 Apr 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
34
0
0
01 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
51
27
0
28 Mar 2024
Self-supervised co-salient object detection via feature correspondence at multiple scales
Souradeep Chakraborty
Dimitris Samaras
40
2
0
17 Mar 2024
Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis
Alireza Hosseini
Kiana Hooshanfar
Pouria Omrani
Reza Toosi
Ramin Toosi
Zahra Ebrahimian
M. Akhaee
41
4
0
04 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
50
5
0
01 Mar 2024
CPN: Complementary Proposal Network for Unconstrained Text Detection
Longhuang Wu
Shangxuan Tian
Youxin Wang
Pengfei Xiong
37
0
0
18 Feb 2024
EK-Net:Real-time Scene Text Detection with Expand Kernel Distance
Boyuan Zhu
Fagui Liu
Xi Chen
Quan Tang
27
1
0
22 Jan 2024
Text Region Multiple Information Perception Network for Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
31
0
0
18 Jan 2024
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
35
1
0
18 Jan 2024
Watermark Text Pattern Spotting in Document Images
Mateusz Krubiński
Stefan Matcovici
Diana Grigore
Daniel Voinea
A. Popa
WaLM
17
2
0
10 Jan 2024
Research on Multilingual Natural Scene Text Detection Algorithm
Tao Wang
35
0
0
18 Dec 2023
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Tongkun Guan
Wei Shen
Xuehang Yang
Xuehui Wang
Xiaokang Yang
36
7
0
08 Dec 2023
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
43
5
0
29 Nov 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
26
5
0
25 Oct 2023
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus
Haoxu Wang
Fan Yu
Xian Shi
Yuezhang Wang
Shiliang Zhang
Ming Li
29
11
0
11 Sep 2023
Selective Scene Text Removal
Hayato Mitani
Akisato Kimura
Seiichi Uchida
29
1
0
01 Sep 2023
PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer
Ruijin Liu
Ning Lu
Dapeng Chen
Cheng Li
Zejian Yuan
Wei Peng
34
2
0
29 Aug 2023
MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild
Yu Zeng
J. Hsieh
Xuzhao Li
Ming-Ching Chang
37
8
0
23 Aug 2023
Turning a CLIP Model into a Scene Text Spotter
Wenwen Yu
Yuliang Liu
Xingkui Zhu
H. Cao
Xing Sun
Xiang Bai
VLM
CLIP
24
12
0
21 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
38
27
0
20 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Yu Zhou
Kun Yao
Peng-Zhen Zhang
Hailun Lin
Weiping Wang
42
12
0
14 Aug 2023
Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition
Jiacheng Deng
Li Dong
Jiahao Chen
Diqun Yan
Rangding Wang
Dengpan Ye
Lingchen Zhao
Jinyu Tian
27
1
0
04 Aug 2023
CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer
Zhiwen Shao
Yuchen Su
Yong Zhou
Fanrong Meng
Hancheng Zhu
Bing-Quan Liu
Rui Yao
19
9
0
25 Jul 2023
1
2
Next