Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion

21 February 2022
Minghui Liao
Zhisheng Zou
Zhaoyi Wan
Cong Yao
X. Bai

Papers citing "Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion"

50 / 66 papers shown
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
49
0
0
09 May 2025
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
Kun Liu
Qi Liu
Xinchen Liu
Jie Li
Yongdong Zhang
Jiebo Luo
Xiaodong He
Wu Liu
VGen
49
0
0
31 Mar 2025
MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion
Saron Samuel
Dan DeGenaro
Jimena Guallar-Blasco
Kate Sanders
Oluwaseun Eisape
...
David Etter
Efsun Kayi
Matthew Wiesner
Kenton W. Murray
Reno Kriz
87
0
0
26 Mar 2025
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
Han Wang
Kai Hu
Liangcai Gao
179
0
0
20 Mar 2025
CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model
Yuxuan Luo
Jiaqi Tang
Chenyi Huang
Feiyang Hao
Zhouhui Lian
VLM
61
0
0
13 Mar 2025
Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
Pei Fu
Tongkun Guan
Zining Wang
Zhentao Guo
Chen Duan
...
Boming Chen
Jiayao Ma
Qianyi Jiang
Kai Zhou
Junfeng Luo
VLM
62
0
0
23 Feb 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
53
3
0
22 Feb 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
48
0
0
10 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
66
0
0
07 Feb 2025
Arabic Handwritten Document OCR Solution with Binarization and Adaptive Scale Fusion Detection
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Ahmed Fares
61
1
0
02 Dec 2024
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
43
0
0
05 Nov 2024
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
40
0
0
25 Sep 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
36
2
0
25 Sep 2024
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Haoran Wei
Chenglong Liu
Jinyue Chen
Jia Wang
Lingyu Kong
...
Liang Zhao
Jianjian Sun
Yuang Peng
Chunrui Han
Xiangyu Zhang
VLM
52
44
0
03 Sep 2024
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Peng Wang
Zhaohai Li
Jun Tang
Humen Zhong
Fei Huang
Zhibo Yang
Cong Yao
VLM
ObjD
40
2
0
27 Aug 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
55
10
0
19 Jul 2024
Detecting Omissions in Geographic Maps through Computer Vision
Phuc D. A. Nguyen
Anh Do
Minh Hoai
35
0
0
15 Jul 2024
Artistic-style text detector and a new Movie-Poster dataset
Aoxiang Ning
Yiting Wei
Minglong Xue
Senming Zhong
36
0
0
24 Jun 2024
SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Xingjian Hu
Baole Wei
Liangcai Gao
Jun Wang
41
0
0
17 Jun 2024
ArMeme: Propagandistic Content in Arabic Memes
Firoj Alam
A. Hasnat
Fatema Ahmed
Md. Arid Hasan
Maram Hasanain
56
7
0
06 Jun 2024
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
65
1
0
30 May 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
48
0
0
29 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
55
1
0
13 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
40
0
0
30 Apr 2024
Seeing Text in the Dark: Algorithm and Benchmark
Chengpei Xu
Hao Fu
Long Ma
Wenjing Jia
Chengqi Zhang
Feng Xia
Xiaoyu Ai
Binghao Li
Wenjie Zhang
40
13
0
13 Apr 2024
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
Chuwei Luo
Yufan Shen
Zhaoqing Zhu
Qi Zheng
Zhi Yu
Cong Yao
37
39
0
08 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
35
3
0
06 Apr 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
34
0
0
01 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
51
28
0
28 Mar 2024
Self-supervised co-salient object detection via feature correspondence at multiple scales
Souradeep Chakraborty
Dimitris Samaras
40
2
0
17 Mar 2024
Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis
Alireza Hosseini
Kiana Hooshanfar
Pouria Omrani
Reza Toosi
Ramin Toosi
Zahra Ebrahimian
M. Akhaee
41
4
0
04 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
53
5
0
01 Mar 2024
CPN: Complementary Proposal Network for Unconstrained Text Detection
Longhuang Wu
Shangxuan Tian
Youxin Wang
Pengfei Xiong
37
0
0
18 Feb 2024
EK-Net:Real-time Scene Text Detection with Expand Kernel Distance
Boyuan Zhu
Fagui Liu
Xi Chen
Quan Tang
27
1
0
22 Jan 2024
Text Region Multiple Information Perception Network for Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
36
0
0
18 Jan 2024
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
35
1
0
18 Jan 2024
Watermark Text Pattern Spotting in Document Images
Mateusz Krubiński
Stefan Matcovici
Diana Grigore
Daniel Voinea
A. Popa
WaLM
17
2
0
10 Jan 2024
Research on Multilingual Natural Scene Text Detection Algorithm
Tao Wang
35
0
0
18 Dec 2023
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Tongkun Guan
Wei Shen
Xuehang Yang
Xuehui Wang
Xiaokang Yang
47
7
0
08 Dec 2023
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
48
5
0
29 Nov 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
28
5
0
25 Oct 2023
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus
Haoxu Wang
Fan Yu
Xian Shi
Yuezhang Wang
Shiliang Zhang
Ming Li
37
11
0
11 Sep 2023
Selective Scene Text Removal
Hayato Mitani
Akisato Kimura
Seiichi Uchida
29
1
0
01 Sep 2023
PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer
Ruijin Liu
Ning Lu
Dapeng Chen
Cheng Li
Zejian Yuan
Wei Peng
34
2
0
29 Aug 2023
MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild
Yu Zeng
J. Hsieh
Xuzhao Li
Ming-Ching Chang
37
8
0
23 Aug 2023
Turning a CLIP Model into a Scene Text Spotter
Wenwen Yu
Yuliang Liu
Xingkui Zhu
H. Cao
Xing Sun
Xiang Bai
VLM
CLIP
24
12
0
21 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
38
27
0
20 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Yu Zhou
Kun Yao
Peng Zhang
Hailun Lin
Weiping Wang
47
12
0
14 Aug 2023
Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition
Jiacheng Deng
Li Dong
Jiahao Chen
Diqun Yan
Rangding Wang
Dengpan Ye
Lingchen Zhao
Jinyu Tian
27
1
0
04 Aug 2023
CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer
Zhiwen Shao
Yuchen Su
Yong Zhou
Fanrong Meng
Hancheng Zhu
Bing-Quan Liu
Rui Yao
19
9
0
25 Jul 2023