Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.03155
Cited By
EAST: An Efficient and Accurate Scene Text Detector
11 April 2017
Xinyu Zhou
Cong Yao
He Wen
Yuzhi Wang
Shuchang Zhou
Weiran He
Jiajun Liang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"EAST: An Efficient and Accurate Scene Text Detector"
50 / 405 papers shown
Title
OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery
Chongsheng Zhang
Shuwen Wu
Yingqi Chen
Matthias Aßenmacher
Christian Heumann
Yi Men
Gaojuan Fan
Joao Gama
22
0
0
04 May 2025
GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling
Siqi Li
Yufan Shen
Xiangnan Chen
Jiayi Chen
Hengwei Ju
...
Licheng Wen
Botian Shi
Y. Liu
Xinyu Cai
Yu Qiao
VLM
ELM
91
0
0
30 Apr 2025
XY-Cut++: Advanced Layout Ordering via Hierarchical Mask Mechanism on a Novel Benchmark
Shuai Liu
Youmeng Li
Jizeng Wei
33
0
0
14 Apr 2025
Edge Approximation Text Detector
Chuang Yang
Xu Han
T. Han
Han Han
Bingxuan Zhao
Qi Wang
43
0
0
05 Apr 2025
VISTA-OCR: Towards generative and interactive end to end OCR models
Laziz Hamdi
Amine Tamasna
Pascal Boisson
Thierry Paquet
44
0
0
04 Apr 2025
Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution
Yunquan Gao
Zhiguo Zhang
Praveen Kumar Donta
C. Dehury
Xinbing Wang
Dusit Niyato
Qiyang Zhang
41
0
0
27 Mar 2025
Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection
Yi Yu
Xue Yang
Yansheng Li
Zhenjun Han
Feipeng Da
Junchi Yan
71
0
0
13 Feb 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
48
0
0
10 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
61
0
0
07 Feb 2025
Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
Yi Yu
Botao Ren
Peiyuan Zhang
Mingxin Liu
Junwei Luo
Shaofeng Zhang
Feipeng Da
Junchi Yan
Xue Yang
3DPC
125
1
0
06 Feb 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Z. Yang
P. Wang
Junyang Lin
Xinyu Wang
Wenyu Liu
DiffM
45
0
0
08 Jan 2025
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
40
0
0
05 Nov 2024
Context-Based Visual-Language Place Recognition
Soojin Woo
Seong-Woo Kim
24
0
0
25 Oct 2024
Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant
A. S. Penamakuri
Anand Mishra
26
1
0
24 Oct 2024
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
25
0
0
25 Sep 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
34
2
0
25 Sep 2024
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Haoran Wei
Chenglong Liu
Jinyue Chen
Jia Wang
Lingyu Kong
...
Liang Zhao
Jianjian Sun
Yuang Peng
Chunrui Han
Xiangyu Zhang
VLM
46
41
0
03 Sep 2024
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Peng Wang
Zhaohai Li
Jun Tang
Humen Zhong
Fei Huang
Zhibo Yang
Cong Yao
VLM
ObjD
40
2
0
27 Aug 2024
A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot
Haoxuan Ding
Qi. Wang
Junyu Gao
Qiang Li
VLM
37
0
0
11 Aug 2024
EAFormer: Scene Text Segmentation with Edge-Aware Transformers
Haiyang Yu
Teng Fu
Bin Li
Xiangyang Xue
20
2
0
24 Jul 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
50
10
0
19 Jul 2024
Emerging Practices for Large Multimodal Model (LMM) Assistance for People with Visual Impairments: Implications for Design
Jingyi Xie
Rui Yu
He Zhang
Sooyeon Lee
Syed Masum Billah
John M. Carroll
40
10
0
11 Jul 2024
Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports
Yanfu Yan
Nathan Cooper
Oscar Chaparro
Kevin Moran
Denys Poshyvanyk
43
5
0
11 Jul 2024
Artistic-style text detector and a new Movie-Poster dataset
Aoxiang Ning
Yiting Wei
Minglong Xue
Senming Zhong
36
0
0
24 Jun 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Derek F. Wong
Xiaoshuai Sun
Rongrong Ji
VLM
40
1
0
17 Jun 2024
SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Xingjian Hu
Baole Wei
Liangcai Gao
Jun Wang
33
0
0
17 Jun 2024
BD-SAT: High-resolution Land Use Land Cover Dataset & Benchmark Results for Developing Division: Dhaka, BD
Ovi Paul
Abu Bakar Siddik Nayem
Anis Sarker
Amin Ahsan Ali
M Ashraful Amin
AKM Mahbubur Rahman
27
0
0
09 Jun 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
45
8
0
03 Jun 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
48
0
0
29 May 2024
A better approach to diagnose retinal diseases: Combining our Segmentation-based Vascular Enhancement with deep learning features
Yuzhuo Chen
Zetong Chen
Yuanyuan Liu
31
0
0
25 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
37
0
0
30 Apr 2024
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
Chuwei Luo
Yufan Shen
Zhaoqing Zhu
Qi Zheng
Zhi Yu
Cong Yao
31
38
0
08 Apr 2024
AURORA: Navigating UI Tarpits via Automated Neural Screen Understanding
Safwat Ali Khan
Wenyu Wang
Yiran Ren
Bin Zhu
Jiangfan Shi
Alyssa McGowan
Wing Lam
Kevin Moran
37
1
0
01 Apr 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
34
0
0
01 Apr 2024
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
Jianqiang Wan
Sibo Song
Wenwen Yu
Yuliang Liu
Wenqing Cheng
Fei Huang
Xiang Bai
Cong Yao
Zhibo Yang
51
26
0
28 Mar 2024
Visually Guided Generative Text-Layout Pre-training for Document Intelligence
Zhiming Mao
Haoli Bai
Lu Hou
Jiansheng Wei
Xin Jiang
Qun Liu
Kam-Fai Wong
32
8
0
25 Mar 2024
LOCR: Location-Guided Transformer for Optical Character Recognition
Yu Sun
Dongzhan Zhou
Chen Lin
Conghui He
Wanli Ouyang
Han-Sen Zhong
40
1
0
04 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
46
5
0
01 Mar 2024
Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing
Jacob Tyo
Motolani Olarinre
Youngseog Chung
Zachary Chase Lipton
38
0
0
12 Feb 2024
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu
Weichao Zeng
Zhenhang Li
Fangmin Zhao
Yu Zhou
32
3
0
05 Feb 2024
Text Region Multiple Information Perception Network for Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
31
0
0
18 Jan 2024
Dynamic Relation Transformer for Contextual Text Block Detection
Jiawei Wang
Shunchi Zhang
Kai Hu
Chixiang Ma
Zhuoyao Zhong
Lei-huan Sun
Qiang Huo
32
0
0
17 Jan 2024
SamLP: A Customized Segment Anything Model for License Plate Detection
Haoxuan Ding
Junyuan Gao
Yuan. Yuan
Qi. Wang
MLLM
VLM
34
7
0
12 Jan 2024
GloTSFormer: Global Video Text Spotting Transformer
Hang Wang
Yanjie Wang
Yang Li
Can Huang
37
0
0
08 Jan 2024
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models
Yaojia Lv
Haojie Pan
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
30
5
0
06 Jan 2024
Word length-aware text spotting: Enhancing detection and recognition in dense text image
Hao Wang
Huabing Zhou
Yanduo Zhang
Tao Lu
Jiayi Ma
35
1
0
25 Dec 2023
Progressive Evolution from Single-Point to Polygon for Scene Text
Linger Deng
Mingxin Huang
Xudong Xie
Yuliang Liu
Lianwen Jin
Xiang Bai
31
1
0
21 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Research on Multilingual Natural Scene Text Detection Algorithm
Tao Wang
29
0
0
18 Dec 2023
Edge Wasserstein Distance Loss for Oriented Object Detection
Yuke Zhu
Yumeng Ruan
Zihua Xiong
Sheng Guo
35
0
0
12 Dec 2023
1
2
3
4
5
6
7
8
9
Next