Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.08719
Cited By
v1
v2 (latest)
M
6
^{6}
6
Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis
15 May 2023
Hiuyi Cheng
Pei-yu Zhang
Sihang Wu
Jiaxin Zhang
Qi Zhu
Zecheng Xie
Jing Li
Kai Ding
Lianwen Jin
Re-assign community
ArXiv (abs)
PDF
HTML
Github (126★)
Papers citing
"M$^{6}$Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis"
23 / 23 papers shown
Title
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Qiang Huo
101
0
0
20 Mar 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Linke Ouyang
Yuan Qu
Hongbin Zhou
Jiawei Zhu
Rui Zhang
...
Chao Xu
Bo Zhang
Botian Shi
Zhongying Tu
Zeang Sheng
151
11
0
10 Dec 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
VLM
147
46
0
29 Mar 2024
Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild
Jiaxin Zhang
Canjie Luo
Lianwen Jin
Fengjun Guo
Kai Ding
85
24
0
23 Jul 2022
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations
Peng Zhang
Can Li
Liang Qiao
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Leilei Gan
57
59
0
13 May 2021
Instances as Queries
Yuxin Fang
Shusheng Yang
Xinggang Wang
Yu Li
Chen Fang
Ying Shan
Bin Feng
Wenyu Liu
ISeg
74
260
0
05 May 2021
ISTR: End-to-End Instance Segmentation with Transformers
Jie Hu
Liujuan Cao
Yao Lu
Shengchuan Zhang
Yan Wang
Ke Li
Feiyue Huang
Ling Shao
Rongrong Ji
ISeg
64
95
0
03 May 2021
SCNet: Training Inference Sample Consistency for Instance Segmentation
Thang Vu
Haeyong Kang
Chang D. Yoo
ISeg
131
94
0
18 Dec 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
246
5,098
0
08 Oct 2020
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection
Xiang Li
Wenhai Wang
Lijun Wu
Shuo Chen
Xiaolin Hu
Jun Li
Jinhui Tang
Jian Yang
ObjD
79
1,202
0
08 Jun 2020
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents
D. Prasad
Ayan Gadpal
Kshitij Kapadni
Manish Visave
Kavita A. Sultanpure
LMTD
55
169
0
27 Apr 2020
Cross-Domain Document Object Detection: Benchmark Suite and Method
Keqin Li
Curtis Wigington
Chris Tensmeyer
Handong Zhao
Nikolaos Barmpalios
Vlad I. Morariu
Varun Manjunatha
Tong Sun
Y. Fu
42
45
0
30 Mar 2020
SOLO: Segmenting Objects by Locations
Xinlong Wang
Tao Kong
Chunhua Shen
Yuning Jiang
Lei Li
SSeg
ISeg
73
678
0
10 Dec 2019
PubLayNet: largest dataset ever for document layout analysis
Xu Zhong
Jianbin Tang
Antonio Jimeno Yepes
52
461
0
16 Aug 2019
FCOS: Fully Convolutional One-Stage Object Detection
Zhi Tian
Chunhua Shen
Hao Chen
Tong He
ObjD
143
5,017
0
02 Apr 2019
Hybrid Task Cascade for Instance Segmentation
Kai-xiang Chen
Jiangmiao Pang
Jiaqi Wang
Yu Xiong
Xiaoxiao Li
...
Ziwei Liu
Jianping Shi
Wanli Ouyang
Chen Change Loy
Dahua Lin
ISeg
140
1,307
0
22 Jan 2019
dhSegment: A generic deep-learning approach for document segmentation
S. Oliveira
Benoit Seguin
F. Kaplan
SSeg
54
172
0
27 Apr 2018
YOLOv3: An Incremental Improvement
Joseph Redmon
Ali Farhadi
ObjD
130
21,482
0
08 Apr 2018
Cascade R-CNN: Delving into High Quality Object Detection
Zhaowei Cai
Nuno Vasconcelos
ObjD
147
4,941
0
03 Dec 2017
Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network
Xiao Yang
Ersin Yumer
P. Asente
Mike Kraley
Daniel Kifer
C. Lee Giles
70
230
0
07 Jun 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
366
27,244
0
20 Mar 2017
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
528
62,377
0
04 Jun 2015
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
432
43,814
0
01 May 2014
1