2407.13773
Cited By
OpenDataLab: Empowering General Artificial Intelligence with Open Datasets
4 June 2024
Conghui He
Wei Li
Zhenjiang Jin
Chao Xu
Bin Wang
Dahua Lin
AI4CE
Papers citing
"OpenDataLab: Empowering General Artificial Intelligence with Open Datasets"
9 / 9 papers shown
Title
Automatic Task Detection and Heterogeneous LLM Speculative Decoding
Danying Ge
Jianhua Gao
Qizhi Jiang
Yifei Feng
Weixing Ji
39
0
0
13 May 2025
Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR
Yuyao Zhang
Tianyi Liang
Xinyue Huang
Erfei Cui
Xu Guo
Pei Chu
Chenhui Li
Ru Zhang
Wenhai Wang
Gongshen Liu
129
0
0
15 Apr 2025
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
39
5
0
05 Jul 2024
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Bin Wang
Zhuangcheng Gu
Chaochao Xu
Bo-Wen Zhang
Botian Shi
Conghui He
OffRL
41
9
0
23 Apr 2024
Parrot Captions Teach CLIP to Spot Text
Yiqi Lin
Conghui He
Alex Jinpeng Wang
Bin Wang
Weijia Li
Mike Zheng Shou
36
7
0
21 Dec 2023
MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models
Yidong Liu
Fu-De Shang
Fang Wang
Rui Xu
Jun Wang
Wei Li
Yaoxin Li
Conghui He
AILaw
AI4TS
20
1
0
21 Sep 2023
WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models
Conghui He
Zhenjiang Jin
Chaoxi Xu
Jiantao Qiu
Bin Wang
Wei Li
Hang Yan
Jiaqi Wang
Da Lin
65
34
0
21 Aug 2023
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Yi Wang
Yinan He
Yizhuo Li
Kunchang Li
Jiashuo Yu
...
Ping Luo
Ziwei Liu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
33
244
0
13 Jul 2023
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
293
1,084
0
17 Feb 2021