Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.10626
Cited By
Dynamic data sampler for cross-language transfer learning in large language models
17 May 2024
Yudong Li
Yuhao Feng
Wen Zhou
Zhe Zhao
Linlin Shen
Cheng-An Hou
Xianxu Hou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dynamic data sampler for cross-language transfer learning in large language models"
4 / 4 papers shown
Title
CMMLU: Measuring massive multitask language understanding in Chinese
Haonan Li
Yixuan Zhang
Fajri Koto
Yifei Yang
Hai Zhao
Yeyun Gong
Nan Duan
Tim Baldwin
ALM
ELM
64
253
0
15 Jun 2023
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Zhe Zhao
Yudong Li
Cheng-An Hou
Jing-xin Zhao
Rong Tian
...
Xingwu Sun
Zhanhui Kang
Xiaoyong Du
Linlin Shen
Kimmo Yan
VLM
62
24
0
13 Dec 2022
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model
Liang Xu
Xuanwei Zhang
Qianqian Dong
SSL
33
70
0
03 Mar 2020
WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia
Holger Schwenk
Vishrav Chaudhary
Shuo Sun
Hongyu Gong
Francisco Guzmán
CVBM
85
404
0
10 Jul 2019
1