Sub-Character Tokenization for Chinese Pretrained Language Models

Chenglei Si
Zhengyan Zhang
Yingfa Chen
Fanchao Qi
Xiaozhi Wang
Zhiyuan Liu
Yasheng Wang
Qun Liu
Maosong Sun

Papers citing "Sub-Character Tokenization for Chinese Pretrained Language Models"

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
14 Feb 2023