arXiv:2402.09808
Knowledge of Pretrained Language Models on Surface Information of Tokens
15 February 2024
Tatsuya Hiraoka
Naoaki Okazaki
Papers citing "Knowledge of Pretrained Language Models on Surface Information of Tokens"
3 / 3 papers shown
Title
Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts
Yingfa Chen, Chenlong Hu, Cong Feng, Chenyang Song, Shi Yu, Xu Han, Zhiyuan Liu, Maosong Sun
28
0
0
02 Sep 2024
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne, Hiroshi Noji, Pierre Zweigenbaum, Junichi Tsujii
71
156
0
20 Oct 2020
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov, Kai Chen, G. Corrado, J. Dean
239
31,257
0
16 Jan 2013