ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.12142
  4. Cited By
Pretraining without Wordpieces: Learning Over a Vocabulary of Millions
  of Words

Pretraining without Wordpieces: Learning Over a Vocabulary of Millions of Words

24 February 2022
Zhangyin Feng
Duyu Tang
Cong Zhou
Junwei Liao
Shuangzhi Wu
Xiaocheng Feng
Bing Qin
Yunbo Cao
Shuming Shi
    VLM
ArXivPDFHTML

Papers citing "Pretraining without Wordpieces: Learning Over a Vocabulary of Millions of Words"

8 / 8 papers shown
Title
Towards understanding evolution of science through language model series
Towards understanding evolution of science through language model series
Junjie Dong
Zhuoqi Lyu
Qing Ke
AI4TS
35
0
0
15 Sep 2024
Explicit Morphological Knowledge Improves Pre-training of Language
  Models for Hebrew
Explicit Morphological Knowledge Improves Pre-training of Language Models for Hebrew
Eylon Gueta
Omer Goldman
Reut Tsarfaty
11
1
0
01 Nov 2023
Biomedical Language Models are Robust to Sub-optimal Tokenization
Biomedical Language Models are Robust to Sub-optimal Tokenization
Bernal Jiménez Gutiérrez
Huan Sun
Yu-Chuan Su
22
6
0
30 Jun 2023
ESIE-BERT: Enriching Sub-words Information Explicitly with BERT for
  Joint Intent Classification and SlotFilling
ESIE-BERT: Enriching Sub-words Information Explicitly with BERT for Joint Intent Classification and SlotFilling
Yutian Guo
Zhilong Xie
Xingyan Chen
Huangen Chen
Leilei Wang
Huaming Du
Shaopeng Wei
Yu Zhao
Qing Li
Ganglu Wu
14
7
0
27 Nov 2022
Word-Level Representation From Bytes For Language Modeling
Word-Level Representation From Bytes For Language Modeling
Chul Lee
Qipeng Guo
Xipeng Qiu
15
1
0
23 Nov 2022
Topic-Grained Text Representation-based Model for Document Retrieval
Topic-Grained Text Representation-based Model for Document Retrieval
Mengxue Du
Shasha Li
Jie Yu
Jun Ma
Bing Ji
Huijun Liu
Wuhang Lin
Zibo Yi
19
2
0
11 Jul 2022
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
Efficient Estimation of Word Representations in Vector Space
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
266
31,267
0
16 Jan 2013
1