ResearchTrend.AI
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018
Taku Kudo
John Richardson
ArXiv (abs) | PDF | HTML | GitHub (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 1,950 papers shown
Assessing Evaluation Metrics for Speech-to-Speech Translation
Elizabeth Salesky
Julian Mäder
Severin Klinger
74
15
0
26 Oct 2021
Improving Non-autoregressive Generation with Mixup Training
Ting Jiang
Shaohan Huang
Zihan Zhang
Deqing Wang
Fuzhen Zhuang
Furu Wei
Haizhen Huang
Liangjie Zhang
Qi Zhang
38
8
0
21 Oct 2021
Multilingual Unsupervised Neural Machine Translation with Denoising Adapters
Ahmet Üstün
Alexandre Berard
Laurent Besacier
Matthias Gallé
84
47
0
20 Oct 2021
Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach
Mun-Hak Lee
Joon‐Hyuk Chang
27
2
0
20 Oct 2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Ankur Bapna
Yu-An Chung
Na Wu
Anmol Gulati
Ye Jia
J. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Yu Zhang
VLM
137
96
0
20 Oct 2021
Ensemble ALBERT on SQuAD 2.0
Shilun Li
Renee Li
Veronica Peng
MoE
27
6
0
19 Oct 2021
Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement
HyoJung Han
Seokchan Ahn
Yoonjung Choi
Insoo Chung
Sangha Kim
Kyunghyun Cho
63
6
0
18 Oct 2021
EmbRace: Accelerating Sparse Communication for Distributed Training of NLP Neural Networks
Shengwei Li
Zhiquan Lai
Dongsheng Li
Yiming Zhang
Xiangyu Ye
Yabo Duan
FedML
61
3
0
18 Oct 2021
Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages
C.M. Downey
Shannon Drizin
Levon Haroutunian
Shivin Thukral
48
2
0
16 Oct 2021
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation
Danni Liu
Changhan Wang
Hongyu Gong
Xutai Ma
Yun Tang
J. Pino
98
4
0
15 Oct 2021
Why don't people use character-level machine translation?
Jindrich Libovický
Helmut Schmid
Alexander Fraser
135
29
0
15 Oct 2021
Unifying Cross-lingual Summarization and Machine Translation with Compression Rate
Yu Bai
Heyan Huang
Kai Fan
Yang Gao
Yi-Bo Zhu
Jiaao Zhan
Zewen Chi
Boxing Chen
49
9
0
15 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
422
1,115
0
13 Oct 2021
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
99
15
0
13 Oct 2021
Automated Essay Scoring Using Transformer Models
Sabrina Ludwig
Christian W. F. Mayer
Christopher Hansen
Kerstin Eilers
Steffen Brandt
85
40
0
13 Oct 2021
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Nana Hou
Chen Chen
Chng Eng Siong
75
41
0
11 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
154
172
0
11 Oct 2021
Towards Learning (Dis)-Similarity of Source Code from Program Contrasts
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Alessandro Morari
Baishakhi Ray
Saikat Chakraborty
82
36
0
08 Oct 2021
Machine Translation Verbosity Control for Automatic Dubbing
Surafel Melaku Lakew
Marcello Federico
Yue Wang
Cuong Hoang
Yogesh Virkar
Roberto Barra-Chicote
Robert Enyedi
61
24
0
08 Oct 2021
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Yangyang Shi
Chunyang Wu
Dilin Wang
Alex Xiao
Jay Mahadeokar
...
Ke Li
Yuan Shangguan
Varun K. Nagaraja
Ozlem Kalinli
M. Seltzer
143
15
0
07 Oct 2021
A Comparative Study of Transformer-Based Language Models on Extractive Question Answering
Kate Pearce
Tiffany Zhan
Aneesh Komanduri
J. Zhan
ELM
87
34
0
07 Oct 2021
How BPE Affects Memorization in Transformers
Eugene Kharitonov
Marco Baroni
Dieuwke Hupkes
247
33
0
06 Oct 2021
Word Acquisition in Neural Language Models
Tyler A. Chang
Benjamin Bergen
85
40
0
05 Oct 2021
Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark
Joel Niklaus
Ilias Chalkidis
Matthias Sturmer
ELM
AILaw
67
70
0
02 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe
Aylin Caliskan
125
51
0
01 Oct 2021
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Yichong Leng
Xu Tan
Rui Wang
Linchen Zhu
Jin Xu
...
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
129
42
0
29 Sep 2021
EdinSaar@WMT21: North-Germanic Low-Resource Multilingual NMT
Svetlana Tchistiakova
Jesujoba Oluwadara Alabi
Koel Dutta Chowdhury
Sourav Dutta
Dana Ruiter
VLM
57
6
0
29 Sep 2021
Improving Arabic Diacritization by Learning to Diacritize and Translate
Brian Thompson
A. Alshehri
67
10
0
29 Sep 2021
PicTalky: Augmentative and Alternative Communication Software for Language Developmental Disabilities
Chanjun Park
Yoonna Jang
Seolhwa Lee
Jaehyung Seo
Kisu Yang
Heuiseok Lim
23
6
0
27 Sep 2021
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates
Hirofumi Inaguma
Siddharth Dalmia
Brian Yan
Shinji Watanabe
99
11
0
27 Sep 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta
Yanping Huang
Ankur Bapna
M. Krikun
Dmitry Lepikhin
Minh-Thang Luong
Orhan Firat
MoE
257
112
0
24 Sep 2021
Cross-Lingual Language Model Meta-Pretraining
Zewen Chi
Heyan Huang
Luyang Liu
Yu Bai
Xian-Ling Mao
LRM
210
0
0
23 Sep 2021
Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
Biao Zhang
Ankur Bapna
Melvin Johnson
A. Dabirmoghaddam
N. Arivazhagan
Orhan Firat
78
14
0
21 Sep 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
246
375
0
21 Sep 2021
BERTweetFR : Domain Adaptation of Pre-Trained Language Models for French Tweets
Yanzhu Guo
Virgile Rennard
Christos Xypolopoulos
Michalis Vazirgiannis
VLM
AI4CE
89
19
0
21 Sep 2021
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese
Nguyen Luong Tran
Duong Minh Le
Dat Quoc Nguyen
70
55
0
20 Sep 2021
CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task
Josef Jon
Michal Novák
João Paulo Aires
Dušan Variš
Ondrej Bojar
40
3
0
20 Sep 2021
CUNI systems for WMT21: Terminology translation Shared Task
Josef Jon
Michal Novák
João Paulo Aires
Dušan Variš
Ondrej Bojar
47
4
0
20 Sep 2021
Augmenting semantic lexicons using word embeddings and transfer learning
Thayer Alshaabi
C. V. Oort
M. Fudolig
M. V. Arnold
C. Danforth
P. Dodds
61
4
0
18 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mańke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
277
156
0
17 Sep 2021
Translation Transformers Rediscover Inherent Data Domains
Maksym Del
Elizaveta Korotkova
Mark Fishel
43
7
0
16 Sep 2021
Challenges in Detoxifying Language Models
Johannes Welbl
Amelia Glaese
J. Uesato
Sumanth Dathathri
John F. J. Mellor
Lisa Anne Hendricks
Kirsty Anderson
Pushmeet Kohli
Ben Coppin
Po-Sen Huang
LM&MA
313
195
0
15 Sep 2021
Learning When to Translate for Streaming Speech
Qianqian Dong
Yaoming Zhu
Mingxuan Wang
Lei Li
100
30
0
15 Sep 2021
The ELITR ECA Corpus
Philip Williams
Barry Haddow
33
4
0
15 Sep 2021
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training
Bo Zheng
Li Dong
Shaohan Huang
Saksham Singhal
Wanxiang Che
Ting Liu
Xia Song
Furu Wei
VLM
82
22
0
15 Sep 2021
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
M. Yarmohammadi
Shijie Wu
Marc Marone
Haoran Xu
Seth Ebner
...
Craig Harman
Kenton W. Murray
Aaron Steven White
Mark Dredze
Benjamin Van Durme
76
30
0
14 Sep 2021
Evaluating Transferability of BERT Models on Uralic Languages
Judit Ács
Dániel Lévai
András Kornai
61
6
0
13 Sep 2021
KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation
Marzieh S. Tahaei
Ella Charlaix
V. Nia
A. Ghodsi
Mehdi Rezagholizadeh
110
22
0
13 Sep 2021
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Momchil Hardalov
Arnav Arora
Preslav Nakov
Isabelle Augenstein
88
63
0
13 Sep 2021
Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet
Xingwei He
Victor O.K. Li
BDL
282
24
0
13 Sep 2021