ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.06226
  4. Cited By
SentencePiece: A simple and language independent subword tokenizer and
  detokenizer for Neural Text Processing

SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018
Taku Kudo
John Richardson
ArXiv (abs)PDFHTMLGithub (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 1,950 papers shown
Title
LegoNN: Building Modular Encoder-Decoder Models
LegoNN: Building Modular Encoder-Decoder Models
Siddharth Dalmia
Dmytro Okhonko
M. Lewis
Sergey Edunov
Shinji Watanabe
Florian Metze
Luke Zettlemoyer
Abdel-rahman Mohamed
AuLLMMoE
69
14
0
07 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
97
9
0
07 Jun 2022
DynaMaR: Dynamic Prompt with Mask Token Representation
DynaMaR: Dynamic Prompt with Mask Token Representation
Xiaodi Sun
Sunny Rajagopalan
Priyank Nigam
Weiyi Lu
Yi Xu
Belinda Zeng
Trishul Chilimbi
42
1
0
07 Jun 2022
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture
  of Experts
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts
Basil Mustafa
C. Riquelme
J. Puigcerver
Rodolphe Jenatton
N. Houlsby
VLMMoE
170
205
0
06 Jun 2022
What do tokens know about their characters and how do they know it?
What do tokens know about their characters and how do they know it?
Ayush Kaushal
Kyle Mahowald
90
31
0
06 Jun 2022
Improving Contrastive Learning of Sentence Embeddings with
  Case-Augmented Positives and Retrieved Negatives
Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved Negatives
Wei Wang
Liangzhu Ge
Jingqiao Zhang
Cheng Yang
74
22
0
06 Jun 2022
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation
Pengzhi Gao
Zhongjun He
Hua Wu
Haifeng Wang
78
14
0
06 Jun 2022
Variable-rate hierarchical CPC leads to acoustic unit discovery in
  speech
Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Santiago Cuervo
Adrian Lañcucki
R. Marxer
Paweł Rychlikowski
J. Chorowski
SSL
82
13
0
05 Jun 2022
Multilingual Neural Machine Translation with Deep Encoder and Multiple
  Shallow Decoders
Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders
Xiang Kong
Adithya Renduchintala
James Cross
Yuqing Tang
Jiatao Gu
Xian Li
91
32
0
05 Jun 2022
VL-BEiT: Generative Vision-Language Pretraining
VL-BEiT: Generative Vision-Language Pretraining
Hangbo Bao
Wenhui Wang
Li Dong
Furu Wei
VLM
84
45
0
02 Jun 2022
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Sehoon Kim
A. Gholami
Albert Eaton Shaw
Nicholas Lee
K. Mangalam
Jitendra Malik
Michael W. Mahoney
Kurt Keutzer
123
105
0
02 Jun 2022
Exploring Diversity in Back Translation for Low-Resource Machine
  Translation
Exploring Diversity in Back Translation for Low-Resource Machine Translation
Laurie Burchell
Alexandra Birch
Kenneth Heafield
89
15
0
01 Jun 2022
B2T Connection: Serving Stability and Performance in Deep Transformers
B2T Connection: Serving Stability and Performance in Deep Transformers
Sho Takase
Shun Kiyono
Sosuke Kobayashi
Jun Suzuki
104
11
0
01 Jun 2022
EMS: Efficient and Effective Massively Multilingual Sentence Embedding
  Learning
EMS: Efficient and Effective Massively Multilingual Sentence Embedding Learning
Zhuoyuan Mao
Chenhui Chu
Sadao Kurohashi
73
1
0
31 May 2022
Transformer with Tree-order Encoding for Neural Program Generation
Transformer with Tree-order Encoding for Neural Program Generation
Klaudia Thellmann
Bernhard Stadler
Ricardo Usbeck
Jens Lehmann
95
1
0
30 May 2022
Patching Leaks in the Charformer for Efficient Character-Level
  Generation
Patching Leaks in the Charformer for Efficient Character-Level Generation
Lukas Edman
Antonio Toral
Gertjan van Noord
34
1
0
27 May 2022
AutoTSG: Learning and Synthesis for Incident Troubleshooting
AutoTSG: Learning and Synthesis for Incident Troubleshooting
Manish Shetty
Chetan Bansal
Sai Pramod Upadhyayula
Arjun Radhakrishna
Anurag Gupta
88
22
0
26 May 2022
Towards Learning Universal Hyperparameter Optimizers with Transformers
Towards Learning Universal Hyperparameter Optimizers with Transformers
Yutian Chen
Xingyou Song
Chansoo Lee
Zehao Wang
Qiuyi Zhang
...
Greg Kochanski
Arnaud Doucet
MarcÁurelio Ranzato
Sagi Perel
Nando de Freitas
105
65
0
26 May 2022
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Tu Vu
Aditya Barua
Brian Lester
Daniel Cer
Mohit Iyyer
Noah Constant
CLL
95
66
0
25 May 2022
Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT
Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT
James Lee-Thorp
Joshua Ainslie
MoE
94
12
0
24 May 2022
Learning to Model Editing Processes
Learning to Model Editing Processes
Machel Reid
Graham Neubig
KELMBDL
187
36
0
24 May 2022
EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start
EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start
Jonathan Mallinson
Jakub Adamek
Eric Malmi
Aliaksei Severyn
KELM
145
43
0
24 May 2022
PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised
  Poetry Generation
PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised Poetry Generation
Aitor Ormazabal
Mikel Artetxe
Manex Agirrezabal
Aitor Soroa Etxabe
Eneko Agirre
67
21
0
24 May 2022
Associative Learning Mechanism for Drug-Target Interaction Prediction
Associative Learning Mechanism for Drug-Target Interaction Prediction
Zhiqin Zhu
Zheng Yao
Guanqiu Qi
Neal Mazur
Baisheng Cong
OOD
101
36
0
24 May 2022
Local Byte Fusion for Neural Machine Translation
Local Byte Fusion for Neural Machine Translation
Makesh Narsimhan Sreedhar
Xiangpeng Wan
Yu-Jie Cheng
Junjie Hu
97
4
0
23 May 2022
The Importance of Being Parameters: An Intra-Distillation Method for
  Serious Gains
The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains
Haoran Xu
Philipp Koehn
Kenton W. Murray
MoMe
40
5
0
23 May 2022
StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in
  Question Answering Models
StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models
Adam Livska
Tomávs Kovciský
E. Gribovskaya
Tayfun Terzi
Eren Sezener
...
Susannah Young
Ellen Gilsenan-McMahon
Sophia Austin
Phil Blunsom
Angeliki Lazaridou
KELM
308
104
0
23 May 2022
When does Parameter-Efficient Transfer Learning Work for Machine
  Translation?
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
Ahmet Üstün
Asa Cooper Stickland
95
7
0
23 May 2022
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating
  Low-Resource Natural Language Generation in Bangla
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Rifat Shahriyar
AIMatLM&MA
109
32
0
23 May 2022
Sequence-to-Action: Grammatical Error Correction with Action Guided
  Sequence Generation
Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation
Jiquan Li
Junliang Guo
Yongxin Zhu
Xin Sheng
Deqiang Jiang
Bo Ren
Linli Xu
106
24
0
22 May 2022
Multilingual Machine Translation with Hyper-Adapters
Multilingual Machine Translation with Hyper-Adapters
Christos Baziotis
Mikel Artetxe
James Cross
Shruti Bhosale
124
23
0
22 May 2022
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit
Hui Zhang
Tian Yuan
Junkun Chen
Xintong Li
Renjie Zheng
...
Zeyu Chen
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
Liang Huang
AuLLM
69
28
0
20 May 2022
SALTED: A Framework for SAlient Long-Tail Translation Error Detection
SALTED: A Framework for SAlient Long-Tail Translation Error Detection
Vikas Raunak
Matt Post
Arul Menezes
74
25
0
20 May 2022
Content-Context Factorized Representations for Automated Speech
  Recognition
Content-Context Factorized Representations for Automated Speech Recognition
David M. Chan
Shalini Ghosh
81
13
0
19 May 2022
MiDAS: Multi-integrated Domain Adaptive Supervision for Fake News
  Detection
MiDAS: Multi-integrated Domain Adaptive Supervision for Fake News Detection
Abhijit Suprem
C. Pu
115
7
0
19 May 2022
TiBERT: Tibetan Pre-trained Language Model
TiBERT: Tibetan Pre-trained Language Model
Yuan Sun
Sisi Liu
Junjie Deng
Xiaobing Zhao
94
10
0
15 May 2022
Improving Neural Machine Translation of Indigenous Languages with
  Multilingual Transfer Learning
Improving Neural Machine Translation of Indigenous Languages with Multilingual Transfer Learning
Wei-Rui Chen
Muhammad Abdul-Mageed
70
7
0
14 May 2022
IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words
  and Their Semantic Representations
IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations
Damir Korenčić
Ivan Grubišić
55
3
0
13 May 2022
Who Are We Talking About? Handling Person Names in Speech Translation
Who Are We Talking About? Handling Person Names in Speech Translation
Marco Gaido
Matteo Negri
Marco Turchi
80
8
0
13 May 2022
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language
  Generation
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation
Long Phan
H. Tran
Hieu Duy Nguyen
Trieu H. Trinh
ViT
109
68
0
13 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&RoLLMAGAI4CE
217
827
0
12 May 2022
Controlling Formality in Low-Resource NMT with Domain Adaptation and
  Re-Ranking: SLT-CDT-UoS at IWSLT2022
Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022
S. Vincent
Loïc Barrault
Carolina Scarton
68
6
0
12 May 2022
AppTek's Submission to the IWSLT 2022 Isometric Spoken Language
  Translation Task
AppTek's Submission to the IWSLT 2022 Isometric Spoken Language Translation Task
P. Wilken
E. Matusov
58
5
0
12 May 2022
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation
  of Multi-party Speech
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Ilya Sklyar
A. Piunova
Christian Osendorfer
66
6
0
10 May 2022
Controlling Extra-Textual Attributes about Dialogue Participants -- A
  Case Study of English-to-Polish Neural Machine Translation
Controlling Extra-Textual Attributes about Dialogue Participants -- A Case Study of English-to-Polish Neural Machine Translation
S. Vincent
Loïc Barrault
Carolina Scarton
62
3
0
10 May 2022
ParaCotta: Synthetic Multilingual Paraphrase Corpora from the Most
  Diverse Translation Sample Pair
ParaCotta: Synthetic Multilingual Paraphrase Corpora from the Most Diverse Translation Sample Pair
Alham Fikri Aji
Radityo Eko Prasojo Tirana Noor Fatyanosa
Radityo Eko Prasojo
Philip Arthur
Suci Fitriany
Salma Qonitah
Nadhifatuz Zulfa
Tomi Santoso
Mahendra Data
SyDa
53
12
0
10 May 2022
Sub-Word Alignment Is Still Useful: A Vest-Pocket Method for Enhancing
  Low-Resource Machine Translation
Sub-Word Alignment Is Still Useful: A Vest-Pocket Method for Enhancing Low-Resource Machine Translation
Minhan Xu
Yu Hong
74
7
0
09 May 2022
Building Machine Translation Systems for the Next Thousand Languages
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Zhiwen Chen
Yonghui Wu
Macduff Hughes
123
101
0
09 May 2022
Context-Aware Abbreviation Expansion Using Large Language Models
Context-Aware Abbreviation Expansion Using Large Language Models
Shanqing Cai
Subhashini Venugopalan
Katrin Tomanek
Ajit Narayanan
Meredith Ringel Morris
Michael P. Brenner
66
28
0
08 May 2022
Quantifying Synthesis and Fusion and their Impact on Machine Translation
Quantifying Synthesis and Fusion and their Impact on Machine Translation
Arturo Oncevay
Duygu Ataman
N. V. Berkel
Barry Haddow
Alexandra Birch
Johannes Bjerva
42
3
0
06 May 2022
Previous
123...222324...373839
Next