ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.06226
  4. Cited By
SentencePiece: A simple and language independent subword tokenizer and
  detokenizer for Neural Text Processing

SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018
Taku Kudo
John Richardson
ArXiv (abs)PDFHTMLGithub (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 1,950 papers shown
Title
Wine is Not v i n. -- On the Compatibility of Tokenizations Across
  Languages
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
78
17
0
13 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models
Multilingual Translation via Grafting Pre-trained Language Models
Zewei Sun
Mingxuan Wang
Lei Li
AI4CE
240
22
0
11 Sep 2021
Empirical Analysis of Training Strategies of Transformer-based Japanese
  Chit-chat Systems
Empirical Analysis of Training Strategies of Transformer-based Japanese Chit-chat Systems
Hiroaki Sugiyama
M. Mizukami
Tsunehiro Arimoto
Hiromi Narimatsu
Yuya Chiba
Hideharu Nakajima
Toyomi Meguro
178
53
0
11 Sep 2021
Remember the context! ASR slot error correction through memorization
Remember the context! ASR slot error correction through memorization
Dhanush Bekal
Ashish Shenoy
Monica Sunkara
S. Bodapati
Katrin Kirchhoff
KELM
57
12
0
10 Sep 2021
Modeling Human Sentence Processing with Left-Corner Recurrent Neural
  Network Grammars
Modeling Human Sentence Processing with Left-Corner Recurrent Neural Network Grammars
Ryo Yoshida
Hiroshi Noji
Yohei Oseki
74
9
0
10 Sep 2021
Integrating Approaches to Word Representation
Integrating Approaches to Word Representation
Yuval Pinter
NAI
94
5
0
10 Sep 2021
Improving Multilingual Translation by Representation and Gradient
  Regularization
Improving Multilingual Translation by Representation and Gradient Regularization
Yilin Yang
Akiko Eriguchi
Alexandre Muzio
Prasad Tadepalli
Stefan Lee
Hany Hassan
83
41
0
10 Sep 2021
AfroMT: Pretraining Strategies and Reproducible Benchmarks for
  Translation of 8 African Languages
AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages
Machel Reid
Junjie Hu
Graham Neubig
Y. Matsuo
158
33
0
10 Sep 2021
What Changes Can Large-scale Language Models Bring? Intensive Study on
  HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim
Hyoungseok Kim
Sang-Woo Lee
Gichang Lee
Donghyun Kwak
...
Jaewook Kang
Inho Kang
Jung-Woo Ha
W. Park
Nako Sung
VLM
292
124
0
10 Sep 2021
Speechformer: Reducing Information Loss in Direct Speech Translation
Speechformer: Reducing Information Loss in Direct Speech Translation
Sara Papi
Marco Gaido
Matteo Negri
Marco Turchi
129
24
0
09 Sep 2021
Non-autoregressive End-to-end Speech Translation with Parallel
  Autoregressive Rescoring
Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Hirofumi Inaguma
Yosuke Higuchi
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
87
11
0
09 Sep 2021
Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with
  Synthetic Data
Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data
Massimo Nicosia
Zhongdi Qu
Yasemin Altun
76
26
0
09 Sep 2021
Distributionally Robust Multilingual Machine Translation
Distributionally Robust Multilingual Machine Translation
Chunting Zhou
Daniel Levy
Xian Li
Marjan Ghazvininejad
Graham Neubig
139
24
0
09 Sep 2021
Competence-based Curriculum Learning for Multilingual Machine
  Translation
Competence-based Curriculum Learning for Multilingual Machine Translation
Mingliang Zhang
Fandong Meng
Y. Tong
Jie Zhou
102
16
0
09 Sep 2021
A Recipe For Arbitrary Text Style Transfer with Large Language Models
A Recipe For Arbitrary Text Style Transfer with Large Language Models
Emily Reif
Daphne Ippolito
Ann Yuan
Andy Coenen
Chris Callison-Burch
Jason W. Wei
318
120
0
08 Sep 2021
Infusing Future Information into Monotonic Attention Through Language
  Models
Infusing Future Information into Monotonic Attention Through Language Models
Mohd Abbas Zaidi
S. Indurthi
Beomseok Lee
Nikhil Kumar Lakumarapu
Sangha Kim
51
2
0
07 Sep 2021
Revisiting Context Choices for Context-aware Machine Translation
Revisiting Context Choices for Context-aware Machine Translation
Matīss Rikters
Toshiaki Nakazawa
LRM
36
5
0
07 Sep 2021
FH-SWF SG at GermEval 2021: Using Transformer-Based Language Models to
  Identify Toxic, Engaging, & Fact-Claiming Comments
FH-SWF SG at GermEval 2021: Using Transformer-Based Language Models to Identify Toxic, Engaging, & Fact-Claiming Comments
Tobias Bornheim
Stephan Bialonski
34
9
0
07 Sep 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
127
74
0
07 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALMUQCV
320
3,805
0
03 Sep 2021
How Suitable Are Subword Segmentation Strategies for Translating
  Non-Concatenative Morphology?
How Suitable Are Subword Segmentation Strategies for Translating Non-Concatenative Morphology?
Chantal Amrhein
Rico Sennrich
88
13
0
02 Sep 2021
LegaLMFiT: Efficient Short Legal Text Classification with LSTM Language
  Model Pre-Training
LegaLMFiT: Efficient Short Legal Text Classification with LSTM Language Model Pre-Training
Benjamin Clavié
Akshita Gheewala
Paul Briton
Marc Alphonsus
Rym Labiyaad
Francesco Piccoli
VLMAILaw
67
5
0
02 Sep 2021
Survey of Low-Resource Machine Translation
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
118
162
0
01 Sep 2021
Efficient conformer: Progressive downsampling and grouped attention for
  automatic speech recognition
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition
Maxime Burchi
Valentin Vielzeuf
71
88
0
31 Aug 2021
Shatter: An Efficient Transformer Encoder with Single-Headed
  Self-Attention and Relative Sequence Partitioning
Shatter: An Efficient Transformer Encoder with Single-Headed Self-Attention and Relative Sequence Partitioning
Ran Tian
Joshua Maynez
Ankur P. Parikh
ViT
56
2
0
30 Aug 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text
  Understanding and Generation
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
Jian Guan
Zhuoer Feng
Yamei Chen
Ru He
Xiaoxi Mao
Changjie Fan
Minlie Huang
118
33
0
30 Aug 2021
Code-switched inspired losses for generic spoken dialog representations
Code-switched inspired losses for generic spoken dialog representations
E. Chapuis
Pierre Colombo
Matthieu Labeau
Chloe Clave
177
12
0
27 Aug 2021
Injecting Text in Self-Supervised Speech Pretraining
Injecting Text in Self-Supervised Speech Pretraining
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Gary Wang
Pedro J. Moreno
SSL
88
36
0
27 Aug 2021
YANMTT: Yet Another Neural Machine Translation Toolkit
YANMTT: Yet Another Neural Machine Translation Toolkit
Raj Dabre
Eiichiro Sumita
72
13
0
25 Aug 2021
Towards Offensive Language Identification for Tamil Code-Mixed YouTube
  Comments and Posts
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts
Charangan Vasantharajan
Uthayasanker Thayasivam
57
38
0
24 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLMMLLM
153
799
0
24 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer
  Models via Low-Rank Approximation
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
103
12
0
24 Aug 2021
C5T5: Controllable Generation of Organic Molecules with Transformers
C5T5: Controllable Generation of Organic Molecules with Transformers
D. Rothchild
Alex Tamkin
Julie H. Yu
Ujval Misra
Joseph E. Gonzalez
101
29
0
23 Aug 2021
Contributions of Transformer Attention Heads in Multi- and Cross-lingual
  Tasks
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks
Weicheng Ma
Kai Zhang
Renze Lou
Lili Wang
Soroush Vosoughi
405
17
0
18 Aug 2021
Program Synthesis with Large Language Models
Program Synthesis with Large Language Models
Jacob Austin
Augustus Odena
Maxwell Nye
Maarten Bosma
Henryk Michalewski
...
Ellen Jiang
Carrie J. Cai
Michael Terry
Quoc V. Le
Charles Sutton
ELMAIMatReCodALM
218
2,023
0
16 Aug 2021
On Multi-Modal Learning of Editing Source Code
On Multi-Modal Learning of Editing Source Code
Saikat Chakraborty
Baishakhi Ray
KELM
68
59
0
15 Aug 2021
How Optimal is Greedy Decoding for Extractive Question Answering?
How Optimal is Greedy Decoding for Extractive Question Answering?
Or Castel
Ori Ram
Avia Efrat
Omer Levy
84
4
0
12 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural
  Language Processing
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLMLM&MA
103
270
0
12 Aug 2021
The HW-TSC's Offline Speech Translation Systems for IWSLT 2021
  Evaluation
The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Minghan Wang
Yuxia Wang
Chang Su
Jiaxin Guo
Yingtao Zhang
...
Shimin Tao
Xingshan Zeng
Liangyou Li
Hao Yang
Ying Qin
54
6
0
09 Aug 2021
Machine Translation of Low-Resource Indo-European Languages
Machine Translation of Low-Resource Indo-European Languages
Wei-Rui Chen
Muhammad Abdul-Mageed
39
3
0
08 Aug 2021
Facebook AI WMT21 News Translation Task Submission
Facebook AI WMT21 News Translation Task Submission
C. Tran
Shruti Bhosale
James Cross
Philipp Koehn
Sergey Edunov
Angela Fan
VLM
206
82
0
06 Aug 2021
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence
  Pretraining
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining
Machel Reid
Mikel Artetxe
VLM
96
28
0
04 Aug 2021
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference
  Optimization
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization
J. Macoskey
Grant P. Strimel
Ariya Rastrow
57
19
0
03 Aug 2021
EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative
  Pre-Training
EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training
Hao Zhou
Pei Ke
Zheng Zhang
Yuxian Gu
Yinhe Zheng
...
Xiaocong Yang
Bosi Wen
Xiaoyan Zhu
Minlie Huang
Jie Tang
57
55
0
03 Aug 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLMVLMGNN
133
585
0
30 Jul 2021
Using Perturbed Length-aware Positional Encoding for Non-autoregressive
  Neural Machine Translation
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation
Yuichi Oka
Katsuhito Sudoh
Satoshi Nakamura
95
4
0
29 Jul 2021
gaBERT -- an Irish Language Model
gaBERT -- an Irish Language Model
James Barry
Joachim Wagner
Lauren Cassidy
Alan Cowap
Teresa Lynn
Abigail Walsh
Mícheál J. Ó Meachair
Jennifer Foster
65
18
0
27 Jul 2021
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
A. Kalinov
Somshubra Majumdar
Jagadeesh Balam
Boris Ginsburg
MoE
44
3
0
22 Jul 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio
  Communication Speech
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Duo Ma
Nana Hou
Van Tung Pham
Haihua Xu
Chng Eng Siong
69
22
0
22 Jul 2021
Comparison of Czech Transformers on Text Classification Tasks
Comparison of Czech Transformers on Text Classification Tasks
Jan Lehevcka
Jan vSvec
VLM
73
13
0
21 Jul 2021
Previous
123...272829...373839
Next