SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 1,950 papers shown

Title
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages Antonis Maronikolakis Philipp Dufter Hinrich Schütze 78 17 0 13 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models Zewei Sun Mingxuan Wang Lei Li AI4CE 240 22 0 11 Sep 2021
Empirical Analysis of Training Strategies of Transformer-based Japanese Chit-chat Systems Hiroaki Sugiyama M. Mizukami Tsunehiro Arimoto Hiromi Narimatsu Yuya Chiba Hideharu Nakajima Toyomi Meguro 178 53 0 11 Sep 2021
Remember the context! ASR slot error correction through memorization Dhanush Bekal Ashish Shenoy Monica Sunkara S. Bodapati Katrin Kirchhoff KELM 57 12 0 10 Sep 2021
Modeling Human Sentence Processing with Left-Corner Recurrent Neural Network Grammars Ryo Yoshida Hiroshi Noji Yohei Oseki 74 9 0 10 Sep 2021
Integrating Approaches to Word Representation Yuval Pinter NAI 94 5 0 10 Sep 2021
Improving Multilingual Translation by Representation and Gradient Regularization Yilin Yang Akiko Eriguchi Alexandre Muzio Prasad Tadepalli Stefan Lee Hany Hassan 83 41 0 10 Sep 2021
AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages Machel Reid Junjie Hu Graham Neubig Y. Matsuo 158 33 0 10 Sep 2021
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers Boseop Kim Hyoungseok Kim Sang-Woo Lee Gichang Lee Donghyun Kwak ... Jaewook Kang Inho Kang Jung-Woo Ha W. Park Nako Sung VLM 292 124 0 10 Sep 2021
Speechformer: Reducing Information Loss in Direct Speech Translation Sara Papi Marco Gaido Matteo Negri Marco Turchi 129 24 0 09 Sep 2021
Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring Hirofumi Inaguma Yosuke Higuchi Kevin Duh Tatsuya Kawahara Shinji Watanabe 87 11 0 09 Sep 2021
Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data Massimo Nicosia Zhongdi Qu Yasemin Altun 76 26 0 09 Sep 2021
Distributionally Robust Multilingual Machine Translation Chunting Zhou Daniel Levy Xian Li Marjan Ghazvininejad Graham Neubig 139 24 0 09 Sep 2021
Competence-based Curriculum Learning for Multilingual Machine Translation Mingliang Zhang Fandong Meng Y. Tong Jie Zhou 102 16 0 09 Sep 2021
A Recipe For Arbitrary Text Style Transfer with Large Language Models Emily Reif Daphne Ippolito Ann Yuan Andy Coenen Chris Callison-Burch Jason W. Wei 318 120 0 08 Sep 2021
Infusing Future Information into Monotonic Attention Through Language Models Mohd Abbas Zaidi S. Indurthi Beomseok Lee Nikhil Kumar Lakumarapu Sangha Kim 51 2 0 07 Sep 2021
Revisiting Context Choices for Context-aware Machine Translation Matīss Rikters Toshiaki Nakazawa LRM 36 5 0 07 Sep 2021
FH-SWF SG at GermEval 2021: Using Transformer-Based Language Models to Identify Toxic, Engaging, & Fact-Claiming Comments Tobias Bornheim Stephan Bialonski 34 9 0 07 Sep 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation Raj Dabre Himani Shrotriya Anoop Kunchukuttan Ratish Puduppully Mitesh M. Khapra Pratyush Kumar 127 74 0 07 Sep 2021
Finetuned Language Models Are Zero-Shot Learners Jason W. Wei Maarten Bosma Vincent Zhao Kelvin Guu Adams Wei Yu Brian Lester Nan Du Andrew M. Dai Quoc V. Le ALM UQCV 320 3,805 0 03 Sep 2021
How Suitable Are Subword Segmentation Strategies for Translating Non-Concatenative Morphology? Chantal Amrhein Rico Sennrich 88 13 0 02 Sep 2021
LegaLMFiT: Efficient Short Legal Text Classification with LSTM Language Model Pre-Training Benjamin Clavié Akshita Gheewala Paul Briton Marc Alphonsus Rym Labiyaad Francesco Piccoli VLM AILaw 67 5 0 02 Sep 2021
Survey of Low-Resource Machine Translation Barry Haddow Rachel Bawden Antonio Valerio Miceli Barone Jindvrich Helcl Alexandra Birch AIMat 118 162 0 01 Sep 2021
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition Maxime Burchi Valentin Vielzeuf 71 88 0 31 Aug 2021
Shatter: An Efficient Transformer Encoder with Single-Headed Self-Attention and Relative Sequence Partitioning Ran Tian Joshua Maynez Ankur P. Parikh ViT 56 2 0 30 Aug 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation Jian Guan Zhuoer Feng Yamei Chen Ru He Xiaoxi Mao Changjie Fan Minlie Huang 118 33 0 30 Aug 2021
Code-switched inspired losses for generic spoken dialog representations E. Chapuis Pierre Colombo Matthieu Labeau Chloe Clave 177 12 0 27 Aug 2021
Injecting Text in Self-Supervised Speech Pretraining Zhehuai Chen Yu Zhang Andrew Rosenberg Bhuvana Ramabhadran Gary Wang Pedro J. Moreno SSL 88 36 0 27 Aug 2021
YANMTT: Yet Another Neural Machine Translation Toolkit Raj Dabre Eiichiro Sumita 72 13 0 25 Aug 2021
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts Charangan Vasantharajan Uthayasanker Thayasivam 57 38 0 24 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision Zirui Wang Jiahui Yu Adams Wei Yu Zihang Dai Yulia Tsvetkov Yuan Cao VLM MLLM 153 799 0 24 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation Samuel Cahyawijaya 103 12 0 24 Aug 2021
C5T5: Controllable Generation of Organic Molecules with Transformers D. Rothchild Alex Tamkin Julie H. Yu Ujval Misra Joseph E. Gonzalez 101 29 0 23 Aug 2021
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks Weicheng Ma Kai Zhang Renze Lou Lili Wang Soroush Vosoughi 405 17 0 18 Aug 2021
Program Synthesis with Large Language Models Jacob Austin Augustus Odena Maxwell Nye Maarten Bosma Henryk Michalewski ... Ellen Jiang Carrie J. Cai Michael Terry Quoc V. Le Charles Sutton ELM AIMat ReCod ALM 218 2,023 0 16 Aug 2021
On Multi-Modal Learning of Editing Source Code Saikat Chakraborty Baishakhi Ray KELM 68 59 0 15 Aug 2021
How Optimal is Greedy Decoding for Extractive Question Answering? Or Castel Ori Ram Avia Efrat Omer Levy 84 4 0 12 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing Katikapalli Subramanyam Kalyan A. Rajasekharan S. Sangeetha VLM LM&MA 103 270 0 12 Aug 2021
The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation Minghan Wang Yuxia Wang Chang Su Jiaxin Guo Yingtao Zhang ... Shimin Tao Xingshan Zeng Liangyou Li Hao Yang Ying Qin 54 6 0 09 Aug 2021
Machine Translation of Low-Resource Indo-European Languages Wei-Rui Chen Muhammad Abdul-Mageed 39 3 0 08 Aug 2021
Facebook AI WMT21 News Translation Task Submission C. Tran Shruti Bhosale James Cross Philipp Koehn Sergey Edunov Angela Fan VLM 206 82 0 06 Aug 2021
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining Machel Reid Mikel Artetxe VLM 96 28 0 04 Aug 2021
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization J. Macoskey Grant P. Strimel Ariya Rastrow 57 19 0 03 Aug 2021
EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training Hao Zhou Pei Ke Zheng Zhang Yuxian Gu Yinhe Zheng ... Xiaocong Yang Bosi Wen Xiaoyan Zhu Minlie Huang Jie Tang 57 55 0 03 Aug 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs Andrew Jaegle Sebastian Borgeaud Jean-Baptiste Alayrac Carl Doersch Catalin Ionescu ... Olivier J. Hénaff M. Botvinick Andrew Zisserman Oriol Vinyals João Carreira MLLM VLM GNN 133 585 0 30 Jul 2021
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation Yuichi Oka Katsuhito Sudoh Satoshi Nakamura 95 4 0 29 Jul 2021
gaBERT -- an Irish Language Model James Barry Joachim Wagner Lauren Cassidy Alan Cowap Teresa Lynn Abigail Walsh Mícheál J. Ó Meachair Jennifer Foster 65 18 0 27 Jul 2021
CarneliNet: Neural Mixture Model for Automatic Speech Recognition A. Kalinov Somshubra Majumdar Jagadeesh Balam Boris Ginsburg MoE 44 3 0 22 Jul 2021
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech Duo Ma Nana Hou Van Tung Pham Haihua Xu Chng Eng Siong 69 22 0 22 Jul 2021
Comparison of Czech Transformers on Text Classification Tasks Jan Lehevcka Jan vSvec VLM 73 13 0 21 Jul 2021