ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.06226
  4. Cited By
SentencePiece: A simple and language independent subword tokenizer and
  detokenizer for Neural Text Processing

SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018
Taku Kudo
John Richardson
ArXivPDFHTML

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

23 / 1,923 papers shown
Title
Word-level Speech Recognition with a Letter to Word Encoder
Word-level Speech Recognition with a Letter to Word Encoder
R. Collobert
Awni Y. Hannun
Gabriel Synnaeve
3DV
27
4
0
10 Jun 2019
The University of Helsinki submissions to the WMT19 news translation
  task
The University of Helsinki submissions to the WMT19 news translation task
Aarne Talman
U. Sulubacak
Raúl Vázquez
Yves Scherrer
Sami Virpioja
Alessandro Raganato
A. Hurskainen
Jörg Tiedemann
VLM
19
7
0
10 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword
  Representations: A Multilingual Evaluation
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
13
35
0
04 Jun 2019
Hierarchical Transformers for Multi-Document Summarization
Hierarchical Transformers for Multi-Document Summarization
Yang Liu
Mirella Lapata
16
294
0
30 May 2019
An Investigation of Transfer Learning-Based Sentiment Analysis in
  Japanese
An Investigation of Transfer Learning-Based Sentiment Analysis in Japanese
Enkhbold Bataa
Joshua Wu
21
33
0
23 May 2019
Target Conditioned Sampling: Optimizing Data Selection for Multilingual
  Neural Machine Translation
Target Conditioned Sampling: Optimizing Data Selection for Multilingual Neural Machine Translation
Xinyi Wang
Graham Neubig
25
26
0
20 May 2019
Transformers with convolutional context for ASR
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
16
168
0
26 Apr 2019
Importance of Copying Mechanism for News Headline Generation
Importance of Copying Mechanism for News Headline Generation
I. Gusev
19
10
0
25 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
28
95
0
04 Apr 2019
A Large-Scale Multi-Length Headline Corpus for Analyzing
  Length-Constrained Headline Generation Model Evaluation
A Large-Scale Multi-Length Headline Corpus for Analyzing Length-Constrained Headline Generation Model Evaluation
Yuta Hitomi
Yuya Taguchi
Hideaki Tamori
Ko Kikuta
Jiro Nishitoba
Naoaki Okazaki
Kentaro Inui
Manabu Okumura
31
9
0
28 Mar 2019
Grammatical Error Correction and Style Transfer via Zero-shot
  Monolingual Translation
Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation
Elizaveta Korotkova
Agnes Luhtaru
Maksym Del
Krista Liin
Daiga Deksne
Mark Fishel
22
11
0
27 Mar 2019
ETNLP: a visual-aided systematic approach to select pre-trained
  embeddings for a downstream task
ETNLP: a visual-aided systematic approach to select pre-trained embeddings for a downstream task
Xuan-Son Vu
Thanh Tien Vu
Son N. Tran
Lili Jiang
26
6
0
11 Mar 2019
Non-Parametric Adaptation for Neural Machine Translation
Non-Parametric Adaptation for Neural Machine Translation
Ankur Bapna
Orhan Firat
24
73
0
28 Feb 2019
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Ozan Caglayan
Ramon Sanabria
Shruti Palaskar
Loïc Barrault
Florian Metze
29
25
0
09 Nov 2018
How2: A Large-scale Dataset for Multimodal Language Understanding
How2: A Large-scale Dataset for Multimodal Language Understanding
Ramon Sanabria
Ozan Caglayan
Shruti Palaskar
Desmond Elliott
Loïc Barrault
Lucia Specia
Florian Metze
VGen
MLLM
26
287
0
01 Nov 2018
Towards End-to-End Code-Switching Speech Recognition
Towards End-to-End Code-Switching Speech Recognition
Ne Luo
Dongwei Jiang
Shuaijiang Zhao
Caixia Gong
Wei Zou
Xiangang Li
21
47
0
31 Oct 2018
Learning Cross-Lingual Sentence Representations via a Multi-task
  Dual-Encoder Model
Learning Cross-Lingual Sentence Representations via a Multi-task Dual-Encoder Model
Muthuraman Chidambaram
Yinfei Yang
Daniel Cer
Steve Yuan
Yun-hsuan Sung
B. Strope
R. Kurzweil
SSL
21
123
0
30 Oct 2018
Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning
  Framework
Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning Framework
Shun Kiyono
Jun Suzuki
Kentaro Inui
33
8
0
13 Oct 2018
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary
  Prediction
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Kazuma Hashimoto
Yoshimasa Tsuruoka
11
7
0
05 Sep 2018
R-grams: Unsupervised Learning of Semantic Units in Natural Language
R-grams: Unsupervised Learning of Semantic Units in Natural Language
Ariel Ekgren
Amaru Cuba Gyllensten
Magnus Sahlgren
21
1
0
14 Aug 2018
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhehuai Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,750
0
26 Sep 2016
Impact of Power System Partitioning on the Efficiency of Distributed
  Multi-Step Optimization
Impact of Power System Partitioning on the Efficiency of Distributed Multi-Step Optimization
Dongliang Chen
A. Bucchiarone
Zhihan Lv
23
12
0
31 May 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,929
0
17 Aug 2015
Previous
123...373839