Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.06226
Cited By
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
19 August 2018
Taku Kudo
John Richardson
Re-assign community
ArXiv (abs)
PDF
HTML
Github (10925★)
Papers citing
"SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"
50 / 1,950 papers shown
Title
Assessing Evaluation Metrics for Speech-to-Speech Translation
Elizabeth Salesky
Julian Mäder
Severin Klinger
74
15
0
26 Oct 2021
Improving Non-autoregressive Generation with Mixup Training
Ting Jiang
Shaohan Huang
Zihan Zhang
Deqing Wang
Fuzhen Zhuang
Furu Wei
Haizhen Huang
Liangjie Zhang
Qi Zhang
38
8
0
21 Oct 2021
Multilingual Unsupervised Neural Machine Translation with Denoising Adapters
Ahmet Üstün
Alexandre Berard
Laurent Besacier
Matthias Gallé
84
47
0
20 Oct 2021
Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach
Mun-Hak Lee
Joon‐Hyuk Chang
27
2
0
20 Oct 2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Ankur Bapna
Yu-An Chung
Na Wu
Anmol Gulati
Ye Jia
J. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Yu Zhang
VLM
137
96
0
20 Oct 2021
Ensemble ALBERT on SQuAD 2.0
Shilun Li
Renee Li
Veronica Peng
MoE
27
6
0
19 Oct 2021
Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement
HyoJung Han
Seokchan Ahn
Yoonjung Choi
Insoo Chung
Sangha Kim
Kyunghyun Cho
63
6
0
18 Oct 2021
EmbRace: Accelerating Sparse Communication for Distributed Training of NLP Neural Networks
Shengwei Li
Zhiquan Lai
Dongsheng Li
Yiming Zhang
Xiangyu Ye
Yabo Duan
FedML
61
3
0
18 Oct 2021
Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages
C.M. Downey
Shannon Drizin
Levon Haroutunian
Shivin Thukral
48
2
0
16 Oct 2021
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation
Danni Liu
Changhan Wang
Hongyu Gong
Xutai Ma
Yun Tang
J. Pino
98
4
0
15 Oct 2021
Why don't people use character-level machine translation?
Jindrich Libovický
Helmut Schmid
Alexander Fraser
135
29
0
15 Oct 2021
Unifying Cross-lingual Summarization and Machine Translation with Compression Rate
Yu Bai
Heyan Huang
Kai Fan
Yang Gao
Yi-Bo Zhu
Jiaao Zhan
Zewen Chi
Boxing Chen
49
9
0
15 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
422
1,115
0
13 Oct 2021
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
99
15
0
13 Oct 2021
Automated Essay Scoring Using Transformer Models
Sabrina Ludwig
Christian W. F. Mayer
Christopher Hansen
Kerstin Eilers
Steffen Brandt
85
40
0
13 Oct 2021
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Nana Hou
Chen Chen
Chng Eng Siong
75
41
0
11 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
154
172
0
11 Oct 2021
Towards Learning (Dis)-Similarity of Source Code from Program Contrasts
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Alessandro Morari
Baishakhi Ray
Saikat Chakraborty
82
36
0
08 Oct 2021
Machine Translation Verbosity Control for Automatic Dubbing
Surafel Melaku Lakew
Marcello Federico
Yue Wang
Cuong Hoang
Yogesh Virkar
Roberto Barra-Chicote
Robert Enyedi
61
24
0
08 Oct 2021
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution
Yangyang Shi
Chunyang Wu
Dilin Wang
Alex Xiao
Jay Mahadeokar
...
Ke Li
Yuan Shangguan
Varun K. Nagaraja
Ozlem Kalinli
M. Seltzer
143
15
0
07 Oct 2021
A Comparative Study of Transformer-Based Language Models on Extractive Question Answering
Kate Pearce
Tiffany Zhan
Aneesh Komanduri
J. Zhan
ELM
87
34
0
07 Oct 2021
How BPE Affects Memorization in Transformers
Eugene Kharitonov
Marco Baroni
Dieuwke Hupkes
247
33
0
06 Oct 2021
Word Acquisition in Neural Language Models
Tyler A. Chang
Benjamin Bergen
85
40
0
05 Oct 2021
Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark
Joel Niklaus
Ilias Chalkidis
Matthias Sturmer
ELM
AILaw
67
70
0
02 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe
Aylin Caliskan
125
51
0
01 Oct 2021
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Yichong Leng
Xu Tan
Rui Wang
Linchen Zhu
Jin Xu
...
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
129
42
0
29 Sep 2021
EdinSaar@WMT21: North-Germanic Low-Resource Multilingual NMT
Svetlana Tchistiakova
Jesujoba Oluwadara Alabi
Koel Dutta Chowdhury
Sourav Dutta
Dana Ruiter
VLM
57
6
0
29 Sep 2021
Improving Arabic Diacritization by Learning to Diacritize and Translate
Brian Thompson
A. Alshehri
67
10
0
29 Sep 2021
PicTalky: Augmentative and Alternative Communication Software for Language Developmental Disabilities
Chanjun Park
Yoonna Jang
Seolhwa Lee
Jaehyung Seo
Kisu Yang
Heuiseok Lim
23
6
0
27 Sep 2021
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates
Hirofumi Inaguma
Siddharth Dalmia
Brian Yan
Shinji Watanabe
99
11
0
27 Sep 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta
Yanping Huang
Ankur Bapna
M. Krikun
Dmitry Lepikhin
Minh-Thang Luong
Orhan Firat
MoE
257
112
0
24 Sep 2021
Cross-Lingual Language Model Meta-Pretraining
Zewen Chi
Heyan Huang
Luyang Liu
Yu Bai
Xian-Ling Mao
LRM
210
0
0
23 Sep 2021
Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
Biao Zhang
Ankur Bapna
Melvin Johnson
A. Dabirmoghaddam
N. Arivazhagan
Orhan Firat
78
14
0
21 Sep 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
246
375
0
21 Sep 2021
BERTweetFR : Domain Adaptation of Pre-Trained Language Models for French Tweets
Yanzhu Guo
Virgile Rennard
Christos Xypolopoulos
Michalis Vazirgiannis
VLM
AI4CE
89
19
0
21 Sep 2021
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese
Nguyen Luong Tran
Duong Minh Le
Dat Quoc Nguyen
70
55
0
20 Sep 2021
CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task
Josef Jon
Michal Novák
João Paulo Aires
Duvsan Varivs
Ondrej Bojar
40
3
0
20 Sep 2021
CUNI systems for WMT21: Terminology translation Shared Task
Josef Jon
Michal Novák
João Paulo Aires
Duvsan Varivs
Ondrej Bojar
47
4
0
20 Sep 2021
Augmenting semantic lexicons using word embeddings and transfer learning
Thayer Alshaabi
C. V. Oort
M. Fudolig
M. V. Arnold
C. Danforth
P. Dodds
61
4
0
18 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mañke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
277
156
0
17 Sep 2021
Translation Transformers Rediscover Inherent Data Domains
Maksym Del
Elizaveta Korotkova
Mark Fishel
43
7
0
16 Sep 2021
Challenges in Detoxifying Language Models
Johannes Welbl
Amelia Glaese
J. Uesato
Sumanth Dathathri
John F. J. Mellor
Lisa Anne Hendricks
Kirsty Anderson
Pushmeet Kohli
Ben Coppin
Po-Sen Huang
LM&MA
313
195
0
15 Sep 2021
Learning When to Translate for Streaming Speech
Qianqian Dong
Yaoming Zhu
Mingxuan Wang
Lei Li
100
30
0
15 Sep 2021
The ELITR ECA Corpus
Philip Williams
Barry Haddow
33
4
0
15 Sep 2021
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training
Bo Zheng
Li Dong
Shaohan Huang
Saksham Singhal
Wanxiang Che
Ting Liu
Xia Song
Furu Wei
VLM
82
22
0
15 Sep 2021
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
M. Yarmohammadi
Shijie Wu
Marc Marone
Haoran Xu
Seth Ebner
...
Craig Harman
Kenton W. Murray
Aaron Steven White
Mark Dredze
Benjamin Van Durme
76
30
0
14 Sep 2021
Evaluating Transferability of BERT Models on Uralic Languages
Judit Ács
Dániel Lévai
András Kornai
61
6
0
13 Sep 2021
KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation
Marzieh S. Tahaei
Ella Charlaix
V. Nia
A. Ghodsi
Mehdi Rezagholizadeh
110
22
0
13 Sep 2021
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Momchil Hardalov
Arnav Arora
Preslav Nakov
Isabelle Augenstein
88
63
0
13 Sep 2021
Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet
Xingwei He
Victor O.K. Li
BDL
282
24
0
13 Sep 2021
Previous
1
2
3
...
26
27
28
...
37
38
39
Next