Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.13267
Cited By
BPE-Dropout: Simple and Effective Subword Regularization
29 October 2019
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BPE-Dropout: Simple and Effective Subword Regularization"
24 / 24 papers shown
Title
SEA-LION: Southeast Asian Languages in One Network
Raymond Ng
Thanh Ngan Nguyen
Yuli Huang
Ngee Chia Tai
Wai Yi Leong
...
David Ong Tat-Wee
B. Liu
William-Chandra Tjhi
Min Zhang
Leslie Teo
105
14
0
08 Apr 2025
SuperBPE: Space Travel for Language Models
Alisa Liu
J. Hayase
Valentin Hofmann
Sewoong Oh
Noah A. Smith
Yejin Choi
118
7
0
17 Mar 2025
Deterministic Reversible Data Augmentation for Neural Machine Translation
Jiashu Yao
Heyan Huang
Zeming Liu
Yuhang Guo
142
0
0
21 Feb 2025
Number Cookbook: Number Understanding of Language Models and How to Improve It
Haotong Yang
Yi Hu
Shijia Kang
Zhouchen Lin
Muhan Zhang
LRM
90
6
0
06 Nov 2024
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Buu Phan
Brandon Amos
Itai Gat
Marton Havasi
Matthew Muckley
Karen Ullrich
97
2
0
11 Oct 2024
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Yue Zhou
Henry Peng Zou
Barbara Di Eugenio
Yang Zhang
LRM
HILM
67
4
0
01 Jul 2024
Revisiting Low-Resource Neural Machine Translation: A Case Study
Rico Sennrich
Biao Zhang
61
223
0
28 May 2019
A Call for Prudent Choice of Subword Merge Operations in Neural Machine Translation
Shuoyang Ding
Adithya Renduchintala
Kevin Duh
43
64
0
24 May 2019
Learning to Segment Inputs for NMT Favors Character-Level Processing
Julia Kreutzer
Artem Sokolov
65
31
0
02 Oct 2018
FRAGE: Frequency-Agnostic Word Representation
Chengyue Gong
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
OOD
59
144
0
18 Sep 2018
Unsupervised Statistical Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
45
248
0
04 Sep 2018
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
64
96
0
29 Aug 2018
Finding Better Subword Segmentation for Neural Machine Translation
Yingting Wu
Hai Zhao
44
23
0
25 Jul 2018
Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations
Sosuke Kobayashi
84
613
0
16 May 2018
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
223
1,169
0
29 Apr 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
150
2,988
0
23 Apr 2018
Analyzing Uncertainty in Neural Machine Translation
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
UQLM
90
274
0
28 Feb 2018
Unsupervised Machine Translation Using Monolingual Corpora Only
Guillaume Lample
Alexis Conneau
Ludovic Denoyer
MarcÁurelio Ranzato
SSL
110
1,097
0
31 Oct 2017
Mimicking Word Embeddings using Subword RNNs
Yuval Pinter
Robert Guthrie
Jacob Eisenstein
65
159
0
21 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
701
131,652
0
12 Jun 2017
Data Augmentation for Low-Resource Neural Machine Translation
Marzieh Fadaee
Arianna Bisazza
Christof Monz
99
469
0
01 May 2017
Data Noising as Smoothing in Neural Network Language Models
Ziang Xie
Sida I. Wang
Jiwei Li
Daniel Levy
Allen Nie
Dan Jurafsky
A. Ng
54
238
0
07 Mar 2017
Orthographic Syllable as basic unit for SMT between Related Languages
Anoop Kunchukuttan
P. Bhattacharyya
VLM
69
34
0
03 Oct 2016
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
215
7,745
0
31 Aug 2015
1