ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.13267
  4. Cited By
BPE-Dropout: Simple and Effective Subword Regularization

BPE-Dropout: Simple and Effective Subword Regularization

29 October 2019
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
ArXivPDFHTML

Papers citing "BPE-Dropout: Simple and Effective Subword Regularization"

24 / 24 papers shown
Title
SEA-LION: Southeast Asian Languages in One Network
SEA-LION: Southeast Asian Languages in One Network
Raymond Ng
Thanh Ngan Nguyen
Yuli Huang
Ngee Chia Tai
Wai Yi Leong
...
David Ong Tat-Wee
B. Liu
William-Chandra Tjhi
Min Zhang
Leslie Teo
105
14
0
08 Apr 2025
SuperBPE: Space Travel for Language Models
SuperBPE: Space Travel for Language Models
Alisa Liu
J. Hayase
Valentin Hofmann
Sewoong Oh
Noah A. Smith
Yejin Choi
118
7
0
17 Mar 2025
Deterministic Reversible Data Augmentation for Neural Machine Translation
Deterministic Reversible Data Augmentation for Neural Machine Translation
Jiashu Yao
Heyan Huang
Zeming Liu
Yuhang Guo
142
0
0
21 Feb 2025
Number Cookbook: Number Understanding of Language Models and How to Improve It
Number Cookbook: Number Understanding of Language Models and How to Improve It
Haotong Yang
Yi Hu
Shijia Kang
Zhouchen Lin
Muhan Zhang
LRM
90
6
0
06 Nov 2024
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Buu Phan
Brandon Amos
Itai Gat
Marton Havasi
Matthew Muckley
Karen Ullrich
97
2
0
11 Oct 2024
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Yue Zhou
Henry Peng Zou
Barbara Di Eugenio
Yang Zhang
LRM
HILM
67
4
0
01 Jul 2024
Revisiting Low-Resource Neural Machine Translation: A Case Study
Revisiting Low-Resource Neural Machine Translation: A Case Study
Rico Sennrich
Biao Zhang
61
223
0
28 May 2019
A Call for Prudent Choice of Subword Merge Operations in Neural Machine
  Translation
A Call for Prudent Choice of Subword Merge Operations in Neural Machine Translation
Shuoyang Ding
Adithya Renduchintala
Kevin Duh
43
64
0
24 May 2019
Learning to Segment Inputs for NMT Favors Character-Level Processing
Learning to Segment Inputs for NMT Favors Character-Level Processing
Julia Kreutzer
Artem Sokolov
65
31
0
02 Oct 2018
FRAGE: Frequency-Agnostic Word Representation
FRAGE: Frequency-Agnostic Word Representation
Chengyue Gong
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
OOD
59
144
0
18 Sep 2018
Unsupervised Statistical Machine Translation
Unsupervised Statistical Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
45
248
0
04 Sep 2018
Revisiting Character-Based Neural Machine Translation with Capacity and
  Compression
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
64
96
0
29 Aug 2018
Finding Better Subword Segmentation for Neural Machine Translation
Finding Better Subword Segmentation for Neural Machine Translation
Yingting Wu
Hai Zhao
44
23
0
25 Jul 2018
Contextual Augmentation: Data Augmentation by Words with Paradigmatic
  Relations
Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations
Sosuke Kobayashi
84
613
0
16 May 2018
Subword Regularization: Improving Neural Network Translation Models with
  Multiple Subword Candidates
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
223
1,169
0
29 Apr 2018
A Call for Clarity in Reporting BLEU Scores
A Call for Clarity in Reporting BLEU Scores
Matt Post
150
2,988
0
23 Apr 2018
Analyzing Uncertainty in Neural Machine Translation
Analyzing Uncertainty in Neural Machine Translation
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
UQLM
90
274
0
28 Feb 2018
Unsupervised Machine Translation Using Monolingual Corpora Only
Unsupervised Machine Translation Using Monolingual Corpora Only
Guillaume Lample
Alexis Conneau
Ludovic Denoyer
MarcÁurelio Ranzato
SSL
110
1,097
0
31 Oct 2017
Mimicking Word Embeddings using Subword RNNs
Mimicking Word Embeddings using Subword RNNs
Yuval Pinter
Robert Guthrie
Jacob Eisenstein
65
159
0
21 Jul 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
701
131,652
0
12 Jun 2017
Data Augmentation for Low-Resource Neural Machine Translation
Data Augmentation for Low-Resource Neural Machine Translation
Marzieh Fadaee
Arianna Bisazza
Christof Monz
99
469
0
01 May 2017
Data Noising as Smoothing in Neural Network Language Models
Data Noising as Smoothing in Neural Network Language Models
Ziang Xie
Sida I. Wang
Jiwei Li
Daniel Levy
Allen Nie
Dan Jurafsky
A. Ng
54
238
0
07 Mar 2017
Orthographic Syllable as basic unit for SMT between Related Languages
Orthographic Syllable as basic unit for SMT between Related Languages
Anoop Kunchukuttan
P. Bhattacharyya
VLM
69
34
0
03 Oct 2016
Neural Machine Translation of Rare Words with Subword Units
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
215
7,745
0
31 Aug 2015
1