1804.10959
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
29 April 2018
Taku Kudo
Papers citing "Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates"
50 / 628 papers shown
Deep Learning Based Text Classification: A Comprehensive Review
Shervin Minaee
Nal Kalchbrenner
Min Zhang
Narjes Nikzad
M. Asgari-Chenaghlu
Jianfeng Gao
AILaw
VLM
AI4TS
116
1,116
0
06 Apr 2020
Finding the Optimal Vocabulary Size for Neural Machine Translation
Thamme Gowda
Jonathan May
33
3
0
05 Apr 2020
Give your Text Representation Models some Love: the Case for Basque
Rodrigo Agerri
Iñaki San Vicente
Jon Ander Campos
Ander Barrena
X. Saralegi
Aitor Soroa Etxabe
Eneko Agirre
59
63
0
31 Mar 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
85
122
0
28 Mar 2020
Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning
Stig-Arne Gronroos
Sami Virpioja
M. Kurimo
VLM
84
21
0
06 Mar 2020
AraBERT: Transformer-based Model for Arabic Language Understanding
Wissam Antoun
Fady Baly
Hazem M. Hajj
166
975
0
28 Feb 2020
Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction
Danushka Bollegala
Ryuichi Kiryo
K. Tsujino
Haruki Yukawa
23
7
0
25 Feb 2020
Semi-Supervised Speech Recognition via Local Prior Matching
Wei-Ning Hsu
Ann Lee
Gabriel Synnaeve
Awni Y. Hannun
SSL
138
31
0
24 Feb 2020
Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation
Alessandro Raganato
Yves Scherrer
Jörg Tiedemann
100
92
0
24 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
106
140
0
18 Feb 2020
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Kohei Matsuura
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
CVBM
39
13
0
16 Feb 2020
CBAG: Conditional Biomedical Abstract Generation
Justin Sybrandt
Ilya Safro
MedIm
AI4CE
53
8
0
13 Feb 2020
fastai: A Layered API for Deep Learning
Jeremy Howard
Sylvain Gugger
AI4CE
135
873
0
11 Feb 2020
Aligning the Pretraining and Finetuning Objectives of Language Models
Nuo Wang Pierse
Jing Lu
AI4CE
37
2
0
05 Feb 2020
Scaling Up Online Speech Recognition Using ConvNets
Vineel Pratap
Qiantong Xu
Jacob Kahn
Gilad Avidov
Tatiana Likhomanenko
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
242
39
0
27 Jan 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
327
2,058
0
18 Dec 2019
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Karthikeyan K
Zihan Wang
Stephen D. Mayhew
Dan Roth
LRM
98
340
0
17 Dec 2019
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
140
332
0
04 Dec 2019
A Subword Level Language Model for Bangla Language
Aisha Khatun
Anisur Rahman
Hemayet Ahmed Chowdhury
Md. Saiful Islam
A. Tasnim
34
4
0
15 Nov 2019
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
145
981
0
10 Nov 2019
Domain Robustness in Neural Machine Translation
Mathias Müller
Annette Rios Gonzales
Rico Sennrich
109
95
0
08 Nov 2019
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
239
6,618
0
05 Nov 2019
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
Guillaume Wenzek
Marie-Anne Lachaux
Alexis Conneau
Vishrav Chaudhary
Francisco Guzmán
Armand Joulin
Edouard Grave
124
658
0
01 Nov 2019
BPE-Dropout: Simple and Effective Subword Regularization
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
99
289
0
29 Oct 2019
Multitask Learning For Different Subword Segmentations In Neural Machine Translation
Tejas Srinivasan
Ramon Sanabria
Florian Metze
37
5
0
27 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
816
20,447
0
23 Oct 2019
Deja-vu: Double Feature Presentation and Iterated Loss in Deep Transformer Networks
Andros Tjandra
Chunxi Liu
Frank Zhang
Xiaohui Zhang
Yongqiang Wang
Gabriel Synnaeve
Satoshi Nakamura
Geoffrey Zweig
ViT
89
46
0
23 Oct 2019
End-to-End Speech Recognition: A review for the French Language
Florian Boyer
Jean-Luc Rouas
AI4TS
66
10
0
18 Oct 2019
Controlling Utterance Length in NMT-based Word Segmentation with Attention
Pierre Godard
Laurent Besacier
François Yvon
44
2
0
18 Oct 2019
Learning Invariant Representations of Social Media Users
Nicholas Andrews
M. Bishop
79
37
0
11 Oct 2019
Federated Learning of N-gram Language Models
Mingqing Chen
A. Suresh
Rajiv Mathews
Adeline Wong
Cyril Allauzen
F. Beaufays
Michael Riley
FedML
117
75
0
08 Oct 2019
Modeling Color Terminology Across Thousands of Languages
Arya D. McCarthy
Winston Wu
S. Cascianelli
Bill Watson
Rita Cucchiara
50
11
0
03 Oct 2019
Regressing Word and Sentence Embeddings for Regularization of Neural Machine Translation
Inigo Jauregi Unanue
E. Z. Borzeshi
Massimo Piccardi
AI4TS
44
0
0
30 Sep 2019
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages
Yi Zhu
Benjamin Heinzerling
Ivan Vulić
Michael Strube
Roi Reichart
Anna Korhonen
65
20
0
26 Sep 2019
Self-Training for End-to-End Speech Recognition
Jacob Kahn
Ann Lee
Awni Y. Hannun
SSL
69
236
0
19 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
96
73
0
18 Sep 2019
Subword ELMo
Jiangtong Li
Hai Zhao
Z. Li
Wei Bi
Xiaojiang Liu
20
1
0
18 Sep 2019
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation
Chengyi Wang
Yu-Huan Wu
Shujie Liu
Zhenglu Yang
M. Zhou
89
84
0
17 Sep 2019
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
Julian Martin Eisenschlos
Sebastian Ruder
Piotr Czapla
Marcin Kardas
Sylvain Gugger
Jeremy Howard
69
99
0
10 Sep 2019
Neural Machine Translation with Byte-Level Subwords
Changhan Wang
Kyunghyun Cho
Jiatao Gu
96
178
0
07 Sep 2019
Subword Language Model for Query Auto-Completion
Gyuwan Kim
36
15
0
02 Sep 2019
Learning a Multi-Domain Curriculum for Neural Machine Translation
Wei Wang
Ye Tian
Jiquan Ngiam
Yinfei Yang
Isaac Caswell
Zarana Parekh
82
39
0
28 Aug 2019
Parsimonious Morpheme Segmentation with an Application to Enriching Word Embeddings
Ahmed El-Kishky
Frank F. Xu
Aston Zhang
Jiawei Han
49
4
0
18 Aug 2019
Transformer-based Automatic Post-Editing with a Context-Aware Encoding Approach for Multi-Source Inputs
WonKee Lee
Junsuk Park
Byung-Hyun Go
Jong-Hyeok Lee
KELM
30
3
0
15 Aug 2019
IMS-Speech: A Speech to Text Tool
Pavel Denisov
Ngoc Thang Vu
80
11
0
13 Aug 2019
A Baseline Neural Machine Translation System for Indian Languages
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
107
17
0
29 Jul 2019
Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task
Alexandre Berard
Ioan Calapodescu
Claude Roux
VLM
80
59
0
15 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation Task
Rachel Bawden
Nikolay Bogoychev
Ulrich Germann
Roman Grundkiewicz
Faheem Kirefu
Antonio Valerio Miceli Barone
Alexandra Birch
59
32
0
12 Jul 2019
NTT's Machine Translation Systems for WMT19 Robustness Task
Soichiro Murakami
Makoto Morishita
Tsutomu Hirao
Masaaki Nagata
VLM
49
9
0
09 Jul 2019
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
80
50
0
13 Jun 2019