1804.10959
Cited By
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
29 April 2018
Taku Kudo
Papers citing
"Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates"
50 / 617 papers shown
Title
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
14
48
0
11 Aug 2020
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition
Egor Lakomkin
Jahn Heymann
Ilya Sklyar
Simon Wiesler
25
8
0
10 Aug 2020
KR-BERT: A Small-Scale Korean-Specific Language Model
Sangah Lee
Hansol Jang
Yunmee Baik
Suzi Park
Hyopil Shin
22
51
0
10 Aug 2020
A Survey of Orthographic Information in Machine Translation
Bharathi Raja Chakravarthi
P. Rani
Mihael Arcan
John P. McCrae
17
33
0
04 Aug 2020
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition
Ludwig Kurzinger
Edgar Ricardo Chavez Rosas
Lujun Li
Tobias Watzel
Gerhard Rigoll
AAML
19
4
0
21 Jul 2020
Drinking from a Firehose: Continual Learning with Web-scale Natural Language
Hexiang Hu
Ozan Sener
Fei Sha
V. Koltun
CLL
35
27
0
18 Jul 2020
Contrastive Code Representation Learning
Paras Jain
Ajay Jain
Tianjun Zhang
Pieter Abbeel
Joseph E. Gonzalez
Ion Stoica
SSL
DRL
34
149
0
09 Jul 2020
Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network
Tadashi Ogura
A. Magassouba
K. Sugiura
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
Hisashi Kawai
24
11
0
09 Jul 2020
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters
Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
11
143
0
06 Jul 2020
LMVE at SemEval-2020 Task 4: Commonsense Validation and Explanation using Pretraining Language Model
Shilei Liu
Yu Guo
Bochao Li
Feiliang Ren
LRM
26
4
0
06 Jul 2020
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Pavel Denisov
Ngoc Thang Vu
17
30
0
03 Jul 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
8
74
0
19 Jun 2020
On the Multi-Property Extraction and Beyond
Tomasz Dwojak
Michal Pietruszka
Łukasz Borchmann
Filip Graliński
Jakub Chłędowski
8
0
0
15 Jun 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
14
199
0
26 May 2020
A systematic comparison of grapheme-based vs. phoneme-based label units for encoder-decoder-attention models
Mohammad Zeineldeen
Albert Zeyer
Wei Zhou
T. Ng
Ralf Schluter
Hermann Ney
22
2
0
19 May 2020
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Frank Zhang
Yongqiang Wang
Xiaohui Zhang
Chunxi Liu
Yatharth Saraf
Geoffrey Zweig
12
20
0
19 May 2020
Are All Languages Created Equal in Multilingual BERT?
Shijie Wu
Mark Dredze
25
316
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
24
7
0
17 May 2020
An Evaluation of Recent Neural Sequence Tagging Models in Turkish Named Entity Recognition
Gizem Aras
Didem Makaroglu
Seniz Demir
Altan Cakir
6
30
0
14 May 2020
The Unstoppable Rise of Computational Linguistics in Deep Learning
James Henderson
AI4CE
6
28
0
13 May 2020
On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
Chaojun Wang
Rico Sennrich
22
155
0
07 May 2020
2kenize: Tying Subword Sequences for Chinese Script Conversion
Pranav A
Isabelle Augenstein
27
1
0
07 May 2020
A Multi-Perspective Architecture for Semantic Code Search
Rajarshi Haldar
Lingfei Wu
Jinjun Xiong
J. Hockenmaier
23
55
0
06 May 2020
Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation
Xuanli He
Gholamreza Haffari
Mohammad Norouzi
21
45
0
03 May 2020
Evaluating Robustness to Input Perturbations for Neural Machine Translation
Xing Niu
Prashant Mathur
Georgiana Dinu
Yaser Al-Onaizan
AAML
22
64
0
01 May 2020
A Study in Improving BLEU Reference Coverage with Diverse Automatic Paraphrasing
Rachel Bawden
Biao Zhang
Lisa Yankovskaya
Andre Tattar
Matt Post
6
1
0
30 Apr 2020
Data and Representation for Turkish Natural Language Inference
Emrah Budur
Rıza Özçelik
Tunga Güngör
Christopher Potts
14
1
0
30 Apr 2020
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding
Samson Tan
Chenyu You
L. Varshney
Min-Yen Kan
17
34
0
30 Apr 2020
Vocabulary Adaptation for Distant Domain Adaptation in Neural Machine Translation
Shoetsu Sato
Jin Sakuma
Naoki Yoshinaga
Masashi Toyoda
M. Kitsuregawa
22
3
0
30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions
Sharan Narang
Colin Raffel
Katherine Lee
Adam Roberts
Noah Fiedel
Karishma Malkan
16
197
0
30 Apr 2020
AxCell: Automatic Extraction of Results from Machine Learning Papers
Marcin Kardas
Piotr Czapla
Pontus Stenetorp
Sebastian Ruder
Sebastian Riedel
Ross Taylor
Robert Stojnic
6
74
0
29 Apr 2020
Adversarial Subword Regularization for Robust Neural Machine Translation
Jungsoo Park
Mujeen Sung
Jinhyuk Lee
Jaewoo Kang
22
8
0
29 Apr 2020
All Word Embeddings from One Embedding
Sho Takase
Sosuke Kobayashi
9
10
0
25 Apr 2020
Multiple Segmentations of Thai Sentences for Neural Machine Translation
Alberto Poncelas
Wichaya Pidchamook
Chao-Hong Liu
J. Hadley
Andy Way
8
7
0
23 Apr 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
31
161
0
21 Apr 2020
Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation
Stig-Arne Gronroos
Sami Virpioja
M. Kurimo
28
6
0
08 Apr 2020
Byte Pair Encoding is Suboptimal for Language Model Pretraining
Kaj Bostrom
Greg Durrett
16
200
0
07 Apr 2020
Deep Learning Based Text Classification: A Comprehensive Review
Shervin Minaee
Nal Kalchbrenner
Min Zhang
Narjes Nikzad
M. Asgari-Chenaghlu
Jianfeng Gao
AILaw
VLM
AI4TS
21
1,090
0
06 Apr 2020
Finding the Optimal Vocabulary Size for Neural Machine Translation
Thamme Gowda
Jonathan May
14
3
0
05 Apr 2020
Give your Text Representation Models some Love: the Case for Basque
Rodrigo Agerri
Iñaki San Vicente
Jon Ander Campos
Ander Barrena
X. Saralegi
Aitor Soroa Etxabe
Eneko Agirre
14
61
0
31 Mar 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
8
113
0
28 Mar 2020
Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning
Stig-Arne Gronroos
Sami Virpioja
M. Kurimo
VLM
34
21
0
06 Mar 2020
AraBERT: Transformer-based Model for Arabic Language Understanding
Wissam Antoun
Fady Baly
Hazem M. Hajj
46
939
0
28 Feb 2020
Language-Independent Tokenisation Rivals Language-Specific Tokenisation for Word Similarity Prediction
Danushka Bollegala
Ryuichi Kiryo
K. Tsujino
Haruki Yukawa
13
7
0
25 Feb 2020
Semi-Supervised Speech Recognition via Local Prior Matching
Wei-Ning Hsu
Ann Lee
Gabriel Synnaeve
Awni Y. Hannun
SSL
27
31
0
24 Feb 2020
Fixed Encoder Self-Attention Patterns in Transformer-Based Machine Translation
Alessandro Raganato
Yves Scherrer
Jörg Tiedemann
32
92
0
24 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Kohei Matsuura
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
CVBM
13
13
0
16 Feb 2020
CBAG: Conditional Biomedical Abstract Generation
Justin Sybrandt
Ilya Safro
MedIm
AI4CE
22
8
0
13 Feb 2020
fastai: A Layered API for Deep Learning
Jeremy Howard
Sylvain Gugger
AI4CE
20
857
0
11 Feb 2020